- PaddleOCR-VL: Boosting Multilingual Document Parsing via a 0.9B Ultra-Compact Vision-Language Model
  Paper • 2510.14528 • Published • 111
- deepseek-ai/DeepSeek-OCR
  Image-Text-to-Text • 3B • Updated • 3.15M • 3.07k
- PaddlePaddle/PaddleOCR-VL
  Image-Text-to-Text • 1.0B • Updated • 12.3k • 1.48k
- nanonets/Nanonets-OCR2-3B
  Image-Text-to-Text • 4B • Updated • 83.5k • 476
www.minds.com/jelyazko/
21world
AI & ML interests
Who not work will not Eat
Recent Activity
updated a collection about 19 hours ago
18\ other models
upvoted a paper about 20 hours ago
ShowUI-π: Flow-based Generative Models as GUI Dexterous Hands
updated a collection 1 day ago
32\ video creation