Hugging Face
Models: 8,961
Sort: Trending
Active filters: image-to-text
JackChew/Qwen2-VL-2B-OCR • Image-to-Text • 2B • Updated Dec 29, 2024 • 2.08k • 11
HuggingFaceTB/SmolVLM-256M-Instruct • Image-Text-to-Text • 0.3B • Updated Apr 8 • 69k • 254
Qwen/Qwen2.5-VL-3B-Instruct • Image-Text-to-Text • 4B • Updated Apr 6 • 3.73M • 455
Qwen/Qwen2.5-VL-72B-Instruct • Image-Text-to-Text • 73B • Updated Jun 6 • 345k • 512
nvidia/Cosmos-Reason1-7B • Image-to-Text • 8B • Updated Jun 11 • 192k • 109
VLM2Vec/VLM2Vec-V2.0 • Image-to-Text • Updated 3 days ago • 2.51k • 7
Hcompany/Holo1-7B • Image-Text-to-Text • 8B • Updated Jun 10 • 16.5k • 217
BAAI/RoboBrain2.0-7B • Robotics • 8B • Updated 12 days ago • 5.01k • 92
HelloKKMe/GTA1-72B • Image-to-Text • 73B • Updated 8 days ago • 185 • 4
allura-org/MS3.2-24b-Angel • Image-to-Text • 24B • Updated 8 days ago • 51 • 6
prithivMLmods/Megalodon-OCR-Sync-0713 • Image-Text-to-Text • 4B • Updated 2 days ago • 20 • 3
microsoft/trocr-base-handwritten • Image-to-Text • 0.3B • Updated Feb 11 • 267k • 417
google/deplot • Visual Question Answering • 0.3B • Updated Sep 6, 2023 • 13.2k • 303
techietrader/captcha_ocr • Image-to-Text • Updated Jun 6, 2024 • 17
Qwen/Qwen2-VL-2B-Instruct • Image-Text-to-Text • 2B • Updated Jan 12 • 1.14M • 433
Alibaba-NLP/gme-Qwen2-VL-2B-Instruct • Sentence Similarity • 2B • Updated Jun 9 • 10.1k • 89
Qwen/Qwen2.5-VL-32B-Instruct • Image-Text-to-Text • 33B • Updated Apr 14 • 430k • 404
Video-R1/Video-R1-7B • Video-Text-to-Text • 8B • Updated May 12 • 5.72k • 12
remyxai/SpaceThinker-Qwen2.5VL-3B • Image-Text-to-Text • 4B • Updated 25 days ago • 4.07k • 24
remyxai/SpaceOm • Image-Text-to-Text • 4B • Updated 10 days ago • 1.92k • 11
csfufu/Revisual-R1-final • Image-Text-to-Text • 8B • Updated 2 days ago • 1.49k • 7
Hcompany/Holo1-3B • Image-Text-to-Text • 4B • Updated Jun 10 • 7.1k • 82
numind/NuExtract-2.0-4B • Image-Text-to-Text • 4B • Updated 21 days ago • 1.25k • 10
XiaomiMiMo/MiMo-VL-7B-RL • Image-Text-to-Text • 8B • Updated Jun 7 • 19.1k • 155
PaddlePaddle/PP-OCRv5_server_det • Image-to-Text • Updated 20 days ago • 52.4k • 9
prithivMLmods/Camel-Doc-OCR-062825 • Image-Text-to-Text • 8B • Updated 19 days ago • 1.4k • 10
prithivMLmods/WR30a-Deep-7B-0711 • Image-Text-to-Text • 8B • Updated 5 days ago • 21 • 2
prithivMLmods/Lh41-1042-Magellanic-7B-0711 • Image-Text-to-Text • 8B • Updated 5 days ago • 25 • 2
prithivMLmods/Perseus-Doc-vl-0712 • Image-Text-to-Text • 8B • Updated 3 days ago • 11 • 2
microsoft/trocr-base-printed • Image-to-Text • 0.3B • Updated May 27, 2024 • 238k • 180