Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Tasks
Libraries
Datasets
Languages
Licenses
Other
1
Inference Providers
Select all
Fireworks
Novita
Together AI
Nebius AI Studio
fal
Replicate
Cerebras
Cohere
Nscale
Hyperbolic
SambaNova
HF Inference API
Misc
Reset Misc
Inference Endpoints
text-generation-inference
image-text-to-text
custom_code
4-bit precision
Merge
8-bit precision
Eval Results
Mixture of Experts
Carbon Emissions
Misc with no match
text-embeddings-inference
Apply filters
Models
10,791
Full-text search
Edit filters
Sort: Trending
Active filters:
image-text-to-text
Clear all
Salesforce/blip-image-captioning-large
Image-to-Text
•
Updated
Feb 3
•
1.86M
•
1.34k
vikhyatk/moondream2
Image-Text-to-Text
•
Updated
Apr 14
•
367k
•
1.14k
mistralai/Pixtral-12B-2409
Image-Text-to-Text
•
Updated
Dec 26, 2024
•
•
643
unsloth/medgemma-4b-it-GGUF
Image-Text-to-Text
•
Updated
10 days ago
•
16.5k
•
16
ngxson/Devstral-Small-Vision-2505-GGUF
Image-Text-to-Text
•
Updated
9 days ago
•
1.03k
•
22
allenai/olmOCR-7B-0225-preview
Image-Text-to-Text
•
Updated
Feb 25
•
368k
•
654
google/gemma-3-12b-it-qat-q4_0-gguf
Image-Text-to-Text
•
Updated
Apr 11
•
109k
•
136
google/gemma-3-4b-it-qat-q4_0-gguf
Image-Text-to-Text
•
Updated
Apr 11
•
8.27k
•
152
openbmb/AgentCPM-GUI
Image-Text-to-Text
•
Updated
3 days ago
•
1.07k
•
115
daniel3303/QwenStoryteller
Image-to-Text
•
Updated
14 days ago
•
113
•
7
One-RL-to-See-Them-All/Orsta-7B
Image-Text-to-Text
•
Updated
4 days ago
•
146
•
6
mlabonne/gemma-3-27b-it-qat-abliterated
Image-Text-to-Text
•
Updated
about 21 hours ago
•
1
•
6
Salesforce/blip2-opt-2.7b
Image-Text-to-Text
•
Updated
Feb 3
•
860k
•
375
microsoft/Florence-2-large
Image-Text-to-Text
•
Updated
Dec 8, 2024
•
635k
•
1.56k
unsloth/gemma-3-4b-it-GGUF
Image-Text-to-Text
•
Updated
18 days ago
•
51k
•
99
meta-llama/Llama-Guard-4-12B
Image-Text-to-Text
•
Updated
about 1 month ago
•
60.4k
•
37
kingabzpro/medgemma-brain-cancer
Image-Text-to-Text
•
Updated
3 days ago
•
5
mlabonne/gemma-3-12b-it-qat-abliterated
Image-Text-to-Text
•
Updated
about 21 hours ago
•
2
•
5
liuhaotian/llava-v1.5-7b
Image-Text-to-Text
•
Updated
May 8, 2024
•
1.02M
•
464
llava-hf/LLaVA-NeXT-Video-7B-hf
Video-Text-to-Text
•
Updated
Jan 27
•
118k
•
99
stepfun-ai/GOT-OCR2_0
Image-Text-to-Text
•
Updated
Feb 4
•
151k
•
1.48k
Qwen/Qwen2-VL-72B-Instruct
Image-Text-to-Text
•
Updated
Feb 6
•
40.4k
•
•
301
Minthy/ToriiGate-v0.4-7B
Image-Text-to-Text
•
Updated
Jan 22
•
752
•
44
HuggingFaceTB/SmolVLM-500M-Instruct
Image-Text-to-Text
•
Updated
Apr 8
•
39.8k
•
150
Qwen/Qwen2.5-VL-7B-Instruct-AWQ
Image-Text-to-Text
•
Updated
Apr 6
•
134k
•
69
unsloth/gemma-3-12b-it-GGUF
Image-Text-to-Text
•
Updated
18 days ago
•
52k
•
75
mlabonne/gemma-3-27b-it-abliterated-GGUF
Image-Text-to-Text
•
Updated
Apr 1
•
15.2k
•
87
google/gemma-3-27b-it-qat-q4_0-gguf
Image-Text-to-Text
•
Updated
Apr 11
•
14.8k
•
291
Tesslate/Synthia-S1-27b
Image-Text-to-Text
•
Updated
Apr 9
•
385
•
•
73
meta-llama/Llama-4-Scout-17B-16E
Image-Text-to-Text
•
Updated
Apr 9
•
32.9k
•
170
Previous
1
2
3
4
...
100
Next