Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Main
Tasks
1
Libraries
Languages
Licenses
Other
Tasks
Reset Tasks
Text Generation
Any-to-Any
Image-Text-to-Text
Image-to-Text
Image-to-Image
Text-to-Image
Text-to-Video
Text-to-Speech
+ 42
Parameters
Reset Parameters
< 1B
6B
12B
32B
128B
> 500B
< 1B
> 500B
Libraries
PyTorch
google-tensorflow
TensorFlow
JAX
Transformers
Diffusers
Safetensors
ONNX
GGUF
Transformers.js
MLX
Keras
+ 41
Apps
vLLM
TGI
llama.cpp
MLX LM
LM Studio
Ollama
Jan
+ 13
Inference Providers
Cerebras
Nebius AI
Fireworks
Together AI
SambaNova
Novita
Groq
Nscale
+ 6
Apply filters
Models
6,455
Full-text search
Edit filters
Sort: Trending
Active filters:
image-text-to-text
Clear all
HuggingFaceM4/idefics2-8b
Image-Text-to-Text
•
8B
•
Updated
Oct 14, 2024
•
37.8k
•
614
OpenGVLab/InternVL-Chat-V1-5
Image-Text-to-Text
•
26B
•
Updated
Mar 25
•
2.04k
•
415
xtuner/llava-llama-3-8b
Image-Text-to-Text
•
8B
•
Updated
Apr 26, 2024
•
26
•
38
google/paligemma-3b-ft-nlvr2-448
Image-Text-to-Text
•
3B
•
Updated
Jul 19, 2024
•
2.54k
•
1
openbmb/MiniCPM-Llama3-V-2_5
Image-Text-to-Text
•
9B
•
Updated
Jan 15
•
73.7k
•
1.4k
openvla/openvla-7b
Image-Text-to-Text
•
8B
•
Updated
Sep 16, 2024
•
366k
•
143
microsoft/Florence-2-base
Image-Text-to-Text
•
0.2B
•
Updated
Aug 4
•
942k
•
293
microsoft/Florence-2-large-ft
Image-Text-to-Text
•
0.8B
•
Updated
Aug 4
•
27.4k
•
366
microsoft/Florence-2-base-ft
Image-Text-to-Text
•
0.2B
•
Updated
Aug 4
•
41.1k
•
131
stepfun-ai/GOT-OCR2_0
Image-Text-to-Text
•
0.7B
•
Updated
Feb 4
•
29.1k
•
1.52k
meta-llama/Llama-Guard-3-11B-Vision
Image-Text-to-Text
•
11B
•
Updated
Nov 18, 2024
•
2.77k
•
63
allenai/Molmo-7B-D-0924
Image-Text-to-Text
•
8B
•
Updated
Apr 4
•
32.5k
•
546
rhymes-ai/Aria
Image-Text-to-Text
•
25B
•
Updated
Apr 23
•
60.3k
•
635
hyf015/Vinci-8B-base
Image-Text-to-Text
•
8B
•
Updated
Oct 18, 2024
•
8
•
2
google/paligemma2-3b-mix-224
Image-Text-to-Text
•
3B
•
Updated
Feb 7
•
12.2k
•
35
google/paligemma2-3b-pt-224
Image-Text-to-Text
•
3B
•
Updated
Dec 5, 2024
•
626k
•
157
ndkhanh95/Paligemma
Image-Text-to-Text
•
3B
•
Updated
Dec 24, 2024
•
11
•
1
deepseek-ai/deepseek-vl2-tiny
Image-Text-to-Text
•
3B
•
Updated
Dec 18, 2024
•
75.7k
•
213
nvidia/Eagle2-2B
Image-Text-to-Text
•
2B
•
Updated
Apr 27
•
1.37k
•
30
nvidia/Eagle2-1B
Image-Text-to-Text
•
1B
•
Updated
Apr 27
•
3.24k
•
25
ByteDance-Seed/UI-TARS-2B-SFT
Image-Text-to-Text
•
2B
•
Updated
Jan 25
•
14.9k
•
26
ByteDance-Seed/UI-TARS-7B-DPO
Image-Text-to-Text
•
8B
•
Updated
Jan 25
•
5.38k
•
220
ByteDance-Seed/UI-TARS-72B-DPO
Image-Text-to-Text
•
73B
•
Updated
Jan 25
•
3.58k
•
143
mlx-community/Qwen2.5-VL-72B-Instruct-3bit
Image-Text-to-Text
•
10B
•
Updated
Feb 25
•
57
•
5
HuanjinYao/Mulberry_qwen2vl_7b
Image-Text-to-Text
•
8B
•
Updated
Feb 4
•
207
•
2
AIDC-AI/Ovis2-8B
Image-Text-to-Text
•
9B
•
Updated
30 days ago
•
81.5k
•
74
HuggingFaceTB/SmolVLM2-500M-Video-Instruct
Image-Text-to-Text
•
0.5B
•
Updated
Apr 8
•
121k
•
93
Fancy-MLLM/R1-Onevision-7B
Image-Text-to-Text
•
8B
•
Updated
Feb 25
•
1.38k
•
44
Qwen/Qwen2.5-VL-3B-Instruct-AWQ
Image-Text-to-Text
•
1B
•
Updated
Apr 6
•
439k
•
56
huihui-ai/Qwen2.5-VL-3B-Instruct-abliterated
Image-Text-to-Text
•
4B
•
Updated
Jul 14
•
1.62k
•
17
Previous
1
...
3
4
5
6
7
...
100
Next