Hugging Face
Models: 905
Sort: Trending
Active filters: multimodal
lusxvr/nanoVLM-222M • Image-Text-to-Text • Updated 6 days ago • 1.25k • 66
Qwen/Qwen2.5-Omni-3B • Any-to-Any • Updated 14 days ago • 28.3k • 207
openbmb/AgentCPM-GUI • Image-Text-to-Text • Updated about 20 hours ago • 13 • 25
ByteDance-Seed/UI-TARS-1.5-7B • Image-Text-to-Text • Updated 26 days ago • 23.7k • 255
Qwen/Qwen2.5-VL-7B-Instruct • Image-Text-to-Text • Updated Apr 6 • 3.32M • 883
Qwen/Qwen2.5-Omni-7B • Any-to-Any • Updated 13 days ago • 183k • 1.59k
stepfun-ai/Step1X-Edit • Image-to-Image • Updated 1 day ago • 11 • 265
Qwen/Qwen2-VL-72B-Instruct • Image-Text-to-Text • Updated Feb 6 • 44.6k • 296
Qwen/Qwen2.5-VL-72B-Instruct • Image-Text-to-Text • Updated Mar 23 • 173k • 449
Qwen/Qwen2.5-VL-3B-Instruct • Image-Text-to-Text • Updated Apr 6 • 2.49M • 358
turing-motors/Heron-NVILA-Lite-15B • Image-Text-to-Text • Updated 13 days ago • 543 • 7
openfree/Qwen2.5-VL-32B-Instruct-Q4_K_M-GGUF • Image-Text-to-Text • Updated Mar 30 • 1.35k • 18
openfree/Qwen2.5-VL-32B-Instruct-Q8_0-GGUF • Image-Text-to-Text • Updated Mar 30 • 618 • 17
ByteDance-Seed/UI-TARS-7B-DPO • Image-Text-to-Text • Updated Jan 25 • 83.3k • 213
Qwen/Qwen2.5-VL-32B-Instruct • Image-Text-to-Text • Updated 30 days ago • 334k • 356
openvla/openvla-7b • Image-Text-to-Text • Updated Sep 16, 2024 • 761k • 113
Qwen/Qwen2-VL-7B-Instruct • Image-Text-to-Text • Updated Feb 6 • 1.44M • 1.19k
allenai/MolmoE-1B-0924 • Image-Text-to-Text • Updated 19 days ago • 3.19k • 143
jinaai/jina-clip-v2 • Feature Extraction • Updated 16 days ago • 44.6k • 225
chenjoya/LiveCC-7B-Instruct • Updated 18 days ago • 6.81k • 34
unsloth/Qwen2.5-VL-7B-Instruct-GGUF • Image-Text-to-Text • Updated 2 days ago • 6.35k • 3
osunlp/WebJudge-7B • Image-Text-to-Text • Updated 2 days ago • 19 • 3
ByteDance-Seed/UI-TARS-7B-SFT • Image-Text-to-Text • Updated Jan 25 • 5.22k • 171
ByteDance-Seed/UI-TARS-72B-SFT • Image-Text-to-Text • Updated Jan 25 • 115 • 19
ByteDance-Seed/UI-TARS-72B-DPO • Image-Text-to-Text • Updated Jan 25 • 9.34k • 126
turing-motors/Heron-NVILA-Lite-2B • Image-Text-to-Text • Updated 13 days ago • 486 • 2
turing-motors/Heron-NVILA-Lite-1B • Image-Text-to-Text • Updated 13 days ago • 275 • 2
unsloth/Qwen2.5-VL-3B-Instruct-GGUF • Image-Text-to-Text • Updated 2 days ago • 3.9k • 2
turing-motors/Heron-NVILA-Lite-33B • Image-Text-to-Text • Updated 1 day ago • 30 • 2
imageomics/bioclip • Zero-Shot Image Classification • Updated May 17, 2024 • 34.2k • 49