Edit Models filters

Apps

Docker Model Runner

Inference Providers

HF Inference API

Misc

vision-language

Inference Endpoints

text-generation-inference

4-bit precision

8-bit precision

text-embeddings-inference

Carbon Emissions

Mixture of Experts

Models

352

Full-text search

Active filters: vision-language

mhsarkar/stepfunai_GOT-OCR2_0

Image-Text-to-Text • 0.7B • Updated Jan 6 • 2

tahamajs/plamma

Updated Feb 9 • 2 • 3

JerrryNie/ConceptCLIP

Feature Extraction • 0.5B • Updated May 4 • 188 • 3

hateslopacademy/otpensource-vision

Text Classification • Updated Feb 3 • 2 • 1

remyxai/SpaceQwen2.5-VL-3B-Instruct

Image-Text-to-Text • 4B • Updated Jun 28 • 2.12k • 15

hateslopacademy/otpensource-vision-lora

Text Classification • Updated Feb 3

Hibernates/Hibernates-JP-1.3b-Max

2B • Updated Feb 9 • 2 • 2

lxasqjc/lavender-llama-3.2-11b-lora

Image-Text-to-Text • Updated Feb 17 • 2

Mattimax/DATA-AI_Smol256M-Instruct

0.3B • Updated Feb 16 • 4

ctranslate2-4you/GOT-OCR2_0-Customized

Image-Text-to-Text • 0.7B • Updated Feb 17 • 3

sbintuitions/sarashina2-vision-8b

Image-to-Text • 8B • Updated Mar 27 • 396 • 6

sbintuitions/sarashina2-vision-14b

Image-to-Text • 14B • Updated Mar 27 • 3.39k • 8

UMA-IA/AQUILA-Engine-v1

Image-to-Text • 8B • Updated Mar 16 • 4 • 1

aihpi/food-waste-vlm

8B • Updated Apr 1 • 3

jpark677/internvl2-8b-mmbench-lora-ep-1-waa-false

Image-to-Text • 8B • Updated Apr 3 • 5

jpark677/internvl2-8b-mmbench-lora-ep-2-waa-false

Image-to-Text • 8B • Updated Apr 3 • 3

mradermacher/SpaceQwen2.5-VL-3B-Instruct-GGUF

Robotics • 3B • Updated 8 days ago • 124 • 1

mradermacher/SpaceQwen2.5-VL-3B-Instruct-i1-GGUF

Robotics • 3B • Updated 28 days ago • 297 • 1

mradermacher/AQUILA-Engine-v1-GGUF

8B • Updated 8 days ago • 75

mradermacher/AQUILA-Engine-v1-i1-GGUF

8B • Updated 28 days ago • 200

TheEighthDay/SeekWorld_RL_PLUS

8B • Updated Apr 19 • 236 • 1

mradermacher/SeekWorld_RL_PLUS-GGUF

8B • Updated 8 days ago • 114

nkkbr/ViCA-ARKitScenes

Video-Text-to-Text • 8B • Updated May 7 • 4

nkkbr/ViCA-ScanNet

Video-Text-to-Text • 8B • Updated May 7 • 4

nkkbr/ViCA-base

Video-Text-to-Text • 8B • Updated May 7 • 4

nkkbr/ViCA

Video-Text-to-Text • 8B • Updated May 28 • 8

nkkbr/ViCA-ScanNetPP

Video-Text-to-Text • 8B • Updated May 7 • 4

nkkbr/ViCA2-stage1-align

Video-Text-to-Text • 8B • Updated May 15 • 5

nkkbr/ViCA2-stage2-onevision-ft

Video-Text-to-Text • 8B • Updated May 15 • 6

nkkbr/ViCA2

Video-Text-to-Text • 8B • Updated May 28 • 30