openai/clip-vit-large-patch14 Zero-Shot Image Classification β’ 0.4B β’ Updated Sep 15, 2023 β’ 8.18M β’ 1.94k
microsoft/Phi-3-vision-128k-instruct Text Generation β’ 4B β’ Updated 20 days ago β’ 21.2k β’ 969
Running on Zero MCP Featured 139 Multimodal OCR2 π» 139 nanonets ocr / smoldocling / monkey ocr / typhoon ocr
docling-project/SmolDocling-256M-preview Image-Text-to-Text β’ 0.3B β’ Updated Sep 17 β’ 59.8k β’ 1.6k
Running on Zero 16 Explainable-Vision-Language-Model π₯Ά 16 Generate a video visualizing how a model attends to an image while generating text
google/vit-base-patch16-224 Image Classification β’ 86.6M β’ Updated Sep 5, 2023 β’ 4.67M β’ β’ 912