Slow-Fast Video MLLM shi-labs/slowfast-video-mllm-qwen2-7b-convnext-576-frame64-s1t4 Video-Text-to-Text • 9B • Updated Apr 9 • 27 shi-labs/slowfast-video-mllm-qwen2-7b-convnext-576-frame96-s1t6 Video-Text-to-Text • 9B • Updated Apr 9 • 13 shi-labs/slowfast-video-mllm-qwen2-7b-convnext-576-frame128-s2t4 9B • Updated Apr 4 • 7
shi-labs/slowfast-video-mllm-qwen2-7b-convnext-576-frame64-s1t4 Video-Text-to-Text • 9B • Updated Apr 9 • 27
shi-labs/slowfast-video-mllm-qwen2-7b-convnext-576-frame96-s1t6 Video-Text-to-Text • 9B • Updated Apr 9 • 13
Creative AI Generative AI for visual creativity Runtime error 409 409 Versatile Diffusion 🚀 Runtime error 166 166 StreamingT2V 🔥 Consistent, Dynamic, and Extendable Long Video Generation Runtime error 378 378 Text2Video-Zero 🚀 Runtime error 91 91 Prompt-Free Diffusion 👀
Visual Understanding Accurate & efficient vision models, ops and systems Runtime error 471 471 OneFormer 🎗 Segment images into parts and maps Runtime error 56 56 Matting Anything 📈 shi-labs/oneformer_coco_swin_large Image Segmentation • Updated Jan 19, 2023 • 29.2k • • 5 shi-labs/oneformer_ade20k_swin_large Image Segmentation • Updated Jan 19, 2023 • 140k • • 31
Multimodal AI Large multimodal models Running on Zero 5 5 VisPer-LM 🔍 Generate detailed visual insights from images and text Running on Zero 82 82 CuMo 7b Zero 🐐 Generate text based on images and input text Runtime error 63 63 VCoder ✌ shi-labs/vcoder_ds_llava-v1.5-13b Text Generation • Updated Dec 20, 2023 • 15 • 4
Slow-Fast Video MLLM shi-labs/slowfast-video-mllm-qwen2-7b-convnext-576-frame64-s1t4 Video-Text-to-Text • 9B • Updated Apr 9 • 27 shi-labs/slowfast-video-mllm-qwen2-7b-convnext-576-frame96-s1t6 Video-Text-to-Text • 9B • Updated Apr 9 • 13 shi-labs/slowfast-video-mllm-qwen2-7b-convnext-576-frame128-s2t4 9B • Updated Apr 4 • 7
shi-labs/slowfast-video-mllm-qwen2-7b-convnext-576-frame64-s1t4 Video-Text-to-Text • 9B • Updated Apr 9 • 27
shi-labs/slowfast-video-mllm-qwen2-7b-convnext-576-frame96-s1t6 Video-Text-to-Text • 9B • Updated Apr 9 • 13
Visual Understanding Accurate & efficient vision models, ops and systems Runtime error 471 471 OneFormer 🎗 Segment images into parts and maps Runtime error 56 56 Matting Anything 📈 shi-labs/oneformer_coco_swin_large Image Segmentation • Updated Jan 19, 2023 • 29.2k • • 5 shi-labs/oneformer_ade20k_swin_large Image Segmentation • Updated Jan 19, 2023 • 140k • • 31
Creative AI Generative AI for visual creativity Runtime error 409 409 Versatile Diffusion 🚀 Runtime error 166 166 StreamingT2V 🔥 Consistent, Dynamic, and Extendable Long Video Generation Runtime error 378 378 Text2Video-Zero 🚀 Runtime error 91 91 Prompt-Free Diffusion 👀
Multimodal AI Large multimodal models Running on Zero 5 5 VisPer-LM 🔍 Generate detailed visual insights from images and text Running on Zero 82 82 CuMo 7b Zero 🐐 Generate text based on images and input text Runtime error 63 63 VCoder ✌ shi-labs/vcoder_ds_llava-v1.5-13b Text Generation • Updated Dec 20, 2023 • 15 • 4