view article Article Introducing Trackio: A Lightweight Experiment Tracking Library from Hugging Face By abidlabs and 4 others β’ 7 days ago β’ 136
view article Article Fast LoRA inference for Flux with Diffusers and PEFT By sayakpaul and 1 other β’ 13 days ago β’ 39
view article Article Back to The Future: Evaluating AI Agents on Predicting Future Events By vinid and 6 others β’ 19 days ago β’ 30
view article Article Five Big Improvements to Gradio MCP Servers By freddyaboulton β’ 19 days ago β’ 19
view article Article Seq vs Seq: the Ettin Suite of Paired Encoders and Decoders By orionweller and 5 others β’ 20 days ago β’ 53
view article Article Migrating the Hub from Git LFS to Xet By jsulz and 2 others β’ 21 days ago β’ 23
google/medsiglip-448 Zero-Shot Image Classification β’ 0.9B β’ Updated 26 days ago β’ 14.9k β’ 57
view article Article Upskill your LLMs with Gradio MCP Servers By freddyaboulton β’ 27 days ago β’ 18
view article Article SmolLM3: smol, multilingual, long-context reasoner By loubnabnl and 22 others β’ 28 days ago β’ 606
view article Article Efficient MultiModal Data Pipeline By ariG23498 and 4 others β’ 28 days ago β’ 53
view changelog Changelog Organization and User profiles now include repository listing pages Jun 20 β’ 124
view post Post 1692 The bunch of comparable demos for Multimodal VLMs (excels in OCR, cinematography understanding, spatial reasoning, etc.) now up on the Hub π€ β max recent till Jun'25.β¦ Demo Spaces β > [Nanonets-OCR-s, MonkeyOCR, Typhoon-OCR-7B, SmolDocling] : prithivMLmods/Multimodal-OCR2> [GLM-4.1v, docscopeOCR-7B, MonkeyOCR, coreOCR-7B] : prithivMLmods/core-OCR> [Camel-Doc-OCR, ViLaSR-7B, OCRFlux-3B, ShotVL-7B] : prithivMLmods/Multimodal-VLM-OCR> [SkyCaptioner-V1, SpaceThinker-3B, coreOCR-7B, SpaceOm-3B] : prithivMLmods/VisionScope-R2> [RolmOCR-7B, Qwen2-VL-OCR-2B, Aya-Vision-8B, Nanonets-OCR-s] : prithivMLmods/Multimodal-OCR> [DREX-062225-7B, Typhoon-OCR-3B, olmOCR-7B-0225, VIREX-062225-7B] : prithivMLmods/Multimodal-VLM-Thinking> [Cosmos-Reason1-7B, docscopeOCR-7B, Captioner-7B, visionOCR-3B] : prithivMLmods/DocScope-R1β¦ Space Collection : prithivMLmods/multimodal-implementations-67c9982ea04b39f0608badb0...To know more about it, visit the model card of the respective model. !! See translation 1 reply Β· π₯ 3 3 π€ 2 2 π 2 2 + Reply