HuggingFaceM4

company

Recent Activity

andito new activity 15 days ago

HuggingFaceM4/faster-qwen3-tts-demo: "error: Offset increment outside graph capture :("

andito new activity 15 days ago

HuggingFaceM4/faster-qwen3-tts-demo: "technical details, how is it achieved ?"

andito updated a Space 16 days ago

HuggingFaceM4/faster-qwen3-tts-demo

Organization Card

HuggingFaceM4 is the multimodal team at Hugging Face, working on vision-language models.

Within this organization on the Hugging Face Hub, you can find the Idefics models (version 1: IDEFICS, version 2: Idefics2, version 3: Idefics3), the datasets used for training, such as OBELICS, WebSight, The Cauldron, and Docmatix, and interactive tools for visualizing the results.
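As a rough illustration of how the repositories listed on this page can be reached programmatically, here is a minimal sketch. The helper names `hub_url` and `list_org_models` are hypothetical and not part of any library. The URL layout (models at the site root, datasets under `/datasets/`, Spaces under `/spaces/`) follows the Hub's public addressing scheme. The optional listing helper assumes the `huggingface_hub` package is installed and that its `HfApi.list_models` accepts `author` and `limit`, as it does in recent releases.

```python
from typing import Literal

RepoType = Literal["model", "dataset", "space"]

ORG = "HuggingFaceM4"  # organization name as shown on this page


def hub_url(repo_id: str, repo_type: RepoType = "model") -> str:
    """Build the public Hub URL for a repository (hypothetical helper).

    Model repos live at the site root; datasets and Spaces sit under
    the datasets/ and spaces/ path prefixes.
    """
    prefix = {"model": "", "dataset": "datasets/", "space": "spaces/"}[repo_type]
    return f"https://huggingface.co/{prefix}{repo_id}"


def list_org_models(org: str = ORG, limit: int = 10) -> list[str]:
    """Return up to `limit` model repo ids owned by `org`.

    Needs network access. Assumes a recent huggingface_hub release in
    which the returned model objects expose an `id` attribute.
    """
    # Imported lazily so hub_url() works without huggingface_hub installed.
    from huggingface_hub import HfApi

    return [m.id for m in HfApi().list_models(author=org, limit=limit)]


if __name__ == "__main__":
    print(hub_url("HuggingFaceM4/idefics2-8b"))
    print(hub_url("HuggingFaceM4/Docmatix", "dataset"))
```

Calling `list_org_models()` should return repo ids in the same `HuggingFaceM4/<name>` form used in the listings on this page, though the exact results depend on the state of the Hub at call time.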

Spaces 20

IDEFICS Playground

faster-qwen3-tts

Generate spoken audio from text with custom or cloned voices

Reachy Mini Remote Control (Multi-User)

Remote control for Reachy Mini robots with authentication

Reachy Mini Key Claim

Request an ephemeral API key using an order number

Gradium Setup

A small Space to improve onboarding to Gradium

FineVision: Open Data is All You Need

A new open-source dataset for training VLMs

Models 34

HuggingFaceM4/Idefics3-8B-Llama3

Image-Text-to-Text • Updated Dec 2, 2024 • 175k • 302

HuggingFaceM4/Florence-2-DocVQA

Image-Text-to-Text • 0.8B • Updated Oct 30, 2024 • 768 • 65

HuggingFaceM4/idefics2-8b

Image-Text-to-Text • 8B • Updated Oct 14, 2024 • 131k • 620

HuggingFaceM4/idefics2-8b-base

Image-Text-to-Text • 8B • Updated Jul 30, 2024 • 1.63k • 28

HuggingFaceM4/idefics2-8b-chatty

Image-Text-to-Text • 8B • Updated Jul 30, 2024 • 155 • 95

HuggingFaceM4/siglip-so400m-14-364-flash-attn2-navit

Zero-Shot Image Classification • 0.9B • Updated Jul 27, 2024 • 6 • 1

HuggingFaceM4/siglip-so400m-14-700-flash-attn2-navit

Zero-Shot Image Classification • 0.9B • Updated Jun 13, 2024 • 3 • 2

HuggingFaceM4/siglip-so400m-14-384-flash-attn2-navit

Zero-Shot Image Classification • 0.9B • Updated May 9, 2024 • 6 • 1

HuggingFaceM4/idefics2-8b-chatty-AWQ

Image-Text-to-Text • 8B • Updated May 6, 2024 • 9 • 5

HuggingFaceM4/idefics2-8b-AWQ

Image-Text-to-Text • 8B • Updated May 6, 2024 • 21 • 26

Datasets 82

HuggingFaceM4/FineVisionMax

Viewer • Updated Oct 21, 2025 • 24.2M • 22.2k • 22

HuggingFaceM4/FineVision

Viewer • Updated Oct 21, 2025 • 24.2M • 132k • 475

HuggingFaceM4/lmms-eval-embeddings

Updated Sep 3, 2025 • 446 • 1

HuggingFaceM4/DoclingMatix

Viewer • Updated Jul 31, 2025 • 1.27M • 1.72k • 48

HuggingFaceM4/Caltech-101

Updated Sep 10, 2024 • 293 • 4

HuggingFaceM4/Docmatix

Viewer • Updated Aug 26, 2024 • 2.55M • 14.3k • 299

HuggingFaceM4/the_cauldron

Viewer • Updated May 6, 2024 • 1.88M • 58.1k • 519

HuggingFaceM4/FairFace

Viewer • Updated Apr 11, 2024 • 195k • 1.13k • 28

HuggingFaceM4/MMBench

Viewer • Updated Apr 5, 2024 • 11k • 2.02k • 4

HuggingFaceM4/WebSight

Viewer • Updated Mar 26, 2024 • 2.75M • 5.54k • 386