Audio stabilityai/stable-audio-open-small Text-to-Audio • 0.5B • Updated May 27 • 4.73k • 199
Play-Ground Running 184 184 Inference Playground 🔋 Toggle dark/light theme on Hugging Face Playground
Spatial manycore-research/SpatialLM-Llama-1B Text Generation • 1B • Updated Mar 21 • 3.1k • 967
Multimode microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition • 6B • Updated May 1 • 551k • 1.44k ByteDance/Sa2VA-8B Image-Text-to-Text • 8B • Updated Mar 19 • 1.41k • 59
microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition • 6B • Updated May 1 • 551k • 1.44k
Speako ibm-granite/granite-speech-3.2-8b Automatic Speech Recognition • 8B • Updated Apr 16 • 1.88k • 80 ByteDance/MegaTTS3 Text-to-Speech • Updated Apr 4 • 734 • 384 Running Demo 🚀 Transcribe audio/video to text
ibm-granite/granite-speech-3.2-8b Automatic Speech Recognition • 8B • Updated Apr 16 • 1.88k • 80
Imagen black-forest-labs/FLUX.1-dev Text-to-Image • Updated about 21 hours ago • 1.65M • • 10.7k
Code all-hands/openhands-lm-32b-v0.1 Text Generation • 33B • Updated Apr 16 • 3.48k • • 384
Audio stabilityai/stable-audio-open-small Text-to-Audio • 0.5B • Updated May 27 • 4.73k • 199
Speako ibm-granite/granite-speech-3.2-8b Automatic Speech Recognition • 8B • Updated Apr 16 • 1.88k • 80 ByteDance/MegaTTS3 Text-to-Speech • Updated Apr 4 • 734 • 384 Running Demo 🚀 Transcribe audio/video to text
ibm-granite/granite-speech-3.2-8b Automatic Speech Recognition • 8B • Updated Apr 16 • 1.88k • 80
Play-Ground Running 184 184 Inference Playground 🔋 Toggle dark/light theme on Hugging Face Playground
Imagen black-forest-labs/FLUX.1-dev Text-to-Image • Updated about 21 hours ago • 1.65M • • 10.7k
Spatial manycore-research/SpatialLM-Llama-1B Text Generation • 1B • Updated Mar 21 • 3.1k • 967
Code all-hands/openhands-lm-32b-v0.1 Text Generation • 33B • Updated Apr 16 • 3.48k • • 384
Multimode microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition • 6B • Updated May 1 • 551k • 1.44k ByteDance/Sa2VA-8B Image-Text-to-Text • 8B • Updated Mar 19 • 1.41k • 59
microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition • 6B • Updated May 1 • 551k • 1.44k