microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition β’ 6B β’ Updated May 1 β’ 414k β’ 1.47k
Running 320 320 Kokoro Text-to-Speech (WebGPU) π£ High-quality speech synthesis powered by Kokoro TTS
meta-llama/Llama-3.2-11B-Vision-Instruct Image-Text-to-Text β’ 11B β’ Updated Dec 4, 2024 β’ 821k β’ β’ 1.5k