Whisper
📉
2.71k
Transcribe audio or YouTube video to text
Whisper includes both English-only and multilingual checkpoints for ASR and ST, ranging from 38M params for the tiny models to 1.5B params for large.
Transcribe audio or YouTube video to text