Transcribe audio to text
Display a loading screen with Hugging Face logo
Generate a music video with AI-generated voice