Separate audio into vocals, bass, drums, and other
Transcribe audio/video to text and generate SRT subtitles