A collection of audio related papers that I want to read
-
LiteASR: Efficient Automatic Speech Recognition with Low-Rank Approximation
Paper • 2502.20583 • Published • 13 -
Ichigo: Mixed-Modal Early-Fusion Realtime Voice Assistant
Paper • 2410.15316 • Published • 12 -
Spark-TTS: An Efficient LLM-Based Text-to-Speech Model with Single-Stream Decoupled Speech Tokens
Paper • 2503.01710 • Published • 6