Learning Video LLM with Streaming Speech Transcription at Scale (CVPR 2025)
Joya Chen PRO
chenjoya
AI & ML interests
Video LLM
Recent Activity
liked
a model
1 day ago
Qwen/Qwen-Image
updated
a dataset
3 days ago
chenjoya/Live-WhisperX-526K
liked
a dataset
14 days ago
keithito/lj_speech