Speed by Simplicity: A Single-Stream Architecture for Fast Audio-Video Generative Foundation Model Paper • 2603.21986 • Published Mar 23 • 125
Running on CPU Upgrade Agents Featured 1.38k Open ASR Leaderboard 🏆 1.38k Explore and compare speech recognition model benchmarks
daVinci-Dev: Agent-native Mid-training for Software Engineering Paper • 2601.18418 • Published Jan 26 • 126
RAPO++: Cross-Stage Prompt Optimization for Text-to-Video Generation via Data Alignment and Test-Time Scaling Paper • 2510.20206 • Published Oct 23, 2025 • 12
Running Agents Featured 135 Open VLM Video Leaderboard 🌎 135 VLMEvalKit Eval Results in video understanding benchmark
Running on CPU Upgrade Agents 1.02k Open VLM Leaderboard 🌎 1.02k VLMEvalKit Evaluation Results Collection
Running on CPU Upgrade 14k Open LLM Leaderboard 🏆 14k Track, rank and evaluate open LLMs and chatbots
view article Article Smol2Operator: Post-Training GUI Agents for Computer Use +3 A-Mahla, merve, sergiopaniego, reach-vb, lewtun • Sep 23, 2025 • 138