DreamX-World 1.0: A General-Purpose Interactive World Model Paper • 2606.16993 • Published 1 day ago • 61
JoyAI-VL-Interaction: Real-Time Vision-Language Interaction Intelligence Paper • 2606.14777 • Published 7 days ago • 151
InterleaveThinker: Reinforcing Agentic Interleaved Generation Paper • 2606.13679 • Published 6 days ago • 78
Lens: Rethinking Training Efficiency for Foundational Text-to-Image Models Paper • 2605.21573 • Published 28 days ago • 110
Video2GUI: Synthesizing Large-Scale Interaction Trajectories for Generalized GUI Agent Pretraining Paper • 2605.14747 • Published May 14 • 145
Lance: Unified Multimodal Modeling by Multi-Task Synergy Paper • 2605.18678 • Published 30 days ago • 78
InsightTok: Improving Text and Face Fidelity in Discrete Tokenization for Autoregressive Image Generation Paper • 2605.14333 • Published May 14 • 35
HY-World 2.0: A Multi-Modal World Model for Reconstructing, Generating, and Simulating 3D Worlds Paper • 2604.14268 • Published Apr 15 • 124
Seedance 2.0: Advancing Video Generation for World Complexity Paper • 2604.14148 • Published Apr 15 • 165