Seeing, Listening, Remembering, and Reasoning: A Multimodal Agent with Long-Term Memory Paper • 2508.09736 • Published 1 day ago • 19
WebWatcher: Breaking New Frontier of Vision-Language Deep Research Agent Paper • 2508.05748 • Published 7 days ago • 102
Matrix-3D: Omnidirectional Explorable 3D World Generation Paper • 2508.08086 • Published 3 days ago • 62
GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models Paper • 2508.06471 • Published 6 days ago • 134
MolmoAct: Action Reasoning Models that can Reason in Space Paper • 2508.07917 • Published 3 days ago • 33
Is Chain-of-Thought Reasoning of LLMs a Mirage? A Data Distribution Lens Paper • 2508.01191 • Published 13 days ago • 204
DeepPHY: Benchmarking Agentic VLMs on Physical Reasoning Paper • 2508.05405 • Published 7 days ago • 61
Genie Envisioner: A Unified World Foundation Platform for Robotic Manipulation Paper • 2508.05635 • Published 7 days ago • 67
On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification Paper • 2508.05629 • Published 7 days ago • 138