ATLAS: A High-Difficulty, Multidisciplinary Benchmark for Frontier Scientific Reasoning Paper • 2511.14366 • Published 9 days ago • 14
NORA-1.5: A Vision-Language-Action Model Trained using World Model- and Action-based Preference Rewards Paper • 2511.14659 • Published 9 days ago • 12
MMaDA-Parallel: Multimodal Large Diffusion Language Models for Thinking-Aware Editing and Generation Paper • 2511.09611 • Published 15 days ago • 67
Beyond the Exploration-Exploitation Trade-off: A Hidden State Approach for LLM Reasoning in RLVR Paper • 2509.23808 • Published Sep 28 • 47
MCP Tools 4 AI Collection A collection of spaces that you can use for building AI with AI via MCP • 7 items • Updated Jun 26 • 37
CogVLA: Cognition-Aligned Vision-Language-Action Model via Instruction-Driven Routing & Sparsification Paper • 2508.21046 • Published Aug 28 • 9
DINOv3 Collection DINOv3: foundation models producing excellent dense features, outperforming SotA w/o fine-tuning - https://arxiv.org/abs/2508.10104 • 13 items • Updated Aug 21 • 389
xLAM models Collection xLAM: A Family of Large Action Models to Empower AI Agent Systems: https://github.com/SalesforceAIResearch/xLAM • 22 items • Updated 22 days ago • 59
Being-H0: Vision-Language-Action Pretraining from Large-Scale Human Videos Paper • 2507.15597 • Published Jul 21 • 34
The Devil behind the mask: An emergent safety vulnerability of Diffusion LLMs Paper • 2507.11097 • Published Jul 15 • 64
Article Asynchronous Robot Inference: Decoupling Action Prediction and Execution Jul 10 • 44
Article SmolVLA: Efficient Vision-Language-Action Model trained on Lerobot Community Data Jun 3 • 285
ATLAS: Learning to Optimally Memorize the Context at Test Time Paper • 2505.23735 • Published May 29 • 22
Bamba Collection Collection of Bamba models: hybrid Mamba2-architecture models trained on open data • 9 items • Updated Apr 28 • 24