GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models Paper • 2508.06471 • Published 4 days ago • 100
Don't Overthink It: A Survey of Efficient R1-style Large Reasoning Models Paper • 2508.02120 • Published 8 days ago • 15
Are We on the Right Way for Assessing Document Retrieval-Augmented Generation? Paper • 2508.03644 • Published 7 days ago • 22
Are Today's LLMs Ready to Explain Well-Being Concepts? Paper • 2508.03990 • Published 6 days ago • 22
On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification Paper • 2508.05629 • Published 5 days ago • 125
Web-CogReasoner: Towards Knowledge-Induced Cognitive Reasoning for Web Agents Paper • 2508.01858 • Published 9 days ago • 20
Efficient Agents: Building Effective Agents While Reducing Cost Paper • 2508.02694 • Published 19 days ago • 74
Training Long-Context, Multi-Turn Software Engineering Agents with Reinforcement Learning Paper • 2508.03501 • Published 7 days ago • 46
Enhancing Vision-Language Model Training with Reinforcement Learning in Synthetic Worlds for Real-World Success Paper • 2508.04280 • Published 6 days ago • 34
Is Chain-of-Thought Reasoning of LLMs a Mirage? A Data Distribution Lens Paper • 2508.01191 • Published 10 days ago • 192
Skywork UniPic: Unified Autoregressive Modeling for Visual Understanding and Generation Paper • 2508.03320 • Published 7 days ago • 58
Seed Diffusion: A Large-Scale Diffusion Language Model with High-Speed Inference Paper • 2508.02193 • Published 8 days ago • 117
InstructVLA: Vision-Language-Action Instruction Tuning from Understanding to Manipulation Paper • 2507.17520 • Published 20 days ago • 14
SitEmb-v1.5: Improved Context-Aware Dense Retrieval for Semantic Association and Long Story Comprehension Paper • 2508.01959 • Published 8 days ago • 52
Beyond the Trade-off: Self-Supervised Reinforcement Learning for Reasoning Models' Instruction Following Paper • 2508.02150 • Published 8 days ago • 34
Cognitive Kernel-Pro: A Framework for Deep Research Agents and Agent Foundation Models Training Paper • 2508.00414 • Published 11 days ago • 83