Beyond the Trade-off: Self-Supervised Reinforcement Learning for Reasoning Models' Instruction Following Paper • 2508.02150 • Published 2 days ago • 31
HunyuanWorld 1.0: Generating Immersive, Explorable, and Interactive 3D Worlds from Words or Pixels Paper • 2507.21809 • Published 8 days ago • 112
WebShaper: Agentically Data Synthesizing via Information-Seeking Formalization Paper • 2507.15061 • Published 17 days ago • 48 • 5
Reasoning or Memorization? Unreliable Results of Reinforcement Learning Due to Data Contamination Paper • 2507.10532 • Published 23 days ago • 84
RAIF Collection Datasets and models in the paper "Incentivizing Reasoning for Advanced Instruction-Following of Large Language Models" [github.com/yuleiqin/RAIF]. • 12 items • Updated 20 days ago • 1