Learning to Foresee: Unveiling the Unlocking Efficiency of On-Policy Distillation Paper • 2605.11739 • Published 6 days ago • 49
When to Trust Imagination: Adaptive Action Execution for World Action Models Paper • 2605.06222 • Published 12 days ago • 42
kwoncho/ko-sroberta-korean-time-expression-classifier Token Classification • 0.1B • Updated 12 days ago • 22 • 1
From Context to Skills: Can Language Models Learn from Context Skillfully? Paper • 2604.27660 • Published 16 days ago • 157
IndustryAssetEQA: A Neurosymbolic Operational Intelligence System for Embodied Question Answering in Industrial Asset Maintenance Paper • 2604.23446 • Published 24 days ago • 4
LLaDA2.0-Uni: Unifying Multimodal Understanding and Generation with Diffusion Large Language Model Paper • 2604.20796 • Published 27 days ago • 240
Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability Paper • 2604.06628 • Published Apr 8 • 324
FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization Paper • 2603.19835 • Published Mar 20 • 351