ByteFlow: Language Modeling through Adaptive Byte Compression without a Tokenizer Paper • 2603.03583 • Published 7 days ago • 1
Breaking Training Bottlenecks: Effective and Stable Reinforcement Learning for Coding Models Paper • 2603.07777 • Published 3 days ago • 4
Scaling Data Difficulty: Improving Coding Models via Reinforcement Learning on Fresh and Challenging Problems Paper • 2603.07779 • Published 3 days ago • 4
Unlocking Data Value in Finance: A Study on Distillation and Difficulty-Aware Training Paper • 2603.07223 • Published 4 days ago • 12
NLE: Non-autoregressive LLM-based ASR by Transcript Editing Paper • 2603.08397 • Published 2 days ago • 14
Scaling Agentic Capabilities, Not Context: Efficient Reinforcement Finetuning for Large Toolspaces Paper • 2603.06713 • Published 5 days ago • 13
CoCo: Code as CoT for Text-to-Image Preview and Rare Concept Generation Paper • 2603.08652 • Published 1 day ago • 29
nabla-Reasoner: LLM Reasoning via Test-Time Gradient Descent in Latent Space Paper • 2603.04948 • Published 6 days ago • 1
Reasoning Models Struggle to Control their Chains of Thought Paper • 2603.05706 • Published 5 days ago • 24
Specificity-aware reinforcement learning for fine-grained open-world classification Paper • 2603.03197 • Published 8 days ago • 13
Memex(RL): Scaling Long-Horizon LLM Agents via Indexed Experience Memory Paper • 2603.04257 • Published 7 days ago • 18
MemSifter: Offloading LLM Memory Retrieval via Outcome-Driven Proxy Reasoning Paper • 2603.03379 • Published 8 days ago • 27
T2S-Bench & Structure-of-Thought: Benchmarking and Prompting Comprehensive Text-to-Structure Reasoning Paper • 2603.03790 • Published 7 days ago • 112
Heterogeneous Agent Collaborative Reinforcement Learning Paper • 2603.02604 • Published 8 days ago • 160
Tool-R0: Self-Evolving LLM Agents for Tool-Learning from Zero Data Paper • 2602.21320 • Published 15 days ago • 11