Learning Adaptive Parallel Reasoning with Language Models Paper • 2504.15466 • Published 22 days ago • 42
Oasis: One Image is All You Need for Multimodal Instruction Data Synthesis Paper • 2503.08741 • Published Mar 11 • 1
$\texttt{Complex-Edit}$: CoT-Like Instruction Generation for Complexity-Controllable Image Editing Benchmark Paper • 2504.13143 • Published 26 days ago • 8
Sonata: Self-Supervised Learning of Reliable Point Representations Paper • 2503.16429 • Published Mar 20 • 11
Sonata: Self-Supervised Learning of Reliable Point Representations Paper • 2503.16429 • Published Mar 20 • 11
SWEET-RL: Training Multi-Turn LLM Agents on Collaborative Reasoning Tasks Paper • 2503.15478 • Published Mar 19 • 10
reWordBench: Benchmarking and Improving the Robustness of Reward Models with Transformed Inputs Paper • 2503.11751 • Published Mar 14 • 16