LaViT: Aligning Latent Visual Thoughts for Multi-modal Reasoning Paper • 2601.10129 • Published 4 days ago • 6
LSRIF: Logic-Structured Reinforcement Learning for Instruction Following Paper • 2601.06431 • Published 9 days ago • 7
Molmo2: Open Weights and Data for Vision-Language Models with Video Understanding and Grounding Paper • 2601.10611 • Published 3 days ago • 19
Rewarding the Rare: Uniqueness-Aware RL for Creative Problem Solving in LLMs Paper • 2601.08763 • Published 5 days ago • 124
Urban Socio-Semantic Segmentation with Vision-Language Reasoning Paper • 2601.10477 • Published 3 days ago • 147