In-Context Reinforcement Learning for Tool Use in Large Language Models Paper • 2603.08068 • Published 10 days ago • 39
Reading, Not Thinking: Understanding and Bridging the Modality Gap When Text Becomes Pixels in Multimodal LLMs Paper • 2603.09095 • Published 10 days ago • 28
AgentVista: Evaluating Multimodal Agents in Ultra-Challenging Realistic Visual Scenarios Paper • 2602.23166 • Published 21 days ago • 44
DARE: Aligning LLM Agents with the R Statistical Ecosystem via Distribution-Aware Retrieval Paper • 2603.04743 • Published 15 days ago • 51
Heterogeneous Agent Collaborative Reinforcement Learning Paper • 2603.02604 • Published 16 days ago • 185
Timer-S1: A Billion-Scale Time Series Foundation Model with Serial Scaling Paper • 2603.04791 • Published 14 days ago • 16
microsoft/Phi-4-reasoning-vision-15B Image-Text-to-Text • 15B • Updated about 23 hours ago • 22.8k • 156
Memex(RL): Scaling Long-Horizon LLM Agents via Indexed Experience Memory Paper • 2603.04257 • Published 15 days ago • 19
Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled-GGUF Text Generation • 27B • Updated 4 days ago • 322k • 278
Jackrong/Qwen3.5-2B-Claude-4.6-Opus-Reasoning-Distilled-GGUF Text Generation • 2B • Updated 4 days ago • 54k • 114