Large Language Model Agent: A Survey on Methodology, Applications and Challenges Paper • 2503.21460 • Published Mar 27 • 79
Agent KB: Leveraging Cross-Domain Experience for Agentic Problem Solving Paper • 2507.06229 • Published Jul 8 • 72
Cognitive Kernel-Pro: A Framework for Deep Research Agents and Agent Foundation Models Training Paper • 2508.00414 • Published 13 days ago • 84
A Survey of Context Engineering for Large Language Models Paper • 2507.13334 • Published 28 days ago • 239
Pre-Trained Policy Discriminators are General Reward Models Paper • 2507.05197 • Published Jul 7 • 39
Expanding RL with Verifiable Rewards Across Diverse Domains Paper • 2503.23829 • Published Mar 31 • 24
WebThinker: Empowering Large Reasoning Models with Deep Research Capability Paper • 2504.21776 • Published Apr 30 • 59
view article Article Tiny Agents in Python: a MCP-powered agent in ~70 lines of code By celinah and 3 others • May 23 • 155
Llama Nemotron Collection Open, Production-ready Enterprise Models • 11 items • Updated 14 days ago • 64
Sleep-time Compute: Beyond Inference Scaling at Test-time Paper • 2504.13171 • Published Apr 17 • 15
InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models Paper • 2504.10479 • Published Apr 14 • 280
Molmo Collection Artifacts for open multimodal language models. • 5 items • Updated Apr 30 • 308
SFTvsRL Models & Data Collection This collection contains 4 initial checkpoints for https://github.com/LeslieTrue/SFTvsRL and necessary data for V-IRL training. • 7 items • Updated Mar 13 • 9
GenPRM Collection A collection of GenPRM. Project page: https://ryanliu112.github.io/GenPRM • 6 items • Updated Apr 6 • 5
DeBERTaV3: Improving DeBERTa using ELECTRA-Style Pre-Training with Gradient-Disentangled Embedding Sharing Paper • 2111.09543 • Published Nov 18, 2021 • 3