MeDAL: Medical Abbreviation Disambiguation Dataset for Natural Language Understanding Pretraining Paper • 2012.13978 • Published Dec 27, 2020 • 1
view article Article How to Train Your LLM Web Agent: A Statistical Diagnosis By ppEmiliano • 28 days ago • 13
How to Train Your LLM Web Agent: A Statistical Diagnosis Paper • 2507.04103 • Published about 1 month ago • 46
How to Train Your LLM Web Agent: A Statistical Diagnosis Paper • 2507.04103 • Published about 1 month ago • 46 • 3
LineRetriever: Planning-Aware Observation Reduction for Web Agents Paper • 2507.00210 • Published Jun 30 • 6
LineRetriever: Planning-Aware Observation Reduction for Web Agents Paper • 2507.00210 • Published Jun 30 • 6
view article Article MIEB: The Benchmark That Stress-Tests Image-Text Embeddings Like Never Before By isaacchung and 2 others • Apr 24 • 14
AgentRewardBench: Evaluating Automatic Evaluations of Web Agent Trajectories Paper • 2504.08942 • Published Apr 11 • 27