agent training - a MercedeSnape Collection

updated 4 days ago

Don't Just Fine-tune the Agent, Tune the Environment

Paper • 2510.10197 • Published Oct 11, 2025

Note: Approaches post-training from the perspective of problem instances rather than SFT / RL methods.