Repo for paper Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability.
Qihan Ren
jasonrqh
AI & ML interests
explainable AI, LLM
Recent Activity
commentedon a paper about 2 hours ago
Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability liked a model about 5 hours ago
MiniMaxAI/MiniMax-M2.7 updated a collection about 23 hours ago
Rethink_SFT_generalization