Kimi-K2 Collection Moonshot's MoE LLMs with 1 trillion parameters, exceptional on agentic intelligence • 2 items • Updated about 1 month ago • 115
AceReason-Nemotron 1.1: Advancing Math and Code Reasoning through SFT and RL Synergy Paper • 2506.13284 • Published Jun 16 • 24
AceReason-Nemotron: Advancing Math and Code Reasoning through Reinforcement Learning Paper • 2505.16400 • Published May 22 • 33
AceReason Collection Math and code reasoning models trained through reinforcement learning (RL) • 7 items • Updated 22 days ago • 14
AceMath-RL Collection Math reasoning models trained through reinforcement learning (RL) • 1 item • Updated 22 days ago • 4