hdong0/Qwen2.5-1.5B-Open-R1-Distill_deepmath_bottom_10epoch Text Generation • 2B • Updated May 22 • 2
hdong0/Qwen2.5-Math-1.5B-batch-mix-Open-R1-GRPO_100steps_lr1e-6 Text Generation • 2B • Updated May 24 • 3