arxiv:2403.13031
Zidi Xiong
polaris-73
·
AI & ML interests
None yet
Recent Activity
published
a model
7 days ago
polaris-73/ds8b_grpo_math_gsm8k_reinforce-global_step_100
updated
a model
7 days ago
polaris-73/ds8b_grpo_math_gsm8k_rloo-global_step_700
published
a model
7 days ago
polaris-73/ds8b_grpo_math_gsm8k_rloo-global_step_700