wenwenD/qwen3-4b-codeexp_grpo_no_prior_think_step280_2026-01-25_06-29-13_nvidia_balanced 4B • Updated about 7 hours ago
wenwenD/qwen3-4b-codeexp_grpo_w_prior_think_step280_2026-01-25_06-28-54_nvidia_balanced 4B • Updated about 7 hours ago
wenwenD/qwen3-4b-codeexp_grpo_with_prior_think_step280_2026-01-24_07-19-57_nvidia 4B • Updated 1 day ago • 14
wenwenD/qwen3-4b-codeexp_grpo_no_prior_think_step280_2026-01-24_07-21-36_nvdia 4B • Updated 1 day ago • 14
wenwenD/qwen3-4b-codeexp_grpo_w_prior_think_discount_always1_step175_2026_01_23_21_40_33 4B • Updated 1 day ago • 8
wenwenD/qwen7B-instruct-repo_sft_3epcs_w_context-synthetic_multiturn_sft_3epcs 8B • Updated Jun 16, 2025 • 1