TingchenFu/reason_general_3k_qwen-2.5-math-1.5b_05311421 • Text Generation • 2B • Updated 7 days ago • 4
TingchenFu/general_reason_3k_qwen-2.5-math-1.5b_06021434 • Text Generation • 2B • Updated 8 days ago • 4
TingchenFu/RM_gpt2-large_HH_bf16_harmless0.1_bs32lr1.41e-5decay0.0cosine_07070300 • Text Classification • Updated Jul 8, 2024 • 3
TingchenFu/RM_gpt2-large_HH_bf16_harmless0.05_bs32lr1.41e-5decay0.0cosine_07070300 • Text Classification • Updated Jul 8, 2024 • 3
TingchenFu/RM_gpt2-large_HH_bf16_harmless0.02_bs32lr1.41e-5decay0.0cosine_07070257 • Text Classification • Updated Jul 8, 2024 • 3 • 1
TingchenFu/RM_gpt2-large_HH_bf16_harmless0.01_bs32lr1.41e-5decay0.0cosine_07070257 • Text Classification • Updated Jul 8, 2024 • 3