mlfoundations-dev/Qwen2.5-7B-Instruct_qwq_mix_qwen3_science Text Generation • 8B • Updated about 19 hours ago
mlfoundations-dev/openthoughts3_100k_qwen25_1b_bsz256_lr16e5_epochs5 Text Generation • 2B • Updated 3 days ago • 83
mlfoundations-dev/openthoughts3_100k_qwen25_1b_bsz256_lr2e5_epochs5 Text Generation • 2B • Updated 3 days ago • 81
mlfoundations-dev/openthoughts3_100k_qwen25_1b_bsz256_lr8e5_epochs5 Text Generation • 2B • Updated 3 days ago • 83
mlfoundations-dev/openthoughts3_100k_qwen25_1b_bsz1024_lr16e5_epochs5 Text Generation • 2B • Updated 3 days ago • 78
mlfoundations-dev/openthoughts3_100k_qwen25_1b_bsz1024_lr8e5_epochs5 Text Generation • 2B • Updated 4 days ago • 83
mlfoundations-dev/openthoughts3_100k_qwen25_1b_bsz1024_lr4e5_epochs5 Text Generation • 2B • Updated 4 days ago • 80
mlfoundations-dev/openthoughts3_100k_qwen25_1b_bsz1024_lr2e5_epochs5 Text Generation • 2B • Updated 4 days ago • 82
mlfoundations-dev/openthoughts3_100k_qwen25_1b_bsz512_lr8e5_epochs5 Text Generation • 2B • Updated 4 days ago • 83
mlfoundations-dev/openthoughts3_100k_qwen25_1b_bsz512_lr4e5_epochs5 Text Generation • 2B • Updated 4 days ago • 77
mlfoundations-dev/openthoughts3_100k_qwen25_1b_bsz512_lr16e5_epochs5 Text Generation • 2B • Updated 4 days ago • 81
mlfoundations-dev/openthoughts3_100k_qwen25_1b_bsz256_lr16e5_epochs7 Text Generation • 2B • Updated 4 days ago • 84
mlfoundations-dev/openthoughts3_100k_qwen25_1b_bsz256_lr8e5_epochs7 Text Generation • 2B • Updated 4 days ago • 84
mlfoundations-dev/openthoughts3_100k_qwen25_1b_bsz256_lr2e5_epochs7 Text Generation • 2B • Updated 4 days ago • 82
mlfoundations-dev/openthoughts3_100k_qwen25_1b_bsz256_lr4e5_epochs7 Text Generation • 2B • Updated 4 days ago • 81
mlfoundations-dev/openthoughts3_100k_qwen25_1b_bsz256_lr4e5_epochs5 Text Generation • 2B • Updated 4 days ago • 76
mlfoundations-dev/openthoughts3_100k_qwen25_1b_bsz512_lr2e5_epochs5 Text Generation • 2B • Updated 5 days ago • 78
mlfoundations-dev/Qwen2.5-7B-Instruct_openthoughts3_300k_annotated_Qwen3-32B Text Generation • 8B • Updated 6 days ago • 80 • 1
mlfoundations-dev/DeepSeek-R1-Distill-Qwen-7B_OpenThoughts3 Text Generation • 8B • Updated 6 days ago • 76
mlfoundations-dev/DeepSeek-R1-Distill-Qwen-1.5B_OpenThoughts3 Text Generation • 2B • Updated 10 days ago • 138
mlfoundations-dev/QwQ-32B_enable-liger-kernel_False_OpenThoughts3_10k Text Generation • 33B • Updated 12 days ago • 132
mlfoundations-dev/QwQ-32B_enable-liger-kernel_False_OpenThoughts3_3k Text Generation • 33B • Updated 12 days ago • 260
mlfoundations-dev/QwQ-32B_enable-liger-kernel_False_OpenThoughts3_1k Text Generation • 33B • Updated 12 days ago • 326
mlfoundations-dev/Qwen2.5-7B-Instruct_openthoughts3_math_100k_annotated_QwQ-32B Text Generation • 8B • Updated 12 days ago • 324