SlimMoE: Structured Compression of Large MoE Models via Expert Slimming and Distillation Paper • 2506.18349 • Published Jun 23, 2025 • 13
Cosmos-Reason1: From Physical Common Sense To Embodied Reasoning Paper • 2503.15558 • Published Mar 18, 2025 • 50
Cosmos World Foundation Model Platform for Physical AI Paper • 2501.03575 • Published Jan 7, 2025 • 81
Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone Paper • 2404.14219 • Published Apr 22, 2024 • 259
GeorgiaTech/0.0005_llama_nodpo_3iters_bs128_531lr_oldtrl_iter_3 Text Generation • 8B • Updated May 13, 2024 • 8
GeorgiaTech/0.0005_zephyr_withdpo_5551_4iters_bs256_newtrl_iter_3 Text Generation • 7B • Updated May 12, 2024 • 8
GeorgiaTech/0.0005_llama_nodpo_3iters_bs128_531lr_oldtrl_iter_2 Text Generation • 8B • Updated May 12, 2024 • 8
GeorgiaTech/0.0005_llama_nodpo_3iters_bs128_531lr_oldtrl_iter_1 Text Generation • 8B • Updated May 12, 2024 • 10
GeorgiaTech/0.0_llama_nodpo_3iters_bs128_531lr_iter_3 Text Generation • 8B • Updated May 12, 2024 • 4
GeorgiaTech/0.0_llama_nodpo_3iters_bs128_531lr_iter_2 Text Generation • 8B • Updated May 12, 2024 • 13
GeorgiaTech/0.0_llama_nodpo_3iters_bs128_531lr_iter_1 Text Generation • 8B • Updated May 12, 2024 • 7