distributed/optimized-gpt2-250m-convergence-test-v1 • Text Generation • 0.3B • Updated Sep 24, 2024 • 9
distributed/optimized-gpt2-250m-convergence-test-v2 • Text Generation • 0.3B • Updated Sep 24, 2024 • 44 • 1
distributed/optimized-gpt2-2b-without-stable-embeddings • Text Generation • 2B • Updated Dec 24, 2024 • 29