Llama3-8B-1.58
A trio of powerful models: fine-tuned from Llama3-8b-Instruct, with BitNet architecture!

HF1BitLLM/Llama3-8B-1.58-100B-tokens
Text Generation • 3B • Updated Sep 19, 2024 • 1.15k • 193

HF1BitLLM/Llama3-8B-1.58-Linear-10B-tokens
Text Generation • 3B • Updated Sep 18, 2024 • 46 • 10

HF1BitLLM/Llama3-8B-1.58-Sigmoid-k100-10B-tokens
Text Generation • 3B • Updated Sep 18, 2024 • 6 • 9