RedHatAI/DeepSeek-R1-Distill-Llama-8B-quantized.w8a8 Text Generation • 8B • Updated Feb 27 • 2.28k • 2
RedHatAI/Meta-Llama-3.1-70B-Instruct-quantized.w4a16 Text Generation • 11B • Updated Feb 12 • 6.03k • 32
RedHatAI/Meta-Llama-3.1-70B-Instruct-quantized.w8a8 Text Generation • 71B • Updated Feb 11 • 8.04k • 20
RedHatAI/Llama-3.1-Nemotron-70B-Instruct-HF-quantized.w8a8 Text Generation • 71B • Updated Jan 3 • 10
RedHatAI/Llama-3.1-Nemotron-70B-Instruct-HF-quantized.w4a16 Text Generation • 11B • Updated Jan 3 • 20
RedHatAI/Sparse-Llama-3.1-8B-ultrachat_200k-2of4-FP8-dynamic Text Generation • 8B • Updated Dec 19, 2024 • 25 • 1
RedHatAI/Sparse-Llama-3.1-8B-evolcodealpaca-2of4-FP8-dynamic Text Generation • 8B • Updated Dec 19, 2024 • 24
RedHatAI/Sparse-Llama-3.1-8B-gsm8k-2of4-FP8-dynamic Text Generation • 8B • Updated Dec 19, 2024 • 41 • 1
RedHatAI/Sparse-Llama-3.1-8B-gsm8k-2of4-quantized.w4a16 Text Generation • 2B • Updated Dec 19, 2024 • 32