RedHatAI/Llama-3.2-11B-Vision-Instruct-FP8-dynamic • Text Generation • 11B • Updated Oct 2, 2024 • 4.57k • 24
RedHatAI/DeepSeek-R1-Distill-Qwen-32B-quantized.w8a8 • Text Generation • 33B • Updated Feb 27 • 3.54k • 11
ISTA-DASLab/Mistral-Small-3.1-24B-Instruct-2503-GPTQ-4b-128g • Image-Text-to-Text • 5B • Updated Apr 6 • 3.66k • 14
RedHatAI/Mistral-Small-3.1-24B-Instruct-2503-quantized.w4a16 • Image-Text-to-Text • 5B • Updated 29 days ago • 4.22k • 5
nm-testing/tinyllama-one-shot-static-quant-test-compressed • Text Generation • 1B • Updated Oct 9, 2024 • 23
nm-testing/tinyllama-one-shot-w4a16-channel-compressed • Text Generation • 1B • Updated Oct 9, 2024 • 50
nm-testing/tinyllama-one-shot-w4a16-group128-packed • Text Generation • 0.3B • Updated Oct 9, 2024 • 44