RedHatAI/Meta-Llama-3.1-8B-Instruct-FP8-dynamic Text Generation • 8B • Updated 26 days ago • 38.1k • 9
RedHatAI/Llama-3.1-8B-Instruct-speculator.eagle3 Text Generation • 1.0B • Updated 18 days ago • 6.53k • 1
Running 1 Quantization Formats And Cuda Compute Capability Support 🧠 1 Quantization Formats & CUDA Compute Capability Support