Very Large GGUFs
Collection
GGUF quantized versions of very large models - over 100B parameters
•
35 items
•
Updated
•
4
'Make knowledge free for everyone'
Quantized version of: nvidia/Llama-3_1-Nemotron-Ultra-253B-CPT-v1
1-bit
2-bit
3-bit
4-bit
5-bit
6-bit
8-bit
Base model
nvidia/Llama-3_1-Nemotron-Ultra-253B-CPT-v1