DevQuasar
/

nvidia.Llama-3_1-Nemotron-Ultra-253B-CPT-v1-GGUF

Text Generation

Model card Files Files and versions

'Make knowledge free for everyone'

Quantized version of: nvidia/Llama-3_1-Nemotron-Ultra-253B-CPT-v1

Downloads last month: 492

GGUF

Model size

253B params

Architecture

deci

Hardware compatibility

Log In to view the estimation

1-bit

2-bit

3-bit

4-bit

5-bit

6-bit

8-bit

Inference Providers NEW

Text Generation

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for DevQuasar/nvidia.Llama-3_1-Nemotron-Ultra-253B-CPT-v1-GGUF

Base model

nvidia/Llama-3_1-Nemotron-Ultra-253B-CPT-v1

Quantized

(1)

this model

Collection including DevQuasar/nvidia.Llama-3_1-Nemotron-Ultra-253B-CPT-v1-GGUF

Very Large GGUFs

GGUF quantized versions of very large models - over 100B parameters • 35 items • Updated 4 days ago • 4