Edit Models filters

Model Tree

Apps

Docker Model Runner

Inference Providers

OVHcloud AI Endpoints

HF Inference API

Misc

Inference Endpoints

text-generation-inference

Eval Results (legacy)

text-embeddings-inference

4-bit precision

8-bit precision

Mixture of Experts

Carbon Emissions

Models

30,790

Base only

Active filters: 8-bit

deepseek-ai/DeepSeek-V4-Pro-DSpark

Text Generation • 889B • Updated 2 days ago • 373 • 192

nvidia/GLM-5.2-NVFP4

Text Generation • 381B • Updated 2 days ago • 45.8k • 158

nvidia/Qwen3.6-35B-A3B-NVFP4

Text Generation • 19B • Updated 16 days ago • 5.24M • 372

deepseek-ai/DeepSeek-V4-Flash-DSpark

Text Generation • 165B • Updated 2 days ago • 24 • 84

deepseek-ai/DeepSeek-V4-Pro

Text Generation • 862B • Updated 7 days ago • 1.11M • • 5.1k

deepseek-ai/DeepSeek-V4-Flash

Text Generation • 158B • Updated 7 days ago • 1.99M • • 1.63k

nvidia/MiniMax-M3-NVFP4

Text Generation • 247B • Updated 3 days ago • 24.8k • 37

google/gemma-4-E2B-it-qat-mobile-transformers

Any-to-Any • 2B • Updated 24 days ago • 18.5k • 86

openai/gpt-oss-120b

Text Generation • 120B • Updated Aug 26, 2025 • 4.04M • • 4.93k

nvidia/Gemma-4-26B-A4B-NVFP4

Text Generation • 14B • Updated May 11 • 2.09M • 104

nvidia/NVIDIA-Nemotron-3-Ultra-550B-A55B-NVFP4

Text Generation • 335B • Updated 5 days ago • 423k • • 224

openai/gpt-oss-20b

Text Generation • 22B • Updated Aug 26, 2025 • 7M • • 4.74k

0xSero/GLM-5.2-504B

Text Generation • 290B • Updated 4 days ago • 10.7k • 20

nvidia/DeepSeek-V4-Flash-NVFP4

Text Generation • 167B • Updated 14 days ago • 247k • 52

AEON-7/Ornith-1.0-35B-AEON-Ultimate-Uncensored-NVFP4

Text Generation • 21B • Updated about 5 hours ago • 14

OpenYourMind/GLM-5.2-abliterated

432B • Updated about 2 hours ago • 15

0xSero/GLM-5.2-504B-Nvidia

Text Generation • 293B • Updated 3 days ago • 31 • 12

PhalaCloud/GLM-5.2-W4AFP8

Text Generation • 392B • Updated 7 days ago • 12.2k • 22

sakamakismile/Ornith-1.0-35B-NVFP4

Image-Text-to-Text • 20B • Updated 3 days ago • 4.41k • 10

unsloth/Qwen3.6-27B-NVFP4

Image-Text-to-Text • 19B • Updated 29 days ago • 1.11M • 96

0xSero/DeepSeek-V4-Flash-180B

Text Generation • 102B • Updated 30 days ago • 4.95k • 29

lukealonso/GLM-5.2-NVFP4

Text Generation • 432B • Updated 12 days ago • 67.4k • 27

microsoft/bitnet-b1.58-2B-4T

Text Generation • 0.8B • Updated Dec 17, 2025 • 8.56k • 1.47k

mlx-community/gemma-4-12b-coder-fable5-composer2.5-8bit

Text Generation • 12B • Updated 8 days ago • 5.07k • 15

DJLougen/Qwable-5-27B-Coder-NVFP4

Text Generation • 15B • Updated 6 days ago • 619 • 9

madeby561/GLM-5.2-NVFP4-REAP-504B-term

Text Generation • 290B • Updated 6 days ago • 1.3k • 13

sahilchachra/unlimited-ocr-mxfp8-mlx

Image-Text-to-Text • 1B • Updated 6 days ago • 1.03k • 7

autotrust/DeepSeek-V4-Flash-DSpark-4E

Text Generation • 165B • Updated 2 days ago • 7

AEON-7/Qwen3.6-35B-A3B-heretic-NVFP4

Image-Text-to-Text • 21B • Updated 1 day ago • 182k • 55

unsloth/Qwen3.6-35B-A3B-NVFP4

Image-Text-to-Text • 22B • Updated 29 days ago • 166k • 45