Models: 1,175

Active filters: llama.cpp

tifin-india/sarvam-m-24b-q3-k-s-gguf

Text Generation • 24B • Updated May 24 • 66

tifin-india/sarvam-m-24b-q3-k-gguf

Text Generation • 24B • Updated May 24 • 31

tifin-india/sarvam-m-24b-q4-k-m-gguf

Text Generation • 24B • Updated May 24 • 37 • 1

tifin-india/sarvam-m-24b-q3-k-m-gguf

Text Generation • 24B • Updated May 24 • 29

tifin-india/sarvam-m-24b-q4-k-s-gguf

Text Generation • 24B • Updated May 24 • 35

tifin-india/sarvam-m-24b-q5-k-m-gguf

Text Generation • 24B • Updated May 24 • 51 • 2

ykarout/MiMo-VL-7B-SFT-GGUF

Image-Text-to-Text • 8B • Updated Jun 2 • 124

and-emili/aera-4b-GGUF

Text Generation • 4B • Updated Sep 24 • 38

XythicK/Qwen.Qwen2.5-Math-1.5B-GGUF

2B • Updated Jun 5 • 90

Govind222/Koyna-V2-1b-instruct-GGUF

1.0B • Updated Jun 5

agentlans/SmolLM2-135M-Instruct-GGUF

0.1B • Updated Jun 6 • 46

ReallyFloppyPenguin/Holo1-3B-GGUF

3B • Updated Jun 10 • 92 • 2

mgonzs13/SpaceOm-GGUF

Image-Text-to-Text • 3B • Updated Jul 15 • 141 • 1

Darkhn-Quants/L3.3-70B-Animus-V1-GGUF

71B • Updated Jun 16 • 193

allura-quants/allura-org_Q3-8B-Kintsugi-GGUF

ReallyFloppyPenguin/sarvam-m-GGUF

24B • Updated Jun 14 • 40 • 1

ReallyFloppyPenguin/DeepSeek-R1-0528-Qwen3-8B-GGUF

8B • Updated Jul 5 • 167

ReallyFloppyPenguin/MiniCPM4-8B-GGUF

8B • Updated Jun 14 • 53

ReallyFloppyPenguin/Nemotron-Research-Reasoning-Qwen-1.5B-GGUF

2B • Updated Jun 14 • 39 • 1

ReallyFloppyPenguin/OpenCodeReasoning-Nemotron-14B-GGUF

15B • Updated Jun 16 • 43 • 1

ReallyFloppyPenguin/Jan-nano-GGUF

4B • Updated Jun 16 • 144

ReallyFloppyPenguin/Qwen2.5-Math-7B-GGUF

ReallyFloppyPenguin/Qwen3-0.6B-GGUF

0.8B • Updated Jun 16 • 73

ReallyFloppyPenguin/Holo1-7B-GGUF

8B • Updated Jun 16 • 33

ReallyFloppyPenguin/DeepSeek-R1-Distill-Qwen-32B-GGUF

33B • Updated Jul 5 • 14

ReallyFloppyPenguin/Gemma-3-Gaia-PT-BR-4b-it-GGUF

4B • Updated Jun 17 • 147

ReallyFloppyPenguin/Qwen3-30B-A3B-GGUF

31B • Updated Jun 18 • 39

ReallyFloppyPenguin/II-Medical-8B-1706-GGUF

8B • Updated Jun 20 • 205

Darkhn-Quants/L3.3-70B-Animus-V4-Final-GGUF

71B • Updated Jun 28 • 360

ReallyFloppyPenguin/Polaris-4B-Preview-GGUF

4B • Updated Jun 23 • 64