Edit Models filters

Model Tree

Apps

Docker Model Runner

Inference Providers

OVHcloud AI Endpoints

HF Inference API

Misc

Inference Endpoints

text-generation-inference

Eval Results (legacy)

text-embeddings-inference

4-bit precision

8-bit precision

Mixture of Experts

Carbon Emissions

Models

7,869

Base only

Active filters: awq

prism-ml/Ternary-Bonsai-27B-AWQ-4bit

Image-Text-to-Text • 27B • Updated 3 days ago • 1.63k • • 14

prism-ml/Bonsai-27B-AWQ-4bit

Image-Text-to-Text • 27B • Updated 3 days ago • 716 • 6

cyankiwi/Qwen3.6-35B-A3B-AWQ-4bit

Image-Text-to-Text • 36B • Updated 14 days ago • 1.89M • 86

pearsonkyle/gemma4-31b-imatrix-mtp-GGUF

Image-Text-to-Text • 31B • Updated 6 days ago • 16.7k • 3

QuantTrio/Qwen3.6-35B-A3B-AWQ

Image-Text-to-Text • 36B • Updated Apr 17 • 1.07M • 30

QuantTrio/Qwen3.6-27B-AWQ

Image-Text-to-Text • 28B • Updated Apr 23 • 1.41M • 21

mconcat/Qwopus3.6-27B-v2-AWQ-4bit

Text Generation • 11B • Updated May 25 • 6.95k • 9

spectator2026/MiniMax-M3-AWQ-int4

Image-Text-to-Text • 69B • Updated 23 days ago • 1.53k • 3

Qwen/Qwen2.5-VL-32B-Instruct-AWQ

Image-Text-to-Text • 33B • Updated Apr 6, 2025 • 262k • 64

Qwen/Qwen3-4B-AWQ

Text Generation • 4B • Updated May 21, 2025 • 550k • 31

openbmb/MiniCPM-o-4_5-awq

Any-to-Any • 9B • Updated Jun 2 • 44.8k • 22

QuantTrio/Qwen3.5-122B-A10B-AWQ

Image-Text-to-Text • 125B • Updated Feb 26 • 29.1k • 29

QuantTrio/Qwen3.5-9B-AWQ

Image-Text-to-Text • 10B • Updated Mar 4 • 733k • 24

alonsoko/gemma-4-31b-it-abliterated-heretic-AWQ-W4A16

Image-Text-to-Text • 32B • Updated May 17 • 8.34k • 14

prism-ml/Bonsai-8B-AWQ-4-bit

Text Generation • 8B • Updated May 4 • 100 • 5

QuantTrio/Qwen3.6-27B-AWQ-6Bit

Image-Text-to-Text • 28B • Updated Apr 23 • 102k • 17

mattbucci/Qwen3.6-27B-AWQ

27B • Updated May 1 • 59.6k • 5

shawnw3i/Huihui-Qwen3.6-27B-abliterated-AWQ-MTP

Image-Text-to-Text • 6B • Updated May 22 • 29.1k • 11

AMAImedia/Qwen3-8B-Guard-Gen-NOESIS-AWQ-INT4

Text Generation • 8B • Updated May 17 • 10 • 2

kumar2235/Qwen3.5-4B-AWQ

Text Generation • 4B • Updated about 1 month ago • 428 • 3

JANGQ-AI/MiniMax-M3-REAP22-Coder

Text Generation • 34B • Updated 17 days ago • 1.89k • 4

Avesed/Qwen3.6-27B-INT4-W4A16

Text Generation • 28B • Updated 25 days ago • 1.24k • 1

sahilchachra/Qwythos-9B-Claude-Mythos-5-1M-AWQ

Text Generation • 9B • Updated 25 days ago • 6.49k • 3

Alennnndy/Qwen3.6-27B-AWQ-4bit

Text Generation • 28B • Updated 14 days ago • 1.54k • 2

seoilgun/Qwen3-8B-AWQ

8B • Updated 1 day ago • 18 • 1

casperhansen/mpt-7b-8k-chat-awq

Text Generation • Updated Nov 4, 2023 • 10 • 3

casperhansen/falcon-7b-awq

Text Generation • Updated Nov 4, 2023 • 15 • 1

casperhansen/vicuna-7b-v1.5-awq

Text Generation • Updated Oct 31, 2023 • 8 • 3

casperhansen/vicuna-7b-v1.5-awq-gemv

Text Generation • Updated Oct 31, 2023 • 6 • 1

casperhansen/mpt-7b-8k-chat-awq-gemv

Text Generation • Updated Oct 31, 2023 • 8