Edit Models filters

Apps

Docker Model Runner

Inference Providers

HF Inference API

Misc

Inference Endpoints

text-generation-inference

4-bit precision

8-bit precision

text-embeddings-inference

Carbon Emissions

Mixture of Experts

Models

206

Full-text search

Active filters: torchao

pytorch/Qwen3-4B-8da4w

Text Generation • Updated 13 days ago • 32

GooKSL/ccv_int8wo

andysalerno/Qwen3-8B-ao-autoquant

Text Generation • Updated May 9 • 19

andrewor14/Llama-3.1-8B-Instruct-float8dq

Text Generation • Updated May 9 • 13

HexLang/GPT2

Updated May 14 • 3

CJHauser/vibrance

Updated May 14 • 3

metascroy/Qwen3-4B-untied-8da4w-vllm-test

Text Generation • Updated May 15 • 3

GingerBled/DPO-Quantized_8bit_mock1

Text Generation • Updated May 21 • 3

GingerBled/DPO-Quantized_8bit_mock2

Text Generation • Updated May 21 • 3

sajal09/MNLP_M2_quantized_model2

Text Generation • Updated May 22 • 3

Erland/softpick-1.8B-4096-model-AO-W4A4

Text Generation • Updated May 22 • 3

Erland/softpick-1.8B-4096-model-AO-W4

Text Generation • Updated May 22 • 3

Erland/vanilla-1.8B-4096-model-AO-W4A4

Text Generation • Updated May 22 • 3

Erland/vanilla-1.8B-4096-model-AO-W4

Text Generation • Updated May 22 • 3

Cloudmaster/Llama-3.2-3B-torchao

Text Generation • Updated May 23 • 3

Jiqing/cuda_torchao_llama_68m

Updated May 23 • 6

Cloudmaster/Llama-3.2-3B-torchao-int4

Text Generation • Updated May 23 • 4

Cloudmaster/Llama-3.2-3B-torchao-int4-t4

Text Generation • Updated May 23 • 3

keko24/qwen-int8

Text Generation • Updated May 25 • 3

Cloudmaster/Llama-3.2-3B-torchao-autoquant

Text Generation • Updated May 26 • 3

Cloudmaster/Llama-3.2-3B-torchao-I8WI8A-attn

Text Generation • Updated May 26 • 3

Cloudmaster/Llama-3.2-3B-torchao-final

Text Generation • Updated May 27 • 3

oskdabk/first_quantized_model

Text Generation • Updated May 26 • 3

zay25/mnlp-model-A8W8-torchao

Text Generation • Updated May 26 • 2

keko24/qwen-int4

Text Generation • Updated May 26 • 3

zay25/test-A8W8

Text Generation • Updated May 26 • 3

Cloudmaster/Llama-3.2-3B-torchao-final00

Text Generation • Updated May 27 • 13

Cloudmaster/Llama-3.2-3B-torchao-final-woclass

Text Generation • Updated May 27 • 13

Cloudmaster/Llama-3.2-3B-torchao-final-wattn

Text Generation • Updated May 28 • 13 • 1

Cloudmaster/Llama-3.2-3B-torchao-final01

Text Generation • Updated May 27 • 7