Edit Models filters

Apps

Docker Model Runner

Inference Providers

OVHcloud AI Endpoints

HF Inference API

Misc

Inference Endpoints

text-generation-inference

Eval Results (legacy)

4-bit precision

8-bit precision

text-embeddings-inference

Mixture of Experts

Carbon Emissions

Models

626

Full-text search

Active filters: quantization

Octen/Octen-Embedding-8B-INT8

Sentence Similarity • 8B • Updated 2 days ago • 22 • 2

legraphista/DeepSeek-Coder-V2-Instruct-IMat-GGUF

Text Generation • 236B • Updated Jun 19, 2024 • 808 • 6

HyperX-Sentience/SDXL-GGUF

Text-to-Image • 3B • Updated Jun 24, 2025 • 378 • 12

stabilityai/stable-diffusion-3.5-large-tensorrt

Text-to-Image • Updated Oct 20, 2025 • 600 • 50

dougeeai/llama-cpp-python-wheels

Updated Nov 9, 2025 • 3

EricRollei/HunyuanImage-3-NF4-ComfyUI

Text-to-Image • 83B • Updated Nov 24, 2025 • 34 • 2

avtc/GLM-4.6-REAP-268B-A32B-GPTQMODEL-W4A16

Text Generation • 271B • Updated about 1 month ago • 144 • 3

drbaph/Qwen-Image-Edit-2511-FP8

Image-to-Image • Updated 25 days ago • 4.3k • 6

goniz/MiniMax-M2.1-REAP-30-GGUF

162B • Updated 3 days ago • 1.66k • 1

ryukin164/LFM2.5-1.2B-Q4-JP

Text Generation • 1B • Updated about 9 hours ago • 1

ethzanalytics/gpt-j-6B-8bit-sharded

Text Generation • 6B • Updated Jan 10, 2025 • 8 • 7

ethzanalytics/gpt-j-8bit-daily_dialogues

Text Generation • 6B • Updated Dec 25, 2024 • 13 • 4

ethzanalytics/gpt-j-8bit-KILT_WoW_10k_steps

Text Generation • Updated Nov 27, 2022 • 16

leumastai/t5-large-quantized

Updated Mar 16, 2023 • 6 • 1

pszemraj/stablelm-7b-sft-v7e3-autogptq-4bit-128g

Text Generation • Updated 20 days ago • 10 • 3

limcheekin/flan-t5-small-ct2

Updated May 24, 2023 • 7

limcheekin/flan-t5-xl-ct2

Updated Jun 3, 2023 • 8 • 1

limcheekin/flan-t5-xxl-ct2

Updated May 30, 2023 • 2 • 1

limcheekin/fastchat-t5-3b-ct2

Text Generation • Updated Jun 28, 2023 • 3 • 2

limcheekin/flan-alpaca-gpt4-xl-ct2

Updated Jun 4, 2023 • 2

limcheekin/mpt-7b-storywriter-ct2

Updated Jun 27, 2023 • 1

limcheekin/falcon-7b-instruct-ct2

Updated Jun 19, 2023 • 2 • 1

limcheekin/mpt-7b-instruct-ct2

Updated Jun 19, 2023 • 4

limcheekin/redpajama-chat-7b-ct2

Updated Jun 9, 2023 • 5

seonglae/wizardlm-7b-uncensored-gptq

Text Generation • Updated Jul 19, 2023 • 13

seonglae/llama-2-7b-chat-hf-gptq

Text Generation • Updated Jul 20, 2023 • 8

seonglae/llama-2-13b-chat-hf-gptq

Text Generation • Updated Jul 20, 2023 • 6

clibrain/Llama-2-7b-ft-instruct-es-gptq-4bit

Text Generation • Updated Sep 1, 2023 • 20 • 9

clibrain/Llama-2-13b-ft-instruct-es-gptq-4bit

Text Generation • Updated Sep 4, 2023 • 6 • 3

edumunozsala/llama-2-7b-int4-GPTQ-python-code-20k

Text Generation • Updated Sep 4, 2023 • 8 • 1