Edit Models filters

Inference Providers

HF Inference API

Misc

Inference Endpoints

text-generation-inference

4-bit precision

8-bit precision

text-embeddings-inference

Carbon Emissions

Mixture of Experts

Models

22,857

Full-text search

Active filters: llama-cpp

eccheng/Phi-3-mini-128k-instruct-Q4_0-GGUF

Text Generation • 4B • Updated Jun 11, 2024 • 29

frcp/Ocelot-Ko-self-instruction-10.8B-v1.0-Q4_K_M-GGUF

Text Generation • 11B • Updated Jun 12, 2024

gate369/Phi-3-mini-128k-instruct-IQ4_XS-GGUF

Text Generation • 4B • Updated Jun 12, 2024 • 18

frcp/gemma-summary-v01-Q4_K_M-GGUF

Text Generation • 3B • Updated Jun 12, 2024 • 6

waltervix/dolphin-2.9.2-qwen2-7b-Q4_K_M-GGUF

8B • Updated Jun 12, 2024 • 5 • 1

farpluto/Phi-3-medium-4k-instruct-Q4_K_M-GGUF

Text Generation • 14B • Updated Jun 12, 2024 • 1

zhentaoyu/Llama-2-7b-chat-hf-Q4_0-GGUF

Text Generation • 7B • Updated Jun 12, 2024 • 15

evshiron/Llama-3-8B-Sydney-Q4_K_M-GGUF

8B • Updated Jun 12, 2024

magiccpp/open_llama_3b_v2-Q8_0-GGUF

3B • Updated Jun 12, 2024 • 1

huggingkot/dolphin-2.9.2-Phi-3-Medium-abliterated-Q4_K_M-GGUF

14B • Updated Jun 12, 2024 • 3

BLURPLETESTS/Llama3-Toxic-8B-imat-Q5_K_M-GGUF

8B • Updated Jun 14, 2024 • 1 • 1

zhaijunxiao/omost-llama-3-8b-Q8_0-GGUF

8B • Updated Jun 12, 2024 • 9 • 4

e2jhiubyiiyvw/Qwen2-7B-Instruct-Q5_K_M-GGUF

Text Generation • 8B • Updated Jun 12, 2024

raincandy-u/TinyStories-656K-Q8_0-GGUF

0.0B • Updated Jun 12, 2024 • 25 • 4

Tech-Meld/Hajax_Chat_1.0-Q3_K_S-GGUF

7B • Updated Jun 12, 2024 • 5

NikolayKozloff/CataLlama-v0.1-Instruct-SFT-Q8_0-GGUF

Text Generation • 8B • Updated Jun 12, 2024 • 3 • 1

debenoist/qlora_model_4_16bit-Q4_K_M-GGUF

8B • Updated Jun 12, 2024 • 2

NikolayKozloff/CataLlama-v0.1-Instruct-DPO-Q8_0-GGUF

Text Generation • 8B • Updated Jun 12, 2024 • 1 • 1

NikolayKozloff/Ko-Llama-3-8B-Instruct-Q8_0-GGUF

Text Generation • 8B • Updated Jun 12, 2024 • 3 • 1

NikolayKozloff/Ko-Qwen2-7B-Instruct-Q8_0-GGUF

8B • Updated Jun 12, 2024 • 14 • 3

albertodelazzari/Mistral-7B-Instruct-v0.2-Q4_K_M-GGUF

Text Generation • 7B • Updated Jun 12, 2024 • 2

NikolayKozloff/Tesser-Llama-3-Ko-8B-Q4_0-GGUF

Text Generation • 8B • Updated Jun 12, 2024 • 2 • 1

NikolayKozloff/Tesser-Llama-3-Ko-8B-Q5_0-GGUF

Text Generation • 8B • Updated Jun 12, 2024 • 3 • 1

NikolayKozloff/Dorna-Llama3-8B-Instruct-IQ4_XS-GGUF

8B • Updated Jun 12, 2024 • 2 • 1

NikolayKozloff/Dorna-Llama3-8B-Instruct-IQ4_NL-GGUF

8B • Updated Jun 12, 2024 • 2 • 1

NikolayKozloff/shotor-Q8_0-GGUF

Text Generation • 8B • Updated Jun 12, 2024 • 1

VlSav/saiga_qwen2_7b_sft_m2_d6_kto_m1_d5-Q6_K-GGUF

8B • Updated Jun 12, 2024

huggingkot/Llama-3-8B-Instruct-abliterated-v2-Q4_K_M-GGUF

8B • Updated Jun 12, 2024 • 16

albertodelazzari/Mistral-7B-Instruct-v0.3-Q8_0-GGUF

7B • Updated Jun 12, 2024 • 27

sygenaithanos/Llama-3-8B-Instruct-Gradient-1048k-Q4_0-GGUF

Text Generation • 8B • Updated Jun 12, 2024 • 3