Edit Models filters

Apps

Docker Model Runner

Inference Providers

OVHcloud AI Endpoints

HF Inference API

Misc

Inference Endpoints

text-generation-inference

4-bit precision

8-bit precision

text-embeddings-inference

Mixture of Experts

Carbon Emissions

Models

1,129

Full-text search

Active filters: llama.cpp

wasmedge/llama2

Text Generation • 13B • Updated Nov 11, 2023 • 148 • 8

alvarobartt/lince-zero-7b-GGUF

Text Generation • 7B • Updated Nov 1, 2023 • 76

vietgpt/dama-2-7b-chat-gguf

Text Generation • 7B • Updated Nov 17, 2023 • 29 • 1

FlexingD/yarn-mistral-7B-64k-instruct-alpaca-cleaned-GGUF

7B • Updated Dec 4, 2023 • 388 • 5

jdluzen/Mistral-7B-Instruct-v0.2-GGUF

7B • Updated Dec 24, 2023 • 21

jacobhoffmann/CodeLlama-13B-TestGen-Dart_v0.2-GGUF

Text Generation • 13B • Updated Dec 11, 2024 • 71

mostafaamiri/persian-llama-7b-GGUF-Q4

7B • Updated Jan 13, 2024 • 571 • 9

ayoubkirouane/Mistral-Depth-UP-Scaled-9B-AlpacaInstruct-gguf

Text Generation • 9B • Updated Jan 24, 2024 • 35

ehristoforu/LLMs

Text Generation • 7B • Updated Apr 16, 2024 • 10 • 4

osanseviero/DareVox-7B-AWQ

7B • Updated Feb 7, 2024 • 4

ahmetkca/trendyol-7B-v1.0-f16-gguf

7B • Updated Feb 15, 2024 • 8

ahmetkca/trendyol-7B-v1.0-f32-gguf

7B • Updated Feb 15, 2024 • 32

google/gemma-7b-it-GGUF

9B • Updated Aug 14, 2024 • 63 • 44

google/gemma-7b-GGUF

9B • Updated Jun 27, 2024 • 84 • 21

google/gemma-2b-it-GGUF

3B • Updated Jun 27, 2024 • 118 • 19

google/gemma-2b-GGUF

3B • Updated Jun 27, 2024 • 127 • 16

iAkashPaul/Indic-gemma-2b-finetuned-sft-Navarasa-GGUF

3B • Updated Mar 8, 2024 • 42 • 3

MrOvkill/gemma-2-inference-endpoint-GGUF

Text Generation • Updated Mar 11, 2024 • 62

google/gemma-1.1-7b-it-GGUF

9B • Updated Jun 27, 2024 • 10 • 20

google/gemma-1.1-2b-it-GGUF

3B • Updated Jun 27, 2024 • 1 • 19

webbigdata/C3TR-Adapter_gguf

Translation • 9B • Updated Aug 14, 2024 • 708 • 26

google/codegemma-2b-GGUF

Text Generation • 3B • Updated Jun 27, 2024 • 70 • 28

google/codegemma-7b-GGUF

Text Generation • 9B • Updated Jun 27, 2024 • 60 • 24

google/codegemma-7b-it-GGUF

Text Generation • 9B • Updated Jun 27, 2024 • 86 • 61

pacozaa/bonito-gguf

7B • Updated Apr 14, 2024 • 35

pmking27/PrathameshLLM-2B-GGUF

3B • Updated Apr 9, 2024 • 1.95k • 1

teleprint-me/cyberpunk-valerie-v0.1

Text Generation • 90.1M • Updated Apr 18, 2024 • 125 • 1

qwp4w3hyb/Meta-Llama-3-8B-Instruct-iMat-GGUF

Text Generation • 8B • Updated Apr 29, 2024 • 1.14k • 6

mgonzs13/Mistroll-7B-v2.2-GGUF

Text Generation • 7B • Updated Apr 29, 2024 • 190

mgonzs13/ladybird-base-7B-v8-GGUF

Text Generation • 7B • Updated Apr 29, 2024 • 44