Models: 19,829
Sort: Trending
Active filters: llama-cpp
Ransss/YiffyEstopianMaid-13B-Q6_K-GGUF
Updated
Jun 11, 2024
•
2
huangyt/module_v7-Q4_K_M-GGUF
Updated
Jun 11, 2024
Jarbas/all-MiniLM-L6-v2-Q4_K_M-GGUF
Sentence Similarity
•
Updated
Jun 11, 2024
•
38
ivar-vl/TinyLlama-1.1B-Chat-v1.0-Q4_K_M-GGUF
Updated
Jun 11, 2024
•
1
ccolicino/Mistral-7B-Instruct-v0.3-Q4_K_M-GGUF
Updated
Jun 11, 2024
•
4
eccheng/Phi-3-mini-128k-instruct-Q4_0-GGUF
Text Generation
•
Updated
Jun 11, 2024
•
17
frcp/Ocelot-Ko-self-instruction-10.8B-v1.0-Q4_K_M-GGUF
Text Generation
•
Updated
Jun 12, 2024
•
3
gate369/Phi-3-mini-128k-instruct-IQ4_XS-GGUF
Text Generation
•
Updated
Jun 12, 2024
•
8
frcp/gemma-summary-v01-Q4_K_M-GGUF
Text Generation
•
Updated
Jun 12, 2024
•
2
waltervix/dolphin-2.9.2-qwen2-7b-Q4_K_M-GGUF
Updated
Jun 12, 2024
•
3
•
1
farpluto/Phi-3-medium-4k-instruct-Q4_K_M-GGUF
Text Generation
•
Updated
Jun 12, 2024
•
3
zhentaoyu/Llama-2-7b-chat-hf-Q4_0-GGUF
Text Generation
•
Updated
Jun 12, 2024
•
9
evshiron/Llama-3-8B-Sydney-Q4_K_M-GGUF
Updated
Jun 12, 2024
magiccpp/open_llama_3b_v2-Q8_0-GGUF
Updated
Jun 12, 2024
•
8
huggingkot/dolphin-2.9.2-Phi-3-Medium-abliterated-Q4_K_M-GGUF
Updated
Jun 12, 2024
•
9
BLURPLETESTS/Llama3-Toxic-8B-imat-Q5_K_M-GGUF
Updated
Jun 14, 2024
•
2
•
1
zhaijunxiao/omost-llama-3-8b-Q8_0-GGUF
Updated
Jun 12, 2024
•
10
•
4
e2jhiubyiiyvw/Qwen2-7B-Instruct-Q5_K_M-GGUF
Text Generation
•
Updated
Jun 12, 2024
•
1
raincandy-u/TinyStories-656K-Q8_0-GGUF
Updated
Jun 12, 2024
•
21
•
4
Tech-Meld/Hajax_Chat_1.0-Q3_K_S-GGUF
Updated
Jun 12, 2024
•
8
NikolayKozloff/CataLlama-v0.1-Instruct-SFT-Q8_0-GGUF
Text Generation
•
Updated
Jun 12, 2024
•
3
•
1
debenoist/qlora_model_4_16bit-Q4_K_M-GGUF
Updated
Jun 12, 2024
NikolayKozloff/CataLlama-v0.1-Instruct-DPO-Q8_0-GGUF
Text Generation
•
Updated
Jun 12, 2024
•
1
NikolayKozloff/Ko-Llama-3-8B-Instruct-Q8_0-GGUF
Text Generation
•
Updated
Jun 12, 2024
•
1
•
1
NikolayKozloff/Ko-Qwen2-7B-Instruct-Q8_0-GGUF
Updated
Jun 12, 2024
•
5
•
3
albertodelazzari/Mistral-7B-Instruct-v0.2-Q4_K_M-GGUF
Text Generation
•
Updated
Jun 12, 2024
•
1
NikolayKozloff/Tesser-Llama-3-Ko-8B-Q4_0-GGUF
Text Generation
•
Updated
Jun 12, 2024
•
6
•
1
NikolayKozloff/Tesser-Llama-3-Ko-8B-Q5_0-GGUF
Text Generation
•
Updated
Jun 12, 2024
•
4
•
1
NikolayKozloff/Dorna-Llama3-8B-Instruct-IQ4_XS-GGUF
Updated
Jun 12, 2024
•
18
•
1
NikolayKozloff/Dorna-Llama3-8B-Instruct-IQ4_NL-GGUF
Updated
Jun 12, 2024
•
5
•
1