Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Tasks
Libraries
Datasets
Languages
Licenses
Other
1
Inference Providers
Select all
Novita
fal
Cohere
Nscale
Replicate
Hyperbolic
Cerebras
SambaNova
Together AI
Nebius AI Studio
Fireworks
HF Inference API
Misc
Reset Misc
llama-cpp
Inference Endpoints
Merge
text-generation-inference
Eval Results
4-bit precision
Mixture of Experts
8-bit precision
custom_code
Carbon Emissions
text-embeddings-inference
Apply filters
Models
19,700
Full-text search
Edit filters
Sort: Trending
Active filters:
llama-cpp
Clear all
Rivaidan/Smegmma-9B-v1-Q8_0-GGUF
Updated
Jul 8, 2024
•
23
notjjustnumbers/madlad400-3b-mt-Q4_K_M-GGUF
Translation
•
Updated
Jul 8, 2024
•
20
ethangoh/granite-8b-code-instruct-Q4_K_M-GGUF
Text Generation
•
Updated
Jul 8, 2024
•
1
HenryyTwitchyfinger/L3-8B-Stheno-v3.2-Q4_K_M-GGUF
Updated
Jul 8, 2024
utterlygreat/omost-llama-3-8b-IQ4_NL-GGUF
Updated
Jul 8, 2024
•
1
teemperor/Phi-3-medium-128k-instruct-Q6_K-GGUF
Text Generation
•
Updated
Jul 8, 2024
•
2
zhezhe/Gemma-2-9B-Chinese-Chat-Q4_K_M-GGUF
Text Generation
•
Updated
Jul 8, 2024
•
3
NikolayKozloff/madlad400-3b-mt-Q8_0-GGUF
Translation
•
Updated
Jul 8, 2024
•
10
•
1
NikolayKozloff/madlad400-10b-mt-Q8_0-GGUF
Translation
•
Updated
Jul 8, 2024
•
27
•
3
NikolayKozloff/madlad400-10b-mt-Q6_K-GGUF
Translation
•
Updated
Jul 8, 2024
•
12
•
1
Cran-May/internlm2_5-7b-chat-Q4_K_M-GGUF
Text Generation
•
Updated
Jul 8, 2024
•
2
NikolayKozloff/sqt5-xl-Albanian-shqip-llama.cpp-compatible-Q8_0-GGUF
Updated
Jul 8, 2024
•
1
•
1
dimcha/mxbai-embed-large-v1-Q4_K_M-GGUF
Feature Extraction
•
Updated
Jul 8, 2024
•
32
bmi-labmedinfo/Igea-3B-v0.1-GGUF
Updated
Jul 15, 2024
Cran-May/internlm2_5-7b-chat-IQ4_XS-GGUF
Text Generation
•
Updated
Jul 8, 2024
•
6
carterprince/google-gemma-2-27b-it-ortho-Q4_K_S-GGUF
Updated
Jul 8, 2024
•
4
•
1
mrmage/Qwen2-0.5B-Instruct-Q4_K_M-GGUF
Text Generation
•
Updated
Jul 8, 2024
•
1
mrmage/Qwen2-0.5B-Instruct-Q3_K_M-GGUF
Text Generation
•
Updated
Jul 8, 2024
•
7
ZappY-AI/medllama3-v20-Q4_K_M-GGUF
Updated
Jul 8, 2024
•
4
martintomov/gemma-2-27b-it-Q8_0-GGUF
Text Generation
•
Updated
Jul 8, 2024
•
2
jorismathijssen/t5-base-Q4_K_M-GGUF
Translation
•
Updated
Jul 8, 2024
•
8
•
1
faceradix/Daredevil-8B-abliterated-Q4_K_M-GGUF
Updated
Jul 8, 2024
•
1
martintomov/gemma-2-9b-it-Q8_0-GGUF
Text Generation
•
Updated
Jul 8, 2024
•
1
Nialixus/Meta-Llama-3-8B-Q4_K_M-GGUF
Text Generation
•
Updated
Jul 8, 2024
saejoon/SOLAR-10.7B-Instruct-v1.0-Q4_K_M-GGUF
Updated
Jul 9, 2024
aifeifei798/Meta-Llama-3-8B-Instruct-Q5_K_M-GGUF
Text Generation
•
Updated
Jul 9, 2024
•
1
NikolayKozloff/bella-2-8b-Q8_0-GGUF
Text Generation
•
Updated
Jul 9, 2024
•
3
•
1
NikolayKozloff/Storm-7B-Q8_0-GGUF
Updated
Jul 9, 2024
•
1
•
2
NikolayKozloff/Einstein-v7-Qwen2-7B-Q8_0-GGUF
Updated
Jul 9, 2024
•
4
•
1
jeiku/qwen2-7b-magpie300k_filtered_epoch2-Q4_K_M-GGUF
Updated
Jul 9, 2024
•
4
Previous
1
...
83
84
85
86
87
...
100
Next