Edit Models filters

Apps

Docker Model Runner

Inference Providers

HF Inference API

Misc

Inference Endpoints

text-generation-inference

4-bit precision

8-bit precision

text-embeddings-inference

Carbon Emissions

Mixture of Experts

Models

17

Full-text search

Active filters: efficiency

Shahradmz/HyenaDistilledPythia70M

Text Generation • Updated Jan 10, 2024

sapienzanlp/maverick-mes-litbank

Updated Aug 12, 2024 • 6 • 4

1-800-LLMs/Qwen-2.5-14B-Hindi

15B • Updated Feb 13 • 2

1024m/PHI-4-Hindi

15B • Updated Feb 13 • 2 • 1

1024m/PHI-4-Hindi-LoRA

large-traversaal/Mantra-14B

15B • Updated Apr 13 • 6 • 2

DrishtiSharma/qwen-2.5-14b

large-traversaal/Qwen-2.5-14B-Hindi

15B • Updated Mar 3 • 5 • 4

mradermacher/Qwen-2.5-14B-Hindi-GGUF

15B • Updated 12 days ago • 41 • 1

sst12345/CoRe2

Text-to-Image • Updated Mar 18 • 2

mradermacher/Mantra-14B-GGUF

15B • Updated Jul 11 • 77

mradermacher/Mantra-14B-i1-GGUF

15B • Updated Jul 11 • 115

codelion/Qwen3-0.6B-accuracy-recovery-lora

Text Generation • Updated about 1 month ago • 8 • 1

prompterminal/gpt2-compressed

Text Generation • 2B • Updated 21 days ago • 8

GY2233/R2R_router_qwen3-1.7b

Text Classification • Updated 21 days ago • 3

GY2233/R2R_router_qwen3-4b

Text Classification • Updated 20 days ago • 6

GY2233/R2R_router_qwenr1

Text Classification • Updated 19 days ago • 3