Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

Edit Models filters

Apps
Backyard AI
Jan
Jellybox
llama.cpp
LM Studio
LocalAI
Msty
node-llama-cpp
Ollama
RecurseChat
Sanctum
TGI
vLLM

Apps with no match

Draw Things
DiffusionBee
Invoke
JoyFusion
MLX LM
Inference Providers

Inference Providers with no match

Featherless AI
Replicate
Groq
Cerebras
Nscale
Novita
fal
Cohere
Nebius AI
Fireworks
SambaNova
Hyperbolic
Together AI
HF Inference API
Misc
quantization
Inference Endpoints
4-bit precision
text-generation-inference
8-bit precision
custom_code
Eval Results
Merge

Misc with no match

text-embeddings-inference
Carbon Emissions
Mixture of Experts

Models

305
Full-text search
Active filters: quantization

avinashhm/Llama-3.1-Nemotron-Nano-4B-v1.1-GPTQ

Text Generation • Updated 2 days ago • 32

raul-delarosa99/bert-base-multilingual-cased-ner-es-onnx-static-int8

Token Classification • Updated 2 days ago • 20

brandonbeiler/InternVL3-38B-FP8-Dynamic

Image-Text-to-Text • Updated about 12 hours ago

brandonbeiler/InternVL3-78B-FP8-Dynamic

Image-Text-to-Text • Updated about 12 hours ago

brandonbeiler/InternVL3-38B-BNB-8bit

Image-Text-to-Text • Updated about 10 hours ago
  • Previous
  • 1
  • ...
  • 9
  • 10
  • 11
  • Next
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs