Models: 305

Active filters: quantization

avinashhm/Llama-3.1-Nemotron-Nano-4B-v1.1-GPTQ

Text Generation • Updated 2 days ago • 32

raul-delarosa99/bert-base-multilingual-cased-ner-es-onnx-static-int8

Token Classification • Updated 2 days ago • 20

brandonbeiler/InternVL3-38B-FP8-Dynamic

Image-Text-to-Text • Updated about 12 hours ago

brandonbeiler/InternVL3-78B-FP8-Dynamic

Image-Text-to-Text • Updated about 12 hours ago

brandonbeiler/InternVL3-38B-BNB-8bit

Image-Text-to-Text • Updated about 10 hours ago