-
-
-
-
-
-
Inference Providers
Active filters:
8-bit
microsoft/bitnet-b1.58-2B-4T
Text Generation
•
0.8B
•
Updated
•
4.93k
•
1.15k
mlx-community/XBai-o4-8bit
Text Generation
•
33B
•
Updated
•
237
•
5
mlx-community/GLM-4.5-Air-8bit
Text Generation
•
107B
•
Updated
•
1.95k
•
4
MaziyarPanahi/Llama-3.2-1B-Instruct-GGUF
Text Generation
•
1B
•
Updated
•
153k
•
16
nvidia/DeepSeek-R1-FP4-v2
Text Generation
•
394B
•
Updated
•
49
•
3
ByteDance-Seed/Seed-X-PPO-7B-GPTQ-Int8
Translation
•
8B
•
Updated
•
627
•
5
MaziyarPanahi/Llama-3.3-70B-Instruct-GGUF
Text Generation
•
71B
•
Updated
•
184k
•
16
MaziyarPanahi/Qwen3-1.7B-GGUF
Text Generation
•
2B
•
Updated
•
154k
•
4
lmstudio-community/Qwen3-30B-A3B-Instruct-2507-MLX-8bit
Text Generation
•
31B
•
Updated
•
159k
•
2
cypher-hritam/gujjuGPT-v1
Text Generation
•
7B
•
Updated
•
17
•
2
QuantTrio/Qwen3-30B-A3B-Instruct-2507-GPTQ-Int8
Text Generation
•
8B
•
Updated
•
670
•
2
MaziyarPanahi/Meta-Llama-3-8B-Instruct-GGUF
Text Generation
•
8B
•
Updated
•
168k
•
94
MaziyarPanahi/Mistral-Nemo-Instruct-2407-GGUF
Text Generation
•
12B
•
Updated
•
161k
•
49
MaziyarPanahi/Meta-Llama-3.1-8B-Instruct-GGUF
Text Generation
•
8B
•
Updated
•
151k
•
23
HF1BitLLM/Llama3-8B-1.58-100B-tokens
Text Generation
•
3B
•
Updated
•
1.5k
•
192
MaziyarPanahi/solar-pro-preview-instruct-GGUF
Text Generation
•
22B
•
Updated
•
148k
•
26
MaziyarPanahi/Qwen2.5-1.5B-Instruct-GGUF
Text Generation
•
2B
•
Updated
•
149k
•
7
MaziyarPanahi/Llama-3.2-3B-Instruct-GGUF
Text Generation
•
3B
•
Updated
•
149k
•
14
MaziyarPanahi/MistralNemoTiny-GGUF
Text Generation
•
5B
•
Updated
•
64
•
2
MaziyarPanahi/Mistral-Large-Instruct-2411-GGUF
Text Generation
•
123B
•
Updated
•
148k
•
2
PrunaAI/Neo111x-falcon3-decompiler-7b-v1-bnb-8bit-smashed
7B
•
Updated
•
4
•
1
MISHANM/meta-Llama-3.3-70B-Instruct-int8
71B
•
Updated
•
76
•
1
mlx-community/DeepSeek-R1-Distill-Qwen-32B-MLX-8Bit
9B
•
Updated
•
1.34k
•
16
RedHatAI/Llama-3.3-70B-Instruct-quantized.w8a8
Text Generation
•
71B
•
Updated
•
42.5k
•
10
RedHatAI/phi-4-quantized.w8a8
Text Generation
•
15B
•
Updated
•
1.27k
•
2
MaziyarPanahi/gemma-3-1b-it-GGUF
Text Generation
•
1.0B
•
Updated
•
162k
•
7
MaziyarPanahi/gemma-3-4b-it-GGUF
Text Generation
•
4B
•
Updated
•
155k
•
10
shisa-ai/shisa-v2-unphi-14b-W8A8-INT8
15B
•
Updated
•
36
•
1
nvidia/Llama-4-Scout-17B-16E-Instruct-FP4
62B
•
Updated
•
1.24k
•
1
MaziyarPanahi/Qwen3-4B-GGUF
Text Generation
•
4B
•
Updated
•
164k
•
5