Active filters: 8-bit
codys12/Qwen3-8B-BitNet • Text Generation • 3B • 435 • 11
microsoft/bitnet-b1.58-2B-4T • Text Generation • 0.8B • 9.49k • 1.13k
mlx-community/DiffuCoder-7B-cpGRPO-8bit • Text Generation • 8B • 1.13k • 6
mlx-community/SmolLM3-3B-8bit • Text Generation • 0.9B • 375 • 5
mlx-community/Jan-nano-8bit • Text Generation • 1B • 477 • 5
MaziyarPanahi/Meta-Llama-3-8B-Instruct-GGUF • Text Generation • 8B • 195k • 93
MaziyarPanahi/Yi-Coder-1.5B-Chat-GGUF • Text Generation • 1B • 181k • 13
baidu/ERNIE-4.5-300B-A47B-2Bits-Paddle • Text Generation • 92B • 64 • 12
lmstudio-community/Devstral-Small-2507-MLX-8bit • Text Generation • 24B • 92.4k • 2
MaziyarPanahi/ChatMusician-GGUF • Text Generation • 7B • 321 • 14
MaziyarPanahi/WizardLM-2-7B-GGUF • Text Generation • 7B • 179k • 82
RichardErkhov/gp-tar4_-_QA_FineTuned_ArabianGPT-03B-8bits • Text Generation • 0.4B • 10 • 1
MaziyarPanahi/Mistral-7B-Instruct-v0.3-GGUF • Text Generation • 7B • 213k • 106
RedHatAI/Meta-Llama-3-70B-Instruct-quantized.w8a16 • Text Generation • 19B • 1.05k • 5
meta-llama/Llama-Guard-3-8B-INT8 • Text Generation • 8B • 4.18k • 37
MaziyarPanahi/Meta-Llama-3.1-70B-Instruct-GGUF • Text Generation • 71B • 177k • 40
Qwen/Qwen2-VL-7B-Instruct-GPTQ-Int8 • Image-Text-to-Text • 3B • 1.61k • 30
MaziyarPanahi/Qwen2.5-1.5B-Instruct-GGUF • Text Generation • 2B • 177k • 6
RedHatAI/Qwen2.5-7B-Instruct-quantized.w8a8 • Text Generation • 8B • 1.28k • 2
MaziyarPanahi/MistralNemoTiny-GGUF • Text Generation • 5B • 52 • 1
Qwen/Qwen2.5-Coder-32B-Instruct-GPTQ-Int8 • Text Generation • 10B • 2.85k • 21
MaziyarPanahi/Llama-3.3-70B-Instruct-GGUF • Text Generation • 71B • 198k • 14
tiiuae/Falcon3-10B-Instruct-1.58bit • Text Generation • 3B • 1.18k • 19
mlx-community/Qwen3-235B-A22B-8bit • Text Generation • 66B • 112k • 3
ccckblaze/Seed-Coder-8B-Instruct-MLX • 2B • 3 • 1
oscarstories/lorastral24b_0604 • Text Generation • 24B • 101 • 2
Qwen/Qwen3-32B-MLX-8bit • Text Generation • 9B • 435 • 5
LogicBombaklot/Kimi-Dev-72B-mlx-8Bit • 20B • 580 • 2
RedHatAI/Qwen3-32B-NVFP4 • Text Generation • 19B • 130 • 1
mlx-community/ERNIE-4.5-21B-A3B-PT-8bit • Text Generation • 22B • 442 • 2