Active filters: 4-bit
Qwen/Qwen3-30B-A3B-GPTQ-Int4 • Text Generation • 5B params • 51.9k downloads • 34 likes
QuantTrio/Qwen3-VL-30B-A3B-Instruct-AWQ • Text Generation • 31B params • 49.2k downloads • 22 likes
unsloth/Qwen2.5-VL-7B-Instruct-unsloth-bnb-4bit • Image-Text-to-Text • 5B params • 83.8k downloads • 48 likes
dousery/medical-reasoning-gpt-oss-20b • Text Generation • 21B params • 1.41k downloads • 38 likes
lmstudio-community/Qwen3-VL-4B-Instruct-MLX-4bit • Image-Text-to-Text • 1B params • 19.5k downloads • 3 likes
MaziyarPanahi/Meta-Llama-3.1-8B-Instruct-GGUF • Text Generation • 8B params • 93.4k downloads • 28 likes
huihui-ai/Huihui-GLM-4.5-Air-abliterated-mlx-mxfp4 • Text Generation • 107B params • 1.06k downloads • 5 likes
mlx-community/Qwen3-VL-30B-A3B-Instruct-4bit • Image-Text-to-Text • 2.02k downloads • 5 likes
unsloth/Qwen3-VL-8B-Instruct-bnb-4bit • Image-Text-to-Text • 9B params • 124 downloads • 2 likes
unsloth/Qwen3-VL-4B-Instruct-unsloth-bnb-4bit • Image-Text-to-Text • 5B params • 622 downloads • 2 likes
TheBloke/Llama-2-7B-Chat-AWQ • Text Generation • 1B params • 3.56k downloads • 24 likes
TheBloke/deepseek-coder-6.7B-instruct-AWQ • Text Generation • 1B params • 265k downloads • 18 likes
TheBloke/Qwen-7B-Chat-GPTQ • Text Generation • 2B params • 393 downloads • 3 likes
unsloth/tinyllama-bnb-4bit • Text Generation • 0.6B params • 4.35k downloads • 12 likes
TheBloke/medicine-LLM-AWQ • Text Generation • 1B params • 16 downloads • 4 likes
jiayihao03/gemma2b_code_java • Text Generation • 2B params • 7 downloads • 2 likes
alexlangshur/WizardLM-2-7B-AWQ • Text Generation • 1B params • 5 downloads • 2 likes
MaziyarPanahi/Meta-Llama-3-8B-Instruct-GGUF • Text Generation • 8B params • 121k downloads • 98 likes
unsloth/llama-3-8b-bnb-4bit • Text Generation • 5B params • 37k downloads • 201 likes
TechxGenus/Meta-Llama-3-8B-Instruct-GPTQ • Text Generation • 2B params • 4.84k downloads • 6 likes
unsloth/Phi-3-mini-4k-instruct-bnb-4bit • Text Generation • 2B params • 52.1k downloads • 35 likes
MehdiHosseiniMoghadam/AVA-Qwen1.5-7B-Chat-gptq-4bit • Text Generation • 2B params • 1 download • 1 like
webbigdata/C3TR-Adapter_gptq • Translation • 2B params • 3 downloads • 2 likes
MaziyarPanahi/Mistral-7B-Instruct-v0.3-GGUF • Text Generation • 7B params • 109k downloads • 121 likes
unsloth/Qwen2-0.5B-Instruct-bnb-4bit • Text Generation • 0.3B params • 1.62k downloads • 6 likes
unsloth/Qwen2-0.5B-bnb-4bit • Text Generation • 0.3B params • 669 downloads • 3 likes
unsloth/Mistral-Nemo-Instruct-2407-bnb-4bit • Text Generation • 7B params • 4.62k downloads • 31 likes
hugging-quants/Meta-Llama-3.1-8B-Instruct-AWQ-INT4 • Text Generation • 2B params • 103k downloads • 77 likes
unsloth/Meta-Llama-3.1-8B-Instruct-bnb-4bit • Text Generation • 5B params • 903k downloads • 85 likes
Statuo/MN-12b-ArliAI-RPMax-EXL2-4bpw • 5 downloads • 2 likes