Active filters: awq
mratsim/MiniMax-M2.1-FP8-INT4-AWQ • Text Generation • Updated • 5.69k • 39
openbmb/MiniCPM-o-4_5-awq • Any-to-Any • 9B • Updated • 931 • 15
mratsim/Minimax-M2.5-BF16-INT4-AWQ • Text Generation • 39B • Updated • 6
Qwen/Qwen2.5-32B-Instruct-AWQ • Text Generation • 33B • Updated • 902k • 94
Qwen/Qwen2.5-Coder-14B-Instruct-AWQ • Text Generation • 15B • Updated • 85.3k • 15
Text Generation • 15B • Updated • 653k • 55
Text Generation • Updated • 17.5k • 25
mratsim/MiniMax-M2.1-BF16-INT4-AWQ • Text Generation • 39B • Updated • 4.64k • 7
QuantTrio/GLM-4.7-Flash-AWQ • Text Generation • 31B • Updated • 136k • 5
bullpoint/Qwen3-Coder-Next-AWQ-4bit • Text Generation • 14B • Updated • 95.5k • 4
TheBloke/Sheep-Duck-Llama-2-70B-AWQ • Text Generation • 69B • Updated • 4 • 1
TheBloke/sheep-duck-llama-2-13B-AWQ • Text Generation • 13B • Updated • 8 • 2
TheBloke/deepseek-coder-1.3b-instruct-AWQ • Text Generation • 1B • Updated • 188 • 5
Text Generation • 7B • Updated • 2.75k • 4
modelscope/Yi-1.5-34B-Chat-AWQ • Text Generation • 34B • Updated • 64 • 2
TechxGenus/DeepSeek-Coder-V2-Lite-Instruct-AWQ • Text Generation • 16B • Updated • 6.08k • 8
TechxGenus/DeepSeek-Coder-V2-Lite-Base-AWQ • Text Generation • 16B • Updated • 25 • 3
hugging-quants/Meta-Llama-3.1-8B-Instruct-AWQ-INT4 • Text Generation • Updated • 497k • 87
Qwen/Qwen2.5-7B-Instruct-AWQ • Text Generation • Updated • 237k • 36
lurker18/Llama_3.1_8B_Instruct_AWQ_4bit • Text Generation • 8B • Updated • 78 • 1
Text Generation • 33B • Updated • 335k • 125
Text Generation • Updated • 113k • 37
Text Generation • 4B • Updated • 126k • 24
twhitworth/gpt-oss-120b-awq-w4a16 • 117B • Updated • 11.8k • 21
openbmb/MiniCPM-V-4_5-AWQ • Image-Text-to-Text • 9B • Updated • 3.41k • 13
QuantTrio/Qwen3-VL-235B-A22B-Instruct-AWQ • Text Generation • 236B • Updated • 6.86k • 13
QuantTrio/Qwen3-VL-235B-A22B-Thinking-AWQ • Text Generation • 236B • Updated • 1.2k • 7
QuantTrio/Qwen3-VL-30B-A3B-Instruct-AWQ • Text Generation • 31B • Updated • 475k • 37
QuantTrio/Qwen3-VL-32B-Instruct-AWQ • Image-Text-to-Text • 33B • Updated • 87.3k • 9
QuantTrio/DeepSeek-V3.2-AWQ • Text Generation • 685B • Updated • 58.2k • 11