-
-
-
-
-
-
Inference Providers
Active filters:
int4
ISTA-DASLab/gemma-3-27b-it-GPTQ-4b-128g
Image-Text-to-Text
•
5B
•
Updated
•
11.6k
•
37
RedHatAI/Llama-3.3-70B-Instruct-quantized.w4a16
Text Generation
•
11B
•
Updated
•
8.08k
•
2
RedHatAI/DeepSeek-R1-Distill-Qwen-1.5B-quantized.w4a16
Text Generation
•
0.6B
•
Updated
•
822
•
1
RedHatAI/phi-4-quantized.w4a16
Text Generation
•
3B
•
Updated
•
2.44k
•
3
ISTA-DASLab/gemma-3-4b-it-GPTQ-4b-128g
Image-Text-to-Text
•
2B
•
Updated
•
1.66k
•
6
ISTA-DASLab/gemma-3-12b-it-GPTQ-4b-128g
Image-Text-to-Text
•
3B
•
Updated
•
10k
•
6
RedHatAI/Mistral-Small-3.1-24B-Instruct-2503-quantized.w4a16
Image-Text-to-Text
•
5B
•
Updated
•
2.42k
•
7
Advantech-EIOT/intel_llama-2-chat-7b
Text Generation
•
Updated
•
4
RedHatAI/zephyr-7b-beta-marlin
Text Generation
•
1B
•
Updated
•
159
RedHatAI/TinyLlama-1.1B-Chat-v1.0-marlin
Text Generation
•
0.3B
•
Updated
•
5.47k
•
1
RedHatAI/OpenHermes-2.5-Mistral-7B-marlin
Text Generation
•
1B
•
Updated
•
127
•
2
RedHatAI/Nous-Hermes-2-Yi-34B-marlin
Text Generation
•
5B
•
Updated
•
9
•
5
ecastera/ecastera-eva-westlake-7b-spanish-int4-gguf
7B
•
Updated
•
10
•
2
softmax/Llama-2-70b-chat-hf-marlin
Text Generation
•
10B
•
Updated
•
4
softmax/falcon-180B-chat-marlin
Text Generation
•
26B
•
Updated
•
9
study-hjt/Meta-Llama-3-8B-Instruct-GPTQ-Int4
Text Generation
•
2B
•
Updated
•
4
study-hjt/Meta-Llama-3-70B-Instruct-GPTQ-Int4
Text Generation
•
11B
•
Updated
•
4
•
6
study-hjt/Meta-Llama-3-70B-Instruct-AWQ
Text Generation
•
11B
•
Updated
•
4
study-hjt/Qwen1.5-110B-Chat-GPTQ-Int4
Text Generation
•
17B
•
Updated
•
5
•
2
study-hjt/CodeQwen1.5-7B-Chat-GPTQ-Int4
Text Generation
•
2B
•
Updated
•
5
study-hjt/Qwen1.5-110B-Chat-AWQ
Text Generation
•
17B
•
Updated
•
5
modelscope/Yi-1.5-34B-Chat-AWQ
Text Generation
•
5B
•
Updated
•
26
•
1
modelscope/Yi-1.5-6B-Chat-GPTQ
Text Generation
•
1B
•
Updated
•
11
modelscope/Yi-1.5-6B-Chat-AWQ
Text Generation
•
1B
•
Updated
•
7
modelscope/Yi-1.5-9B-Chat-GPTQ
Text Generation
•
2B
•
Updated
•
5
•
1
modelscope/Yi-1.5-9B-Chat-AWQ
Text Generation
•
2B
•
Updated
•
14
modelscope/Yi-1.5-34B-Chat-GPTQ
Text Generation
•
5B
•
Updated
•
4
•
1
jojo1899/Phi-3-mini-128k-instruct-ov-int4
Text Generation
•
Updated
•
4
jojo1899/Llama-2-13b-chat-hf-ov-int4
Text Generation
•
Updated
•
4
jojo1899/Mistral-7B-Instruct-v0.2-ov-int4
Text Generation
•
Updated
•
3