-
-
-
-
-
-
Inference Providers
Active filters:
instruct
NousResearch/Hermes-4-70B
Text Generation
•
71B
•
Updated
•
5.11k
•
•
152
NousResearch/Hermes-2-Pro-Mistral-7B-GGUF
7B
•
Updated
•
3.19k
•
237
Gryphe/Codex-24B-Small-3.2
24B
•
Updated
•
280
•
57
mlx-community/LFM2-8B-A1B-8bit-MLX
Text Generation
•
8B
•
Updated
•
249
•
2
TheBloke/Nous-Hermes-2-SOLAR-10.7B-GGUF
11B
•
Updated
•
3.59k
•
114
segolilylabs/Lily-Cybersecurity-7B-v0.2
Text Generation
•
7B
•
Updated
•
666
•
118
NousResearch/Nous-Hermes-2-Mistral-7B-DPO-GGUF
7B
•
Updated
•
10.7k
•
88
ArmurAI/Pentest_AI
Text Generation
•
7B
•
Updated
•
2.88k
•
29
aaditya/Llama3-OpenBioLLM-8B
Text Generation
•
Updated
•
6.08k
•
•
216
NousResearch/Hermes-2-Pro-Llama-3-8B-GGUF
8B
•
Updated
•
1.97k
•
162
bartowski/Pantheon-RP-1.0-8b-Llama-3-GGUF
Text Generation
•
8B
•
Updated
•
331
•
10
unsloth/mistral-7b-instruct-v0.3-bnb-4bit
Text Generation
•
4B
•
Updated
•
41.8k
•
31
Writer/Palmyra-Med-70B-32K
Text Generation
•
71B
•
Updated
•
10
•
119
Qwen/Qwen2-1.5B-Instruct-GGUF
Text Generation
•
2B
•
Updated
•
5.97k
•
27
yukiarimo/yuna-ai-v3-atomic
Text Generation
•
14B
•
Updated
•
10
NousResearch/Hermes-3-Llama-3.1-8B
Text Generation
•
8B
•
Updated
•
63.5k
•
•
361
ModelCloud/Qwen2.5-Coder-32B-Instruct-gptqmodel-4bit-vortex-v1
Text Generation
•
7B
•
Updated
•
21
•
16
bartowski/Hermes-3-Llama-3.2-3B-GGUF
Text Generation
•
3B
•
Updated
•
7.53k
•
10
mradermacher/Lily-Cybersecurity-7B-v0.2-i1-GGUF
7B
•
Updated
•
827
•
2
NousResearch/DeepHermes-3-Mistral-24B-Preview-GGUF
24B
•
Updated
•
219
•
32
INSAIT-Institute/MamayLM-Gemma-2-9B-IT-v0.1
Text Generation
•
9B
•
Updated
•
584
•
•
22
NousResearch/Hermes-4-405B
Text Generation
•
406B
•
Updated
•
847
•
•
69
sequelbox/Qwen3-4B-Thinking-2507-DAG-Reasoning
Text Generation
•
4B
•
Updated
•
362
•
5
NousResearch/Hermes-4-14B
Text Generation
•
425k
•
Updated
•
3.28k
•
88
cpatonn/Hermes-4-14B-AWQ-4bit
Text Generation
•
4B
•
Updated
•
130
•
1
Mungert/Hermes-4-14B-GGUF
15B
•
Updated
•
789
•
2
samunder12/llama-3.1-8b-Rp-tadashinu-gguf
Text Generation
•
8B
•
Updated
•
432
•
5
Ellbendls/Qwen-3-4b-Text_to_SQL-GGUF
Text Generation
•
4B
•
Updated
•
3.67k
•
3
mlx-community/granite-4.0-h-tiny-5bit-MLX
Text Generation
•
1B
•
Updated
•
206
•
2
TimesLast/Qwen3-4B-Thinking-2507-Esper3.1-Q6_K-GGUF
Text Generation
•
4B
•
Updated
•
13
•
1