-
-
-
-
-
-
Inference Providers
Active filters:
web-llm
mlc-ai/stablelm-2-zephyr-1_6b-q0f32-MLC
mlc-ai/stablelm-2-zephyr-1_6b-q4f32_1-MLC
Updated
•
26
mlc-ai/stablelm-2-zephyr-1_6b-q4f16_1-MLC
Updated
•
82
mlc-ai/Qwen1.5-0.5B-Chat-q3f16_1-MLC
mlc-ai/Phi-3-mini-4k-instruct-q4f32_1-MLC
Updated
•
488
mlc-ai/Llama-3-70B-Instruct-q4f16_1-MLC
mlc-ai/Phi-3-mini-128k-instruct-q0f16-MLC
Updated
•
2
•
1
mlc-ai/Mistral-7B-Instruct-v0.3-q0f16-MLC
mlc-ai/Mistral-7B-Instruct-v0.3-q4f16_1-MLC
Updated
•
3.56k
mlc-ai/Mistral-7B-Instruct-v0.3-q4f32_1-MLC
Updated
•
108
mlc-ai/Phi-3-mini-128k-instruct-q4f16_1-MLC
Updated
•
3
•
1
mlc-ai/Phi-3-mini-128k-instruct-q4f32_1-MLC
mlc-ai/Llama-3-8B-Instruct-q4f32_1-MLC
Updated
•
897
•
1
mlc-ai/Llama-3-8B-Instruct-q0f16-MLC
mlc-ai/Llama-3-8B-Instruct-q4f16_1-MLC
Updated
•
298
•
6
mlc-ai/Llama-3-8B-Instruct-q3f16_1-MLC
Updated
•
132
•
1
mlc-ai/Qwen1.5-0.5B-Chat-q4f32_1-MLC
mlc-ai/Qwen1.5-0.5B-Chat-q4f16_1-MLC
Updated
•
279
mlc-ai/Qwen1.5-0.5B-Chat-q0f16-MLC
mlc-ai/Qwen1.5-1.8B-Chat-q4f32_1-MLC
mlc-ai/Qwen1.5-1.8B-Chat-q4f16_1-MLC
mlc-ai/Qwen1.5-1.8B-Chat-q0f16-MLC
mlc-ai/TinyLlama-1.1B-Chat-v1.0-q4f32_1-MLC
Updated
•
238
mlc-ai/TinyLlama-1.1B-Chat-v1.0-q0f16-MLC
mlc-ai/Mistral-7B-Instruct-v0.3-q3f16_1-MLC
Updated
•
2.84k
mlc-ai/Mixtral-8x7B-Instruct-v0.1-q4f16_1-MLC
Updated
•
2
•
1
mlc-ai/Mixtral-8x7B-Instruct-v0.1-q4f32_1-MLC
mlc-ai/Mixtral-8x7B-Instruct-v0.1-q0f16-MLC
mlc-ai/Hermes-2-Pro-Llama-3-8B-q4f32_1-MLC
mlc-ai/Hermes-2-Pro-Llama-3-8B-q4f16_1-MLC
Updated
•
105