EmbeddedLLM/bge-reranker-v2-m3-int4-sym-ov
EmbeddedLLM/bge-reranker-v2-m3-int4-ov
EmbeddedLLM/RedPajama-INCITE-Instruct-3B-v1-int4-sym-ov
EmbeddedLLM/neural-chat-7b-v1-1-int4-sym-ov
EmbeddedLLM/dolly-v2-3b-int4-sym-ov
EmbeddedLLM/Mistral-7B-Instruct-v0.3-int4-sym-ov
EmbeddedLLM/Phi-3-medium-4k-instruct-int4-sym-ov
EmbeddedLLM/Phi-3-mini-4k-instruct-int4-sym-ov
EmbeddedLLM/Qwen2-7B-Instruct-int4-sym-ov
EmbeddedLLM/Phi-3-medium-128k-instruct-int4-sym-ov
EmbeddedLLM/Meta-Llama-3.1-8B-Instruct-int4-sym-ov
EmbeddedLLM/Phi-3-mini-128k-instruct-int4-sym-ov
EmbeddedLLM/bge-reranker-v2-m3-onnx-o3-cpu
EmbeddedLLM/bge-m3-onnx-o2-cpu
EmbeddedLLM/Phi-3-mini-128k-instruct-onnx-directml (Text Generation)
EmbeddedLLM/Phi-3-mini-4k-instruct-onnx-directml (Text Generation)
EmbeddedLLM/gemma-7b-it-int4-onnx-directml
EmbeddedLLM/openchat-3.6-8b-20240522-onnx-cpu-int4-rtn-block-32-acc-level-4
EmbeddedLLM/openchat-3.6-8b-20240522-onnx-cpu-int4-rtn-block-32
EmbeddedLLM/mistral-7b-instruct-v0.3-onnx-cpu-int4-rtn-block-32
EmbeddedLLM/openchat-3.6-8b-20240522-int4-onnx-directml
EmbeddedLLM/01-ai_Yi-1.5-6B-Chat-int4-onnx-directml
EmbeddedLLM/Starling-LM-7b-beta-int4-onnx-directml
EmbeddedLLM/mistral-7b-instruct-v0.3-onnx-cpu-int4-rtn-block-32-acc-level-4
EmbeddedLLM/gemma-2b-it-int4-onnx-directml
EmbeddedLLM/mistralai_Mistral-7B-Instruct-v0.3-int4-onnx-directml
EmbeddedLLM/Phi-3-mini-4k-instruct-062024-int4-onnx-directml (Text Generation)
EmbeddedLLM/Phi-3-mini-128k-instruct-onnx-cpu-int4-rtn-block-32 (Text Generation)
EmbeddedLLM/Phi-3-mini-4k-instruct-onnx-cpu-int4-rtn-block-32 (Text Generation)
EmbeddedLLM/Phi-3-mini-4k-instruct-onnx-cpu-int4-rtn-block-32-acc-level-4 (Text Generation)