EmbeddedLLM/deepseek-r1-FP8-Dynamic • 671B
EmbeddedLLM/Qwen2.5-1.5B-FP8-Dynamic • 2B • 77
EmbeddedLLM/Qwen2.5-1.5B-Instruct-FP8-Dynamic • 2B • 349
EmbeddedLLM/Qwen2.5-32B-Instruct-FP8-Dynamic • 33B • 46
EmbeddedLLM/Qwen2.5-7B-Instruct-FP8-Dynamic • 8B • 5
EmbeddedLLM/deepseekv3-lite-ci
EmbeddedLLM/Qwen_Qwen2.5-32B-Instruct-FP8-Dynamic • 33B • 44
EmbeddedLLM/Llama-3.1-8B-Instruct-w_fp8_per_channel_sym • Text Generation • 8B • 32
EmbeddedLLM/Nexusflow_Athena-V2-Agent-OCP-FP8-Quark • 73B • 16
EmbeddedLLM/Nexusflow_Athena-V2-Chat-OCP-FP8-Quark • 73B • 11
EmbeddedLLM/Qwen2.5-72B-Instruct-OCP-FP8-Quark • 73B • 8
EmbeddedLLM/ELLM_Star
EmbeddedLLM/bge-m3-int4-sym-ov
EmbeddedLLM/bge-m3-int4-ov • 43 • 1
EmbeddedLLM/Qwen2.5-32B-Instruct-int4-sym-ov • 13
EmbeddedLLM/Qwen2.5-14B-Instruct-int4-sym-ov
EmbeddedLLM/vLLM-AMD-flash-attn-debug
EmbeddedLLM/Llama-Guard-3-1B-int4-sym-ov • 10
EmbeddedLLM/Llama-3.2-1B-Instruct-int4-sym-ov • 15
EmbeddedLLM/Llama-3.2-3B-Instruct-int4-sym-ov • 13
EmbeddedLLM/Llama-Guard-3-1B-int4-asym-ov • 42
EmbeddedLLM/Llama-3.2-1B-Instruct-int4-asym-ov • 11
EmbeddedLLM/Llama-3.2-3B-Instruct-int4-asym-ov • 11
EmbeddedLLM/Qwen2.5-7B-Instruct-int4-sym-ov
EmbeddedLLM/Qwen2.5-3B-Instruct-int4-sym-ov
EmbeddedLLM/Qwen2.5-1.5B-Instruct-int4-sym-ov
EmbeddedLLM/Qwen2.5-0.5B-Instruct-int4-sym-ov • 12
EmbeddedLLM/Llama-3.1-8B-Instruct-int4-asym-ov
EmbeddedLLM/Llama-3.1-70B-Instruct-int4-asym-ov
EmbeddedLLM/Phi-3.5-vision-instruct-int4-ov • 35