Update README.md
Browse files
README.md
CHANGED
@@ -7,6 +7,11 @@ base_model:
|
|
7 |
- microsoft/phi-4
|
8 |
pipeline_tag: text-generation
|
9 |
---
|
|
|
|
|
|
|
|
|
|
|
10 |
| Filename | Quant type | File Size |
|
11 |
| -------- | ---------- | --------- |
|
12 |
| [phi-4_hb8_3bpw](https://huggingface.co/cmh/phi-4_exl2/tree/hb8_3bpw) | 3.00 bits per weight | 6.66 GB |
|
|
|
7 |
- microsoft/phi-4
|
8 |
pipeline_tag: text-generation
|
9 |
---
|
10 |
+
|
11 |
+
# Phi-4 converted for ExLlamaV2
|
12 |
+
|
13 |
+
[ExLlamaV2 is an inference library for running local LLMs on modern consumer GPUs.](https://github.com/turboderp-org/exllamav2)
|
14 |
+
|
15 |
| Filename | Quant type | File Size |
|
16 |
| -------- | ---------- | --------- |
|
17 |
| [phi-4_hb8_3bpw](https://huggingface.co/cmh/phi-4_exl2/tree/hb8_3bpw) | 3.00 bits per weight | 6.66 GB |
|