Update README.md
Browse files
README.md
CHANGED
@@ -15,12 +15,12 @@ pipeline_tag: text-generation
|
|
15 |
|
16 |
| | Quant type | File Size | ~Vram*|
|
17 |
| -------- | ---------- | --------- | -------- |
|
18 |
-
| [phi-4 hb8 3bpw](https://huggingface.co/cmh/phi-4_exl2/tree/hb8_3bpw) | 3
|
19 |
-
| [phi-4 hb8 4bpw](https://huggingface.co/cmh/phi-4_exl2/tree/hb8_4bpw) | 4
|
20 |
-
| [phi-4 hb8 5bpw](https://huggingface.co/cmh/phi-4_exl2/tree/hb8_5bpw) | 5
|
21 |
-
| [phi-4 hb8 6bpw](https://huggingface.co/cmh/phi-4_exl2/tree/hb8_6bpw) | 6
|
22 |
-
| [phi-4 hb8 7bpw](https://huggingface.co/cmh/phi-4_exl2/tree/hb8_7bpw) | 7
|
23 |
-
| [phi-4 hb8 8bpw](https://huggingface.co/cmh/phi-4_exl2/tree/hb8_8bpw) | 8
|
24 |
|
25 |
<sub>*approximate value at **16k context, FP16 cache**.<sup>
|
26 |
|
|
|
15 |
|
16 |
| | Quant type | File Size | ~Vram*|
|
17 |
| -------- | ---------- | --------- | -------- |
|
18 |
+
| [phi-4 hb8 3bpw](https://huggingface.co/cmh/phi-4_exl2/tree/hb8_3bpw) | 3 bits per weight | 6.66 GB | **10,3 GB** |
|
19 |
+
| [phi-4 hb8 4bpw](https://huggingface.co/cmh/phi-4_exl2/tree/hb8_4bpw) | 4 bits per weight | 8.36 GB | **11,9 GB** |
|
20 |
+
| [phi-4 hb8 5bpw](https://huggingface.co/cmh/phi-4_exl2/tree/hb8_5bpw) | 5 bits per weight | 10.1 GB | **13,5 GB** |
|
21 |
+
| [phi-4 hb8 6bpw](https://huggingface.co/cmh/phi-4_exl2/tree/hb8_6bpw) | 6 bits per weight | 11.8 GB | **15,1 GB** |
|
22 |
+
| [phi-4 hb8 7bpw](https://huggingface.co/cmh/phi-4_exl2/tree/hb8_7bpw) | 7 bits per weight | 13.5 GB | **16,7 GB** |
|
23 |
+
| [phi-4 hb8 8bpw](https://huggingface.co/cmh/phi-4_exl2/tree/hb8_8bpw) | 8 bits per weight | 15.2 GB | **18,2 GB** |
|
24 |
|
25 |
<sub>*approximate value at **16k context, FP16 cache**.<sup>
|
26 |
|