nm-research commited on
Commit
c749591
·
verified ·
1 Parent(s): 1460134

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -42,7 +42,7 @@ tags:
42
 
43
  ### Model Optimizations
44
 
45
- This model was obtained by quantizing the weights of [phi-4]https://huggingface.co/microsoft/phi-4) to INT4 data type.
46
  This optimization reduces the number of bits per parameter from 16 to 4, reducing the disk size and GPU memory requirements by approximately 75%.
47
 
48
  Only the weights of the linear operators within transformers blocks are quantized.
 
42
 
43
  ### Model Optimizations
44
 
45
+ This model was obtained by quantizing the weights of [phi-4](https://huggingface.co/microsoft/phi-4) to INT4 data type.
46
  This optimization reduces the number of bits per parameter from 16 to 4, reducing the disk size and GPU memory requirements by approximately 75%.
47
 
48
  Only the weights of the linear operators within transformers blocks are quantized.