Update README.md

Files changed (1) hide show

README.md CHANGED Viewed

@@ -169,18 +169,6 @@ Hello! As an AI, I don't have consciousness in the way humans do, but I am fully
 We rely on [lm-evaluation-harness](https://github.com/EleutherAI/lm-evaluation-harness) to evaluate the quality of the quantized model.
-Need to install lm-eval from source: https://github.com/EleutherAI/lm-evaluation-harness#install
-## baseline
-```Shell
-lm_eval --model hf --model_args pretrained=microsoft/Phi-4-mini-instruct --tasks hellaswag --device cuda:0 --batch_size 8
-```
-## int8 dynamic activation and int4 weight quantization (8da4w)
-```Shell
-lm_eval --model hf --model_args pretrained=pytorch/Phi-4-mini-instruct-8da4w --tasks hellaswag --device cuda:0 --batch_size 8
-```
 | Benchmark                        |                |                           |
 |----------------------------------|----------------|---------------------------|
 |                                  | Phi-4-mini-ins | Phi-4-mini-instruct-8da4w |
@@ -203,6 +191,21 @@ lm_eval --model hf --model_args pretrained=pytorch/Phi-4-mini-instruct-8da4w --t
 | Mathqa (0-shot)                  | 42.31          | 36.95                     |
 | **Overall**                      | 55.35          | 48.45                     |
 # Exporting to ExecuTorch

 We rely on [lm-evaluation-harness](https://github.com/EleutherAI/lm-evaluation-harness) to evaluate the quality of the quantized model.
 | Benchmark                        |                |                           |
 |----------------------------------|----------------|---------------------------|
 |                                  | Phi-4-mini-ins | Phi-4-mini-instruct-8da4w |
 | Mathqa (0-shot)                  | 42.31          | 36.95                     |
 | **Overall**                      | 55.35          | 48.45                     |
+<details>
+<summary> Reproduce Model Quality Results </summary>
+Need to install lm-eval from source: https://github.com/EleutherAI/lm-evaluation-harness#install
+## baseline
+```Shell
+lm_eval --model hf --model_args pretrained=microsoft/Phi-4-mini-instruct --tasks hellaswag --device cuda:0 --batch_size 8
+```
+## int8 dynamic activation and int4 weight quantization (8da4w)
+```Shell
+lm_eval --model hf --model_args pretrained=pytorch/Phi-4-mini-instruct-8da4w --tasks hellaswag --device cuda:0 --batch_size 8
+```
+</details>
 # Exporting to ExecuTorch