---
license: apache-2.0
---

# GreenBit Yi

This is GreenBitAI's pretrained **2-bit** Yi 6B model. It is heavily compressed (the 2-bit variants occupy 2.5 to 3.09 GiB, versus 12.12 GiB for FP16) yet retains strong performance.

Please refer to our [GitHub page](https://github.com/GreenBitAI/low_bit_llama) for the code to run the model and for more information.

## Model Description

- **Developed by:** [GreenBitAI](https://github.com/GreenBitAI)
- **Model type:** Causal (Llama 2/Yi 6B)
- **Language(s) (NLP):** English
- **License:** [Apache 2.0](https://www.apache.org/licenses/LICENSE-2.0), [Llama 2 license agreement](https://ai.meta.com/resources/models-and-libraries/llama-downloads/)

## Zero-Shot Evaluation

| Task | Metric | FP16 | Yi-6B w4a16g32 | Yi-6B w2a16g8 | Yi-6B w2a16g32 |
|------------------|----------|--------|--------|--------|--------|
| openbookqa | acc | 0.314 | 0.324 | 0.26 | 0.228 |
| | acc_norm | 0.408 | 0.42 | 0.394 | 0.352 |
| arc_challenge | acc | 0.462 | 0.4573 | 0.4082 | 0.337 |
| | acc_norm | 0.504 | 0.483 | 0.4249 | 0.3523 |
| hellaswag | acc | 0.553 | 0.5447 | 0.5083 | 0.4326 |
| | acc_norm | 0.749 | 0.7327 | 0.6909 | 0.58 |
| piqa | acc | 0.777 | 0.7709 | 0.7535 | 0.7051 |
| | acc_norm | 0.787 | 0.7894 | 0.7655 | 0.7143 |
| arc_easy | acc | 0.777 | 0.7697 | 0.7373 | 0.6523 |
| | acc_norm | 0.774 | 0.7659 | 0.7314 | 0.6115 |
| winogrande | acc | 0.707 | 0.7095 | 0.6803 | 0.6219 |
| boolq | acc | 0.755 | 0.7648 | 0.7507 | 0.732 |
| truthfulqa_mc | mc1 | 0.29 | 0.2729 | 0.2753 | 0.219 |
| | mc2 | 0.419 | 0.4033 | 0.4156 | 0.3479 |
| anli_r1 | acc | 0.423 | 0.416 | 0.383 | 0.38 |
| anli_r2 | acc | 0.409 | 0.409 | 0.387 | 0.374 |
| anli_r3 | acc | 0.411 | 0.393 | 0.38 | 0.3475 |
| wic | acc | 0.529 | 0.545 | 0.515 | 0.5 |
| rte | acc | 0.685 | 0.7039 | 0.7436 | 0.6787 |
| record | f1 | 0.904 | 0.9011 | 0.8906 | 0.8521 |
| | em | 0.8962 | 0.8927 | 0.8819 | 0.8429 |
| Average | | 0.596 | 0.5937 | 0.5703 | 0.517 |
| wikitext2 (2048) | ppl | 5.841 | 6.01 | 6.57 | 8.06 |
| ptb (2048) | ppl | 18.93 | 19.76 | 25.9 | 35.3 |
| Model Size | GiB | 12.12 | 3.76 | 3.09 | 2.5 |
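
To read the size and accuracy figures side by side, the following minimal sketch computes each variant's compression ratio and the share of the FP16 average score it retains. The numbers are copied from the Model Size and Average rows of the table; the dictionary layout and the `summarize` helper are illustrative names chosen for this example and are not part of GreenBitAI's code.

```python
# Figures copied from the "Model Size" and "Average" rows of the table.
# The dictionary structure and helper name are illustrative assumptions.
results = {
    "FP16": {"size_gib": 12.12, "avg": 0.596},
    "w4a16g32": {"size_gib": 3.76, "avg": 0.5937},
    "w2a16g8": {"size_gib": 3.09, "avg": 0.5703},
    "w2a16g32": {"size_gib": 2.5, "avg": 0.517},
}


def summarize(results, baseline="FP16"):
    """Return compression ratio and retained average score relative to the baseline."""
    base = results[baseline]
    summary = {}
    for name, row in results.items():
        summary[name] = {
            # How many times smaller the checkpoint is than the baseline.
            "compression": base["size_gib"] / row["size_gib"],
            # Fraction of the baseline's average zero-shot score that is kept.
            "retained": row["avg"] / base["avg"],
        }
    return summary


summary = summarize(results)
for name, s in summary.items():
    print(f"{name:>9}: {s['compression']:.2f}x smaller, "
          f"{s['retained']:.1%} of the FP16 average")
```

Under these figures, the w2a16g8 variant is roughly 3.9x smaller than FP16 while keeping about 96% of its average score, and the w2a16g32 variant is roughly 4.8x smaller while keeping about 87%.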