Update README.md
Browse files
README.md
CHANGED
|
@@ -4,7 +4,8 @@ datasets:
|
|
| 4 |
---
|
| 5 |
# Platypus2-70B-instruct-4bit-gptq
|
| 6 |
|
| 7 |
-
Platypus2-70B-instruct-4bit-gptq is a qunatnized version of [`garage-bAInd/Platypus2-70B-instruct`](https://huggingface.co/garage-bAInd/Platypus2-70B-instruct) using GPTQ Quantnization
|
|
|
|
| 8 |
|
| 9 |
### Benchmark Metrics
|
| 10 |
|
|
|
|
| 4 |
---
|
| 5 |
# Platypus2-70B-instruct-4bit-gptq
|
| 6 |
|
| 7 |
+
Platypus2-70B-instruct-4bit-gptq is a qunatnized version of [`garage-bAInd/Platypus2-70B-instruct`](https://huggingface.co/garage-bAInd/Platypus2-70B-instruct) using GPTQ Quantnization.
|
| 8 |
+
The model is only 35 GIB in size in comparision with the original garage-bAInd/Platypus2-70B-instruct 127 GIB in size and can run on a single GPU
|
| 9 |
|
| 10 |
### Benchmark Metrics
|
| 11 |
|