malhajar/Platypus2-70B-instruct-4bit-gptq

Text Generation

text-generation-inference


malhajar committed on Aug 23, 2023

Commit 296e89b · 1 Parent(s): 5884758

Update README.md

Files changed (1)

README.md +2 -1


@@ -4,7 +4,8 @@ datasets:
 ---
 # Platypus2-70B-instruct-4bit-gptq
-Platypus2-70B-instruct-4bit-gptq is a quantized version of [`garage-bAInd/Platypus2-70B-instruct`](https://huggingface.co/garage-bAInd/Platypus2-70B-instruct) using GPTQ quantization
 ### Benchmark Metrics

 ---
 # Platypus2-70B-instruct-4bit-gptq
+Platypus2-70B-instruct-4bit-gptq is a quantized version of [`garage-bAInd/Platypus2-70B-instruct`](https://huggingface.co/garage-bAInd/Platypus2-70B-instruct) using GPTQ quantization.
+The model is only 35 GiB, compared with 127 GiB for the original garage-bAInd/Platypus2-70B-instruct, and can run on a single GPU.
 ### Benchmark Metrics
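
As a rough sanity check on the sizes quoted in the added README line, the sketch below estimates weight storage at 16 and 4 bits per weight. It assumes a nominal 70e9 parameters (taken from the "70B" in the model name, not from the README) and counts only raw weight bytes. The names `weight_size_gib` and `N_PARAMS` are illustrative and not part of the repository.

```python
# Back-of-the-envelope weight-size estimate; not a measurement of the actual files.
GIB = 2**30  # bytes per GiB


def weight_size_gib(n_params: float, bits_per_weight: float) -> float:
    """Return the raw storage needed for n_params weights, in GiB."""
    return n_params * bits_per_weight / 8 / GIB


# Assumption: nominal parameter count implied by "70B" in the model name.
N_PARAMS = 70e9

fp16_gib = weight_size_gib(N_PARAMS, 16)  # unquantized half-precision weights
int4_gib = weight_size_gib(N_PARAMS, 4)   # 4-bit quantized weights

# Sizes quoted in the README line added by this commit.
reported_ratio = 127 / 35

print(f"fp16 estimate:  {fp16_gib:.1f} GiB")
print(f"4-bit estimate: {int4_gib:.1f} GiB")
print(f"ideal ratio:    {fp16_gib / int4_gib:.2f}x")
print(f"reported ratio: {reported_ratio:.2f}x")
```

The raw 4-bit estimate comes out somewhat below the 35 GiB quoted in the README. A plausible explanation is that GPTQ checkpoints also store per-group scales and zero points, and that some layers are typically left unquantized, but the commit itself does not say.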