malhajar
/

Platypus2-70B-instruct-4bit-gptq

Text Generation

text-generation-inference

Model card Files Files and versions

malhajar commited on Aug 23, 2023

Commit

7679d2f

·

1 Parent(s): c3ecd0f

Create README.md

Files changed (1) hide show

README.md +72 -0

README.md ADDED Viewed

	@@ -0,0 +1,72 @@

+---
+datasets:
+- yahma/alpaca-cleaned
+---
+---
+language:
+- en
+datasets:
+license: cc-by-nc-4.0
+---
+# Platypus2-70B-instruct-4bit-gptq
+Platypus2-70B-instruct-4bit-gptq is a qunatnized version of [`garage-bAInd/Platypus2-70B-instruct`](https://huggingface.co/garage-bAInd/Platypus2-70B-instruct) using GPTQ Quantnization
+### Benchmark Metrics
+will report soon
+### Model Details
+* **Trained by**: **Platypus2-70B-instruct-4bit-gptq** quantnized by Mohamad [email protected] ;
+* **Model type:**  **Platypus2-70B-instruct-4bit-gptq** is a quantnized version of Platypus2-70B-instruct using 4bit quantnization
+* **Language(s)**: English
+* **License**: Non-Commercial Creative Commons license ([CC BY-NC-4.0](https://creativecommons.org/licenses/by-nc/4.0/))
+### Prompt Template
+```
+### Instruction:
+<prompt> (without the <>)
+### Response:
+```
+### Training Dataset
+`Platypus2-70B-instruct-4bit-gptq` quantnized using gptq on Alpaca dataset [`yahma/alpaca-cleaned`](https://huggingface.co/datasets/yahma/alpaca-cleaned).
+### Training Procedure
+`garage-bAInd/Platypus2-70B` was instruction fine-tuned using gptq on 2 L40 48GB.
+### Citations
+```bibtex
+@article{platypus2023,
+    title={Platypus: Quick, Cheap, and Powerful Refinement of LLMs},
+    author={Ariel N. Lee and Cole J. Hunter and Nataniel Ruiz},
+    booktitle={arXiv preprint arxiv:2308.07317},
+    year={2023}
+}
+```
+```bibtex
+@misc{touvron2023llama,
+    title={Llama 2: Open Foundation and Fine-Tuned Chat Models},
+    author={Hugo Touvron and Louis Martin and Kevin Stone and Peter Albert and Amjad Almahairi and Yasmine Babaei and Nikolay Bashlykov       year={2023},
+    eprint={2307.09288},
+    archivePrefix={arXiv},
+}
+```
+```bibtex
+@misc{frantar2023gptq,
+      title={GPTQ: Accurate Post-Training Quantization for Generative Pre-trained Transformers},
+      author={Elias Frantar and Saleh Ashkboos and Torsten Hoefler and Dan Alistarh},
+      year={2023},
+      eprint={2210.17323},
+      archivePrefix={arXiv},
+      primaryClass={cs.LG}
+}
+```