Commit d82a6a4
Parent: 96379e3

Update README.md

README.md CHANGED
@@ -34,13 +34,13 @@ tags:
 
 *Image drawn by GPT-4 DALL·E 3* TL;DR: Perhaps this 7B model, better than all existing models <= 33B, in most quantitative evaluations...
 
-**Some problems with llama.cpp on GPT2Tokenizer, gotta fix soon...**
-
 # Please Stop Using WRONG unofficial quant models unless you know what you're doing
 
-GPTQ quants require a good dataset for calibration, and the default C4 dataset is not capable
+GPTQ quants require a good dataset for calibration, and the default C4 dataset is not capable.
+
+**llama.cpp GGUF models**
+GPT2Tokenizer fixed by [Kerfuffle](https://github.com/KerfuffleV2) on [https://github.com/ggerganov/llama.cpp/pull/3743](https://github.com/ggerganov/llama.cpp/pull/3743), new models to be reuploaded.
 
-**Some problems with llama.cpp on GPT2Tokenizer, gotta fix soon...**
 
 ## Read Me:
 
@@ -91,7 +91,8 @@ Hard acc:48.03
 **Zero-shot ACC 0.5921152388172858** (Outperforms WizardMath-7B and Qwen-7B)
 
 
-**
+**llama.cpp GGUF models**
+GPT2Tokenizer support fixed by [Kerfuffle](https://github.com/KerfuffleV2) in [https://github.com/ggerganov/llama.cpp/pull/3743](https://github.com/ggerganov/llama.cpp/pull/3743); new models will be uploaded later.
 
 ## 请读我:
 
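The diff's point that GPTQ quants need a representative calibration dataset can be illustrated with a toy absmax-quantization sketch. This is a stand-in for the general idea only, not GPTQ's actual algorithm: the quantization scale is estimated from calibration samples, so a calibration set whose value range does not match the real activations causes clipping and larger error.

```python
import random

def absmax_quantize(values, calib, bits=4):
    """Quantize/dequantize values using a scale estimated from CALIBRATION data.

    Toy illustration only (not GPTQ): if the calibration set doesn't cover
    the real data's range, out-of-range values get clipped and error grows.
    """
    qmax = 2 ** (bits - 1) - 1  # 7 for signed 4-bit
    scale = max(abs(v) for v in calib) / qmax
    quantized = [max(-qmax, min(qmax, round(v / scale))) for v in values]
    return [q * scale for q in quantized]

def mse(a, b):
    return sum((x - y) ** 2 for x, y in zip(a, b)) / len(a)

random.seed(0)
data = [random.gauss(0, 1.0) for _ in range(1000)]        # "real" activations
good_calib = [random.gauss(0, 1.0) for _ in range(200)]   # matches the data distribution
bad_calib = [random.gauss(0, 0.2) for _ in range(200)]    # too narrow: scale clips the data

err_good = mse(data, absmax_quantize(data, good_calib))
err_bad = mse(data, absmax_quantize(data, bad_calib))
assert err_bad > err_good  # a mismatched calibration set hurts reconstruction accuracy
```

The same intuition applies at model scale: calibrating a quant on generic C4 text when the model was tuned on a different distribution (e.g. math or chat data) can degrade quality, which is the author's argument against the unofficial quants.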