bgsach committed on
Commit eae1d49 · 1 Parent(s): 7db477a

Upload folder using huggingface_hub

Files changed (4)
  1. README.md +13 -0
  2. config.json +6 -0
  3. model.bin +3 -0
  4. vocabulary.json +0 -0
README.md ADDED
@@ -0,0 +1,13 @@
+
+ This is a quantized version of **WizardLM/WizardCoder-Python-13B-V1.0**, quantized using [ctranslate2](https://github.com/OpenNMT/CTranslate2) (see inference instructions there).
+
+ **The license/caveats/intended usage is the same as the original model**.
+ The quality of its output may have
+ been negatively affected by the quantization process.
+
+
+ The command run to quantize the model was:
+
+ `ct2-transformers-converter --model ./models-hf/WizardLM/WizardCoder-Python-13B-V1.0 --quantization int8_float16 --output_dir ./models-ct/WizardLM/WizardCoder-Python-13B-V1.0-ct2-int8_float16`
+
+ The quantization was run on a 'high-mem', CPU only (8 core, 51GB) colab instance and took approximately 10 minutes.
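
The README defers to the CTranslate2 documentation for inference, so the following is only a minimal sketch of how the converted model might be loaded and queried from Python. It is not taken from this commit. The tokenizer source (the original `WizardLM/WizardCoder-Python-13B-V1.0` repository), the CUDA device, the example prompt, and the helper names `complete` and `main` are all assumptions made for illustration. The model directory is the `--output_dir` from the command above.

```python
def complete(generator, tokenizer, prompt, max_length=256):
    """Return a greedy completion of `prompt`.

    `generator` is expected to behave like ctranslate2.Generator and
    `tokenizer` like a Hugging Face tokenizer. Both are passed in so that
    this helper has no heavy imports of its own.
    """
    # CTranslate2 generators take token strings, not token ids.
    tokens = tokenizer.convert_ids_to_tokens(tokenizer.encode(prompt))
    results = generator.generate_batch(
        [tokens],
        max_length=max_length,
        sampling_topk=1,                 # top-1 sampling, i.e. greedy decoding
        include_prompt_in_result=False,  # return only newly generated tokens
    )
    return tokenizer.decode(results[0].sequences_ids[0])


def main():
    # Imported lazily: both packages must be installed separately.
    import ctranslate2
    import transformers

    # The converter output directory named in the README command.
    model_dir = "./models-ct/WizardLM/WizardCoder-Python-13B-V1.0-ct2-int8_float16"
    # Assumption: the tokenizer comes from the original model repository.
    tokenizer = transformers.AutoTokenizer.from_pretrained(
        "WizardLM/WizardCoder-Python-13B-V1.0"
    )
    # Assumption: a CUDA GPU is available.
    generator = ctranslate2.Generator(
        model_dir, device="cuda", compute_type="int8_float16"
    )
    # Hypothetical prompt, chosen only for illustration.
    print(complete(generator, tokenizer, "def fibonacci(n):"))
```

Calling `main()` runs the full pipeline. `complete` takes the generator and tokenizer as arguments so that it can be exercised with stand-in objects before downloading the model.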
config.json ADDED
@@ -0,0 +1,6 @@
+ {
+   "bos_token": "</s>",
+   "eos_token": "</s>",
+   "layer_norm_epsilon": 1e-05,
+   "unk_token": "</s>"
+ }
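
As a small illustration of how this configuration could be inspected, the sketch below parses the same JSON and collects the special-token entries. The helper name `special_tokens` is hypothetical, and the JSON is inlined here rather than read from disk.

```python
import json

# The content of config.json as added in this commit.
CONFIG_TEXT = """{
  "bos_token": "</s>",
  "eos_token": "</s>",
  "layer_norm_epsilon": 1e-05,
  "unk_token": "</s>"
}"""


def special_tokens(config):
    """Return only the entries whose key ends in '_token'."""
    return {key: value for key, value in config.items() if key.endswith("_token")}


config = json.loads(CONFIG_TEXT)
tokens = special_tokens(config)

# All three special tokens in this file map to the same string.
distinct_tokens = set(tokens.values())
print(tokens)
print(distinct_tokens)
```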
model.bin ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:3caa018e7e599d9e150683cf87a53116061b89621ca5e7f7a4763b049c9b2224
+ size 13025100473
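
The three lines above are a Git LFS pointer, not the weights themselves. They record the SHA-256 digest and the byte size (roughly 13 GB) of the real `model.bin`. The sketch below shows one way a downloaded file could be checked against such a pointer. The function names `parse_lfs_pointer` and `verify_file` are hypothetical helpers and not part of any library.

```python
import hashlib

# The pointer content as added in this commit.
POINTER_TEXT = """version https://git-lfs.github.com/spec/v1
oid sha256:3caa018e7e599d9e150683cf87a53116061b89621ca5e7f7a4763b049c9b2224
size 13025100473
"""


def parse_lfs_pointer(text):
    """Parse 'key value' lines of a Git LFS pointer into a dict."""
    fields = dict(line.split(" ", 1) for line in text.splitlines() if line.strip())
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def verify_file(path, pointer, chunk_size=1 << 20):
    """Return True if the file at `path` matches the pointer's size and digest."""
    hasher = hashlib.new(pointer["algo"])
    total = 0
    with open(path, "rb") as handle:
        # Read in chunks so a multi-gigabyte file never has to fit in memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            hasher.update(chunk)
            total += len(chunk)
    return total == pointer["size"] and hasher.hexdigest() == pointer["digest"]
```

For example, `verify_file("model.bin", parse_lfs_pointer(POINTER_TEXT))` would check a local download, assuming the file was saved as `model.bin` in the working directory.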
vocabulary.json ADDED
The diff for this file is too large to render.