Upload folder using huggingface_hub

Files changed (4)

README.md ADDED

+This is a quantized version of **WizardLM/WizardCoder-Python-13B-V1.0**, produced with [CTranslate2](https://github.com/OpenNMT/CTranslate2) (see the inference instructions there).
+**The license, caveats, and intended usage are the same as for the original model.**
+The quantization process may have degraded the quality of the model's output.
+The command used to quantize the model was:
+ `ct2-transformers-converter --model ./models-hf/WizardLM/WizardCoder-Python-13B-V1.0 --quantization int8_float16 --output_dir ./models-ct/WizardLM/WizardCoder-Python-13B-V1.0-ct2-int8_float16`
+The quantization was run on a "high-mem", CPU-only (8 cores, 51 GB) Colab instance and took approximately 10 minutes.
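
The README gives the conversion command but leaves inference to the CTranslate2 documentation. The following is a minimal Python sketch of both steps, not the uploader's own script. `build_convert_command`, `convert`, and `generate` are hypothetical helper names; the choice of `device="cuda"`, greedy decoding (`sampling_topk=1`), and loading the tokenizer from the original Hugging Face repository are assumptions, and a suitable prompt format should be taken from the original model card.

```python
import subprocess


def build_convert_command(model_dir, output_dir, quantization="int8_float16"):
    """Return the ct2-transformers-converter argument list quoted in the README."""
    return [
        "ct2-transformers-converter",
        "--model", model_dir,
        "--quantization", quantization,
        "--output_dir", output_dir,
    ]


def convert(model_dir, output_dir, quantization="int8_float16"):
    """Run the conversion; requires the ctranslate2 package to be installed."""
    subprocess.run(
        build_convert_command(model_dir, output_dir, quantization), check=True
    )


def generate(model_dir, tokenizer_name, prompt, max_length=256, device="cuda"):
    """Decode a completion greedily with the converted model.

    Assumption: a CUDA device is available; pass device="cpu" otherwise.
    """
    # Imported lazily so the command builder works without these packages.
    import ctranslate2
    import transformers

    generator = ctranslate2.Generator(model_dir, device=device)
    tokenizer = transformers.AutoTokenizer.from_pretrained(tokenizer_name)

    # CTranslate2 generators take token strings rather than token IDs.
    tokens = tokenizer.convert_ids_to_tokens(tokenizer.encode(prompt))
    results = generator.generate_batch(
        [tokens],
        max_length=max_length,
        sampling_topk=1,  # assumption: greedy decoding
        include_prompt_in_result=False,
    )
    return tokenizer.decode(results[0].sequences_ids[0])
```

For example, `generate("./models-ct/WizardLM/WizardCoder-Python-13B-V1.0-ct2-int8_float16", "WizardLM/WizardCoder-Python-13B-V1.0", prompt)` would load the directory written by the command above, assuming the tokenizer files are fetched from the original repository.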

config.json ADDED

+{
+  "bos_token": "</s>",
+  "eos_token": "</s>",
+  "layer_norm_epsilon": 1e-05,
+  "unk_token": "</s>"
+}

model.bin ADDED

+version https://git-lfs.github.com/spec/v1
+oid sha256:3caa018e7e599d9e150683cf87a53116061b89621ca5e7f7a4763b049c9b2224
+size 13025100473
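
The three lines above are a Git LFS pointer: the repository stores only the spec version, the SHA-256 digest, and the byte size of `model.bin`. As a sketch of how a downloaded copy could be checked against that pointer, the hypothetical helpers `parse_lfs_pointer` and `matches_pointer` below use only the Python standard library; hashing a file of roughly 13 GB this way will take a while.

```python
import hashlib
import os

# Pointer contents copied from the diff above (without the leading "+").
POINTER_TEXT = """\
version https://git-lfs.github.com/spec/v1
oid sha256:3caa018e7e599d9e150683cf87a53116061b89621ca5e7f7a4763b049c9b2224
size 13025100473
"""


def parse_lfs_pointer(text):
    """Parse a Git LFS pointer file into its version, hash, and size fields."""
    fields = {}
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        key, _, value = line.partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path, pointer, chunk_size=1 << 20):
    """Return True if the file at `path` has the size and digest in `pointer`."""
    # Cheap check first: a size mismatch avoids hashing the whole file.
    if os.path.getsize(path) != pointer["size"]:
        return False
    hasher = hashlib.new(pointer["algo"])
    with open(path, "rb") as handle:
        # Read in chunks so a multi-gigabyte file is never fully in memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            hasher.update(chunk)
    return hasher.hexdigest() == pointer["digest"]
```

Calling `matches_pointer("model.bin", parse_lfs_pointer(POINTER_TEXT))` on a local download would then indicate whether the file is complete and uncorrupted.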

vocabulary.json ADDED

The diff for this file is too large to render.