zeroshot
/

gte-large-quant

Feature Extraction

sparse sparsity quantized onnx embeddings int8

text-embeddings-inference

Model card Files Files and versions

zeroshot commited on Oct 15, 2023

Commit

c0a96cd

·

1 Parent(s): 00f763a

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -6,7 +6,7 @@ language:
 - en
 ---
-# gte-large-large
 This is the quantized (INT8) ONNX variant of the [gte-large](https://huggingface.co/thenlper/gte-large) embeddings model created with [DeepSparse Optimum](https://github.com/neuralmagic/optimum-deepsparse) for ONNX export/inference and Neural Magic's [Sparsify](https://github.com/neuralmagic/sparsify) for one-shot quantization.

 - en
 ---
+# gte-large-quant
 This is the quantized (INT8) ONNX variant of the [gte-large](https://huggingface.co/thenlper/gte-large) embeddings model created with [DeepSparse Optimum](https://github.com/neuralmagic/optimum-deepsparse) for ONNX export/inference and Neural Magic's [Sparsify](https://github.com/neuralmagic/sparsify) for one-shot quantization.