Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Xenova
/
llama-68m
like
0
Text Generation
Transformers.js
ONNX
llama
Model card
Files
Files and versions
xet
Community
1
Use this model
main
llama-68m
/
onnx
Ctrl+K
Ctrl+K
2 contributors
History:
3 commits
Xenova
HF Staff
Add/update the quantized ONNX model files and README.md for Transformers.js v3 (
#1
)
1fb7da2
verified
about 23 hours ago
decoder_model.onnx
Safe
273 MB
xet
Upload folder using huggingface_hub
almost 2 years ago
decoder_model_fp16.onnx
Safe
137 MB
xet
Upload fp16 ONNX weights
over 1 year ago
decoder_model_merged.onnx
Safe
274 MB
xet
Upload folder using huggingface_hub
almost 2 years ago
decoder_model_merged_quantized.onnx
Safe
70.4 MB
xet
Upload folder using huggingface_hub
almost 2 years ago
decoder_model_quantized.onnx
Safe
69.2 MB
xet
Upload folder using huggingface_hub
almost 2 years ago
decoder_with_past_model.onnx
Safe
273 MB
xet
Upload folder using huggingface_hub
almost 2 years ago
decoder_with_past_model_quantized.onnx
Safe
69.2 MB
xet
Upload folder using huggingface_hub
almost 2 years ago
model.onnx
Safe
274 MB
xet
Add/update the quantized ONNX model files and README.md for Transformers.js v3 (#1)
about 23 hours ago
model_bnb4.onnx
124 MB
xet
Add/update the quantized ONNX model files and README.md for Transformers.js v3 (#1)
about 23 hours ago
model_fp16.onnx
137 MB
xet
Add/update the quantized ONNX model files and README.md for Transformers.js v3 (#1)
about 23 hours ago
model_int8.onnx
69.2 MB
xet
Add/update the quantized ONNX model files and README.md for Transformers.js v3 (#1)
about 23 hours ago
model_q4.onnx
127 MB
xet
Add/update the quantized ONNX model files and README.md for Transformers.js v3 (#1)
about 23 hours ago
model_q4f16.onnx
74.2 MB
xet
Add/update the quantized ONNX model files and README.md for Transformers.js v3 (#1)
about 23 hours ago
model_uint8.onnx
69.2 MB
xet
Add/update the quantized ONNX model files and README.md for Transformers.js v3 (#1)
about 23 hours ago