Add/update the quantized ONNX model files and README.md for Transformers.js v3

by whitphx HF Staff - opened Jun 24

←

Owner Jun 24

Applied Quantizations

↳ ✅ q4f16: decoder_with_past_model_q4f16.onnx (added)

↳ ✅ q4f16: decoder_model_q4f16.onnx (added)

↳ ✅ q4f16: encoder_model_q4f16.onnx (added)

↳ ✅ fp16: decoder_model_merged_fp16.onnx (replaced because it was invalid)
↳ ✅ q4f16: decoder_model_merged_q4f16.onnx (added)

Owner Aug 29

whitphx changed pull request status to closed Aug 29

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment