v0.30.5
See https://github.com/quic/ai-hub-models/releases/v0.30.5 for changelog.
- README.md +68 -66
- VIT_w8a16.onnx +2 -2
- VIT_w8a8.onnx +2 -2
README.md
CHANGED
@@ -35,68 +35,70 @@ More details on model performance across various devices, can be found
|
|
35 |
|
36 |
| Model | Precision | Device | Chipset | Target Runtime | Inference Time (ms) | Peak Memory Range (MB) | Primary Compute Unit | Target Model
|
37 |
|---|---|---|---|---|---|---|---|---|
|
38 |
-
| VIT | float | QCS8275 (Proxy) | Qualcomm® QCS8275 (Proxy) | TFLITE |
|
39 |
-
| VIT | float | QCS8275 (Proxy) | Qualcomm® QCS8275 (Proxy) | QNN |
|
40 |
-
| VIT | float | QCS8450 (Proxy) | Qualcomm® QCS8450 (Proxy) | TFLITE |
|
41 |
-
| VIT | float | QCS8450 (Proxy) | Qualcomm® QCS8450 (Proxy) | QNN | 21.
|
42 |
-
| VIT | float | QCS8550 (Proxy) | Qualcomm® QCS8550 (Proxy) | TFLITE | 13.
|
43 |
-
| VIT | float | QCS8550 (Proxy) | Qualcomm® QCS8550 (Proxy) | QNN | 13.
|
44 |
-
| VIT | float | QCS9075 (Proxy) | Qualcomm® QCS9075 (Proxy) | TFLITE | 16.
|
45 |
-
| VIT | float | QCS9075 (Proxy) | Qualcomm® QCS9075 (Proxy) | QNN | 16.
|
46 |
-
| VIT | float | SA7255P ADP | Qualcomm® SA7255P | TFLITE |
|
47 |
-
| VIT | float | SA7255P ADP | Qualcomm® SA7255P | QNN |
|
48 |
-
| VIT | float | SA8255 (Proxy) | Qualcomm® SA8255P (Proxy) | TFLITE |
|
49 |
-
| VIT | float | SA8255 (Proxy) | Qualcomm® SA8255P (Proxy) | QNN | 13.
|
50 |
-
| VIT | float | SA8295P ADP | Qualcomm® SA8295P | TFLITE |
|
51 |
-
| VIT | float | SA8295P ADP | Qualcomm® SA8295P | QNN |
|
52 |
-
| VIT | float | SA8650 (Proxy) | Qualcomm® SA8650P (Proxy) | TFLITE |
|
53 |
-
| VIT | float | SA8650 (Proxy) | Qualcomm® SA8650P (Proxy) | QNN | 13.
|
54 |
-
| VIT | float | SA8775P ADP | Qualcomm® SA8775P | TFLITE | 16.
|
55 |
-
| VIT | float | SA8775P ADP | Qualcomm® SA8775P | QNN | 16.
|
56 |
-
| VIT | float | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 Mobile | TFLITE |
|
57 |
-
| VIT | float | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 Mobile | QNN | 13.
|
58 |
-
| VIT | float | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 Mobile | ONNX | 13.
|
59 |
-
| VIT | float | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | TFLITE | 9.
|
60 |
-
| VIT | float | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | QNN | 9.
|
61 |
-
| VIT | float | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | ONNX | 9.
|
62 |
-
| VIT | float | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite Mobile | TFLITE |
|
63 |
-
| VIT | float | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite Mobile | QNN |
|
64 |
-
| VIT | float | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite Mobile | ONNX |
|
65 |
-
| VIT | float | Snapdragon X Elite CRD | Snapdragon® X Elite | QNN | 14.
|
66 |
-
| VIT | float | Snapdragon X Elite CRD | Snapdragon® X Elite | ONNX | 14.
|
67 |
-
| VIT | w8a16 | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 Mobile | ONNX |
|
68 |
-
| VIT | w8a16 | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | ONNX |
|
69 |
-
| VIT |
|
70 |
-
| VIT |
|
71 |
-
| VIT | w8a8 |
|
72 |
-
| VIT | w8a8 |
|
73 |
-
| VIT | w8a8 |
|
74 |
-
| VIT | w8a8 |
|
75 |
-
| VIT | w8a8 |
|
76 |
-
| VIT | w8a8 |
|
77 |
-
| VIT | w8a8 |
|
78 |
-
| VIT | w8a8 |
|
79 |
-
| VIT | w8a8 |
|
80 |
-
| VIT | w8a8 |
|
81 |
-
| VIT | w8a8 |
|
82 |
-
| VIT | w8a8 |
|
83 |
-
| VIT | w8a8 |
|
84 |
-
| VIT | w8a8 |
|
85 |
-
| VIT | w8a8 |
|
86 |
-
| VIT | w8a8 |
|
87 |
-
| VIT | w8a8 |
|
88 |
-
| VIT | w8a8 |
|
89 |
-
| VIT | w8a8 |
|
90 |
-
| VIT | w8a8 |
|
91 |
-
| VIT | w8a8 | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 Mobile |
|
92 |
-
| VIT | w8a8 | Samsung Galaxy
|
93 |
-
| VIT | w8a8 | Samsung Galaxy
|
94 |
-
| VIT | w8a8 | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile |
|
95 |
-
| VIT | w8a8 |
|
96 |
-
| VIT | w8a8 |
|
97 |
-
| VIT | w8a8 | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite Mobile |
|
98 |
-
| VIT | w8a8 | Snapdragon
|
99 |
-
| VIT | w8a8 | Snapdragon
|
|
|
|
|
100 |
|
101 |
|
102 |
|
@@ -160,8 +162,8 @@ Profiling Results
|
|
160 |
VIT
|
161 |
Device : cs_8275 (ANDROID 14)
|
162 |
Runtime : TFLITE
|
163 |
-
Estimated inference time (ms) :
|
164 |
-
Estimated peak memory usage (MB): [0,
|
165 |
Total # Ops : 1579
|
166 |
Compute Unit(s) : npu (1579 ops) gpu (0 ops) cpu (0 ops)
|
167 |
```
|
@@ -250,13 +252,13 @@ AI Hub. [Sign up for access](https://myaccount.qualcomm.com/signup).
|
|
250 |
You can also run the demo on-device.
|
251 |
|
252 |
```bash
|
253 |
-
python -m qai_hub_models.models.vit.demo --on-device
|
254 |
```
|
255 |
|
256 |
**NOTE**: If you want running in a Jupyter Notebook or Google Colab like
|
257 |
environment, please add the following to your cell (instead of the above).
|
258 |
```
|
259 |
-
%run -m qai_hub_models.models.vit.demo -- --on-device
|
260 |
```
|
261 |
|
262 |
|
|
|
35 |
|
36 |
| Model | Precision | Device | Chipset | Target Runtime | Inference Time (ms) | Peak Memory Range (MB) | Primary Compute Unit | Target Model
|
37 |
|---|---|---|---|---|---|---|---|---|
|
38 |
+
| VIT | float | QCS8275 (Proxy) | Qualcomm® QCS8275 (Proxy) | TFLITE | 249.1 ms | 0 - 315 MB | NPU | [VIT.tflite](https://huggingface.co/qualcomm/VIT/blob/main/VIT.tflite) |
|
39 |
+
| VIT | float | QCS8275 (Proxy) | Qualcomm® QCS8275 (Proxy) | QNN | 248.995 ms | 1 - 10 MB | NPU | Use Export Script |
|
40 |
+
| VIT | float | QCS8450 (Proxy) | Qualcomm® QCS8450 (Proxy) | TFLITE | 17.91 ms | 0 - 317 MB | NPU | [VIT.tflite](https://huggingface.co/qualcomm/VIT/blob/main/VIT.tflite) |
|
41 |
+
| VIT | float | QCS8450 (Proxy) | Qualcomm® QCS8450 (Proxy) | QNN | 21.232 ms | 0 - 316 MB | NPU | Use Export Script |
|
42 |
+
| VIT | float | QCS8550 (Proxy) | Qualcomm® QCS8550 (Proxy) | TFLITE | 13.083 ms | 0 - 15 MB | NPU | [VIT.tflite](https://huggingface.co/qualcomm/VIT/blob/main/VIT.tflite) |
|
43 |
+
| VIT | float | QCS8550 (Proxy) | Qualcomm® QCS8550 (Proxy) | QNN | 13.865 ms | 1 - 3 MB | NPU | Use Export Script |
|
44 |
+
| VIT | float | QCS9075 (Proxy) | Qualcomm® QCS9075 (Proxy) | TFLITE | 16.025 ms | 0 - 315 MB | NPU | [VIT.tflite](https://huggingface.co/qualcomm/VIT/blob/main/VIT.tflite) |
|
45 |
+
| VIT | float | QCS9075 (Proxy) | Qualcomm® QCS9075 (Proxy) | QNN | 16.89 ms | 1 - 10 MB | NPU | Use Export Script |
|
46 |
+
| VIT | float | SA7255P ADP | Qualcomm® SA7255P | TFLITE | 249.1 ms | 0 - 315 MB | NPU | [VIT.tflite](https://huggingface.co/qualcomm/VIT/blob/main/VIT.tflite) |
|
47 |
+
| VIT | float | SA7255P ADP | Qualcomm® SA7255P | QNN | 248.995 ms | 1 - 10 MB | NPU | Use Export Script |
|
48 |
+
| VIT | float | SA8255 (Proxy) | Qualcomm® SA8255P (Proxy) | TFLITE | 13.106 ms | 0 - 15 MB | NPU | [VIT.tflite](https://huggingface.co/qualcomm/VIT/blob/main/VIT.tflite) |
|
49 |
+
| VIT | float | SA8255 (Proxy) | Qualcomm® SA8255P (Proxy) | QNN | 13.886 ms | 1 - 4 MB | NPU | Use Export Script |
|
50 |
+
| VIT | float | SA8295P ADP | Qualcomm® SA8295P | TFLITE | 20.029 ms | 0 - 307 MB | NPU | [VIT.tflite](https://huggingface.co/qualcomm/VIT/blob/main/VIT.tflite) |
|
51 |
+
| VIT | float | SA8295P ADP | Qualcomm® SA8295P | QNN | 19.918 ms | 1 - 16 MB | NPU | Use Export Script |
|
52 |
+
| VIT | float | SA8650 (Proxy) | Qualcomm® SA8650P (Proxy) | TFLITE | 13.131 ms | 0 - 20 MB | NPU | [VIT.tflite](https://huggingface.co/qualcomm/VIT/blob/main/VIT.tflite) |
|
53 |
+
| VIT | float | SA8650 (Proxy) | Qualcomm® SA8650P (Proxy) | QNN | 13.943 ms | 1 - 4 MB | NPU | Use Export Script |
|
54 |
+
| VIT | float | SA8775P ADP | Qualcomm® SA8775P | TFLITE | 16.025 ms | 0 - 315 MB | NPU | [VIT.tflite](https://huggingface.co/qualcomm/VIT/blob/main/VIT.tflite) |
|
55 |
+
| VIT | float | SA8775P ADP | Qualcomm® SA8775P | QNN | 16.89 ms | 1 - 10 MB | NPU | Use Export Script |
|
56 |
+
| VIT | float | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 Mobile | TFLITE | 12.749 ms | 0 - 34 MB | NPU | [VIT.tflite](https://huggingface.co/qualcomm/VIT/blob/main/VIT.tflite) |
|
57 |
+
| VIT | float | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 Mobile | QNN | 13.849 ms | 0 - 28 MB | NPU | Use Export Script |
|
58 |
+
| VIT | float | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 Mobile | ONNX | 13.585 ms | 0 - 293 MB | NPU | [VIT.onnx](https://huggingface.co/qualcomm/VIT/blob/main/VIT.onnx) |
|
59 |
+
| VIT | float | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | TFLITE | 9.013 ms | 0 - 323 MB | NPU | [VIT.tflite](https://huggingface.co/qualcomm/VIT/blob/main/VIT.tflite) |
|
60 |
+
| VIT | float | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | QNN | 9.579 ms | 1 - 333 MB | NPU | Use Export Script |
|
61 |
+
| VIT | float | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | ONNX | 9.249 ms | 138 - 474 MB | NPU | [VIT.onnx](https://huggingface.co/qualcomm/VIT/blob/main/VIT.onnx) |
|
62 |
+
| VIT | float | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite Mobile | TFLITE | 8.255 ms | 0 - 319 MB | NPU | [VIT.tflite](https://huggingface.co/qualcomm/VIT/blob/main/VIT.tflite) |
|
63 |
+
| VIT | float | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite Mobile | QNN | 8.028 ms | 1 - 312 MB | NPU | Use Export Script |
|
64 |
+
| VIT | float | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite Mobile | ONNX | 7.826 ms | 1 - 321 MB | NPU | [VIT.onnx](https://huggingface.co/qualcomm/VIT/blob/main/VIT.onnx) |
|
65 |
+
| VIT | float | Snapdragon X Elite CRD | Snapdragon® X Elite | QNN | 14.611 ms | 1 - 1 MB | NPU | Use Export Script |
|
66 |
+
| VIT | float | Snapdragon X Elite CRD | Snapdragon® X Elite | ONNX | 14.889 ms | 171 - 171 MB | NPU | [VIT.onnx](https://huggingface.co/qualcomm/VIT/blob/main/VIT.onnx) |
|
67 |
+
| VIT | w8a16 | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 Mobile | ONNX | 154.348 ms | 652 - 904 MB | NPU | [VIT.onnx](https://huggingface.co/qualcomm/VIT/blob/main/VIT_w8a16.onnx) |
|
68 |
+
| VIT | w8a16 | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | ONNX | 120.86 ms | 672 - 832 MB | NPU | [VIT.onnx](https://huggingface.co/qualcomm/VIT/blob/main/VIT_w8a16.onnx) |
|
69 |
+
| VIT | w8a16 | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite Mobile | ONNX | 116.224 ms | 678 - 809 MB | NPU | [VIT.onnx](https://huggingface.co/qualcomm/VIT/blob/main/VIT_w8a16.onnx) |
|
70 |
+
| VIT | w8a16 | Snapdragon X Elite CRD | Snapdragon® X Elite | ONNX | 169.671 ms | 922 - 922 MB | NPU | [VIT.onnx](https://huggingface.co/qualcomm/VIT/blob/main/VIT_w8a16.onnx) |
|
71 |
+
| VIT | w8a8 | QCS8275 (Proxy) | Qualcomm® QCS8275 (Proxy) | TFLITE | 53.751 ms | 0 - 51 MB | NPU | [VIT.tflite](https://huggingface.co/qualcomm/VIT/blob/main/VIT_w8a8.tflite) |
|
72 |
+
| VIT | w8a8 | QCS8275 (Proxy) | Qualcomm® QCS8275 (Proxy) | QNN | 205.911 ms | 0 - 10 MB | NPU | Use Export Script |
|
73 |
+
| VIT | w8a8 | QCS8450 (Proxy) | Qualcomm® QCS8450 (Proxy) | TFLITE | 13.227 ms | 0 - 59 MB | NPU | [VIT.tflite](https://huggingface.co/qualcomm/VIT/blob/main/VIT_w8a8.tflite) |
|
74 |
+
| VIT | w8a8 | QCS8450 (Proxy) | Qualcomm® QCS8450 (Proxy) | QNN | 14.52 ms | 0 - 201 MB | NPU | Use Export Script |
|
75 |
+
| VIT | w8a8 | QCS8550 (Proxy) | Qualcomm® QCS8550 (Proxy) | TFLITE | 12.431 ms | 0 - 44 MB | NPU | [VIT.tflite](https://huggingface.co/qualcomm/VIT/blob/main/VIT_w8a8.tflite) |
|
76 |
+
| VIT | w8a8 | QCS8550 (Proxy) | Qualcomm® QCS8550 (Proxy) | QNN | 10.613 ms | 0 - 2 MB | NPU | Use Export Script |
|
77 |
+
| VIT | w8a8 | QCS9075 (Proxy) | Qualcomm® QCS9075 (Proxy) | TFLITE | 12.873 ms | 0 - 51 MB | NPU | [VIT.tflite](https://huggingface.co/qualcomm/VIT/blob/main/VIT_w8a8.tflite) |
|
78 |
+
| VIT | w8a8 | QCS9075 (Proxy) | Qualcomm® QCS9075 (Proxy) | QNN | 9.671 ms | 0 - 10 MB | NPU | Use Export Script |
|
79 |
+
| VIT | w8a8 | RB3 Gen 2 (Proxy) | Qualcomm® QCS6490 (Proxy) | TFLITE | 51.37 ms | 2 - 45 MB | NPU | [VIT.tflite](https://huggingface.co/qualcomm/VIT/blob/main/VIT_w8a8.tflite) |
|
80 |
+
| VIT | w8a8 | RB3 Gen 2 (Proxy) | Qualcomm® QCS6490 (Proxy) | QNN | 79.083 ms | 0 - 12 MB | NPU | Use Export Script |
|
81 |
+
| VIT | w8a8 | SA7255P ADP | Qualcomm® SA7255P | TFLITE | 53.751 ms | 0 - 51 MB | NPU | [VIT.tflite](https://huggingface.co/qualcomm/VIT/blob/main/VIT_w8a8.tflite) |
|
82 |
+
| VIT | w8a8 | SA7255P ADP | Qualcomm® SA7255P | QNN | 205.911 ms | 0 - 10 MB | NPU | Use Export Script |
|
83 |
+
| VIT | w8a8 | SA8255 (Proxy) | Qualcomm® SA8255P (Proxy) | TFLITE | 12.564 ms | 0 - 21 MB | NPU | [VIT.tflite](https://huggingface.co/qualcomm/VIT/blob/main/VIT_w8a8.tflite) |
|
84 |
+
| VIT | w8a8 | SA8255 (Proxy) | Qualcomm® SA8255P (Proxy) | QNN | 10.668 ms | 0 - 2 MB | NPU | Use Export Script |
|
85 |
+
| VIT | w8a8 | SA8295P ADP | Qualcomm® SA8295P | TFLITE | 15.111 ms | 0 - 54 MB | NPU | [VIT.tflite](https://huggingface.co/qualcomm/VIT/blob/main/VIT_w8a8.tflite) |
|
86 |
+
| VIT | w8a8 | SA8295P ADP | Qualcomm® SA8295P | QNN | 16.868 ms | 0 - 17 MB | NPU | Use Export Script |
|
87 |
+
| VIT | w8a8 | SA8650 (Proxy) | Qualcomm® SA8650P (Proxy) | TFLITE | 12.595 ms | 0 - 19 MB | NPU | [VIT.tflite](https://huggingface.co/qualcomm/VIT/blob/main/VIT_w8a8.tflite) |
|
88 |
+
| VIT | w8a8 | SA8650 (Proxy) | Qualcomm® SA8650P (Proxy) | QNN | 10.645 ms | 0 - 2 MB | NPU | Use Export Script |
|
89 |
+
| VIT | w8a8 | SA8775P ADP | Qualcomm® SA8775P | TFLITE | 12.873 ms | 0 - 51 MB | NPU | [VIT.tflite](https://huggingface.co/qualcomm/VIT/blob/main/VIT_w8a8.tflite) |
|
90 |
+
| VIT | w8a8 | SA8775P ADP | Qualcomm® SA8775P | QNN | 9.671 ms | 0 - 10 MB | NPU | Use Export Script |
|
91 |
+
| VIT | w8a8 | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 Mobile | TFLITE | 12.493 ms | 0 - 20 MB | NPU | [VIT.tflite](https://huggingface.co/qualcomm/VIT/blob/main/VIT_w8a8.tflite) |
|
92 |
+
| VIT | w8a8 | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 Mobile | QNN | 10.598 ms | 0 - 32 MB | NPU | Use Export Script |
|
93 |
+
| VIT | w8a8 | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 Mobile | ONNX | 28.306 ms | 0 - 114 MB | NPU | [VIT.onnx](https://huggingface.co/qualcomm/VIT/blob/main/VIT_w8a8.onnx) |
|
94 |
+
| VIT | w8a8 | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | TFLITE | 8.944 ms | 0 - 56 MB | NPU | [VIT.tflite](https://huggingface.co/qualcomm/VIT/blob/main/VIT_w8a8.tflite) |
|
95 |
+
| VIT | w8a8 | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | QNN | 7.082 ms | 0 - 163 MB | NPU | Use Export Script |
|
96 |
+
| VIT | w8a8 | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | ONNX | 19.695 ms | 0 - 277 MB | NPU | [VIT.onnx](https://huggingface.co/qualcomm/VIT/blob/main/VIT_w8a8.onnx) |
|
97 |
+
| VIT | w8a8 | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite Mobile | TFLITE | 6.133 ms | 0 - 58 MB | NPU | [VIT.tflite](https://huggingface.co/qualcomm/VIT/blob/main/VIT_w8a8.tflite) |
|
98 |
+
| VIT | w8a8 | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite Mobile | QNN | 5.766 ms | 0 - 263 MB | NPU | Use Export Script |
|
99 |
+
| VIT | w8a8 | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite Mobile | ONNX | 14.788 ms | 0 - 254 MB | NPU | [VIT.onnx](https://huggingface.co/qualcomm/VIT/blob/main/VIT_w8a8.onnx) |
|
100 |
+
| VIT | w8a8 | Snapdragon X Elite CRD | Snapdragon® X Elite | QNN | 11.218 ms | 1 - 1 MB | NPU | Use Export Script |
|
101 |
+
| VIT | w8a8 | Snapdragon X Elite CRD | Snapdragon® X Elite | ONNX | 34.257 ms | 88 - 88 MB | NPU | [VIT.onnx](https://huggingface.co/qualcomm/VIT/blob/main/VIT_w8a8.onnx) |
|
102 |
|
103 |
|
104 |
|
|
|
162 |
VIT
|
163 |
Device : cs_8275 (ANDROID 14)
|
164 |
Runtime : TFLITE
|
165 |
+
Estimated inference time (ms) : 249.1
|
166 |
+
Estimated peak memory usage (MB): [0, 315]
|
167 |
Total # Ops : 1579
|
168 |
Compute Unit(s) : npu (1579 ops) gpu (0 ops) cpu (0 ops)
|
169 |
```
|
|
|
252 |
You can also run the demo on-device.
|
253 |
|
254 |
```bash
|
255 |
+
python -m qai_hub_models.models.vit.demo --eval-mode on-device
|
256 |
```
|
257 |
|
258 |
**NOTE**: If you want running in a Jupyter Notebook or Google Colab like
|
259 |
environment, please add the following to your cell (instead of the above).
|
260 |
```
|
261 |
+
%run -m qai_hub_models.models.vit.demo -- --eval-mode on-device
|
262 |
```
|
263 |
|
264 |
|
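The performance rows this commit adds to README.md are ordinary Markdown table rows, so they can be compared programmatically. The following is a minimal sketch, not part of the commit: it assumes the nine-column layout shown in the table header, and the helper names `parse_rows` and `fastest_runtime` are hypothetical. It picks the lowest-latency runtime for each (precision, device) pair from three rows copied from the diff.

```python
# Three w8a8 rows copied verbatim from the README.md diff in this commit.
ROWS = """\
+| VIT | w8a8 | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite Mobile | TFLITE | 6.133 ms | 0 - 58 MB | NPU | [VIT.tflite](https://huggingface.co/qualcomm/VIT/blob/main/VIT_w8a8.tflite) |
+| VIT | w8a8 | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite Mobile | QNN | 5.766 ms | 0 - 263 MB | NPU | Use Export Script |
+| VIT | w8a8 | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite Mobile | ONNX | 14.788 ms | 0 - 254 MB | NPU | [VIT.onnx](https://huggingface.co/qualcomm/VIT/blob/main/VIT_w8a8.onnx) |
"""


def parse_rows(text):
    """Yield (precision, device, runtime, latency_ms) for each data row.

    Hypothetical helper: assumes the column order
    Model | Precision | Device | Chipset | Target Runtime | Inference Time | ...
    """
    for line in text.splitlines():
        # Drop a leading diff marker ("+") and surrounding whitespace.
        line = line.lstrip("+ ").strip()
        if not line.startswith("| VIT"):
            continue  # skip header, separator, and non-table lines
        cells = [cell.strip() for cell in line.strip("|").split("|")]
        latency_ms = float(cells[5].split()[0])  # "6.133 ms" -> 6.133
        yield cells[1], cells[2], cells[4], latency_ms


def fastest_runtime(rows):
    """Map (precision, device) to the (runtime, latency_ms) with lowest latency."""
    best = {}
    for precision, device, runtime, latency_ms in rows:
        key = (precision, device)
        if key not in best or latency_ms < best[key][1]:
            best[key] = (runtime, latency_ms)
    return best


best = fastest_runtime(parse_rows(ROWS))
for (precision, device), (runtime, latency_ms) in sorted(best.items()):
    print(f"{device} [{precision}]: {runtime} at {latency_ms} ms")
```

Pasting the full set of added rows into `ROWS` should give one winner per device and precision; rows truncated on the removed side of the diff are not suitable input because their latency cells are incomplete.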
VIT_w8a16.onnx
CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
-size
+oid sha256:ca1f62dd5209d2e9781b7328150ba225b195a0a1b46fbb420b08a7b754bd7772
+size 347828616
VIT_w8a8.onnx
CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
-size
+oid sha256:27990f99527adf4cf1acc8da76740f20cc045b7dd465b2fb68134fbad16fc658
+size 347778767
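The two `.onnx` entries above are Git LFS pointer files: each records the SHA-256 digest (`oid`) and byte size of the real model file. A downloaded model can be checked against its pointer as sketched below. This is an illustrative sketch using only the Python standard library; the function names `parse_lfs_pointer` and `matches_pointer` are hypothetical, and the local file path in the usage note is an assumption.

```python
import hashlib
import os

# New pointer contents for VIT_w8a8.onnx, copied from the diff above.
POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:27990f99527adf4cf1acc8da76740f20cc045b7dd465b2fb68134fbad16fc658
size 347778767
"""


def parse_lfs_pointer(text):
    """Return (sha256_hex, size_in_bytes) from the text of a Git LFS pointer."""
    # Each pointer line is "<key> <value>".
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    algo, _, digest = fields["oid"].partition(":")
    if algo != "sha256":
        raise ValueError(f"unsupported oid algorithm: {algo}")
    return digest, int(fields["size"])


def matches_pointer(path, pointer_text, chunk_size=1 << 20):
    """Check that the file at `path` has the size and digest the pointer records."""
    expected_digest, expected_size = parse_lfs_pointer(pointer_text)
    if os.path.getsize(path) != expected_size:
        return False  # cheap size check before hashing a large file
    sha = hashlib.sha256()
    with open(path, "rb") as handle:
        # Hash in chunks so a model of several hundred MB is never fully in memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            sha.update(chunk)
    return sha.hexdigest() == expected_digest
```

Assuming the model was downloaded to the working directory, `matches_pointer("VIT_w8a8.onnx", POINTER)` would return `True` only if both the size and the digest agree with the pointer.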