Yuantao Feng
commited on
Commit
·
2ef6bc9
1
Parent(s):
3af1dea
Add hardware GPU CUDA: Jetson Nano (#8)
Browse files* update benchmark results on Jetson Nano
README.md
CHANGED
@@ -11,30 +11,23 @@ Guidelines:
|
|
11 |
Hardware Setup:
|
12 |
- `CPU x86_64`: INTEL CPU i7-5930K @ 3.50GHz, 6 cores, 12 threads.
|
13 |
- `CPU ARM`: Raspberry 4B, BCM2711B0 @ 1.5GHz (Cortex A-72), 4 cores, 4 threads.
|
14 |
-
<!--
|
15 |
- `GPU CUDA`: NVIDIA Jetson Nano B01, 128-core Maxwell, Quad-core ARM A57 @ 1.43 GHz.
|
16 |
-
-->
|
17 |
|
18 |
***Important Notes***:
|
19 |
- The time data that shown on the following table presents the time elapsed from preprocess (resize is excluded), to a forward pass of a network, and postprocess to get final results.
|
20 |
- The time data that shown on the following table is the median of 10 runs. Different metrics may be applied to some specific models.
|
21 |
- View [benchmark/config](./benchmark/config) for more details on benchmarking different models.
|
22 |
|
23 |
-
|
24 |
-
| Model | Input Size | CPU x86_64 (ms) | CPU ARM (ms) | GPU CUDA (ms) |
|
25 |
|-------|------------|-----------------|--------------|---------------|
|
26 |
-
| [YuNet](./models/face_detection_yunet)
|
27 |
-
| [DB](./models/text_detection_db)
|
28 |
-
| [CRNN](./models/text_recognition_crnn)
|
29 |
-
|
30 |
-
|
|
31 |
-
|
32 |
-
|
33 |
-
|
34 |
-
| [CRNN](./models/text_recognition_crnn) | 100x32 | 50.21 | 234.32 |
|
35 |
-
| [SFace](./models/face_recognition_sface) | 112x112 | 8.65 | 99.20 |
|
36 |
-
| [PP-ResNet](./models/image_classification_ppresnet) | 224x224 | 56.05 | 602.58
|
37 |
-
| [PP-HumanSeg](./models/human_segmentation_pphumanseg) | 192x192 | 19.92 | 105.32 |
|
38 |
|
39 |
## License
|
40 |
|
|
|
11 |
Hardware Setup:
|
12 |
- `CPU x86_64`: INTEL CPU i7-5930K @ 3.50GHz, 6 cores, 12 threads.
|
13 |
- `CPU ARM`: Raspberry 4B, BCM2711B0 @ 1.5GHz (Cortex A-72), 4 cores, 4 threads.
|
|
|
14 |
- `GPU CUDA`: NVIDIA Jetson Nano B01, 128-core Maxwell, Quad-core ARM A57 @ 1.43 GHz.
|
|
|
15 |
|
16 |
***Important Notes***:
|
17 |
- The time data that shown on the following table presents the time elapsed from preprocess (resize is excluded), to a forward pass of a network, and postprocess to get final results.
|
18 |
- The time data that shown on the following table is the median of 10 runs. Different metrics may be applied to some specific models.
|
19 |
- View [benchmark/config](./benchmark/config) for more details on benchmarking different models.
|
20 |
|
21 |
+
| Model | Input Size | CPU x86_64 (ms) | CPU ARM (ms) | GPU CUDA (ms)* |
|
|
|
22 |
|-------|------------|-----------------|--------------|---------------|
|
23 |
+
| [YuNet](./models/face_detection_yunet) | 160x120 | 1.45 | 6.22 | 12.18 |
|
24 |
+
| [DB](./models/text_detection_db) | 640x480 | 137.38 | 2780.78 | 220.52 |
|
25 |
+
| [CRNN](./models/text_recognition_crnn) | 100x32 | 50.21 | 234.32 | 196.15 |
|
26 |
+
| [SFace](./models/face_recognition_sface) | 112x112 | 8.65 | 99.20 | 24.88 |
|
27 |
+
| [PP-ResNet](./models/image_classification_ppresnet) | 224x224 | 56.05 | 602.58 | 98.64 |
|
28 |
+
| [PP-HumanSeg](./models/human_segmentation_pphumanseg) | 192x192 | 19.92 | 105.32 | 67.97 |
|
29 |
+
|
30 |
+
*: Batch size is 1.
|
|
|
|
|
|
|
|
|
31 |
|
32 |
## License
|
33 |
|