FBAGSTM committed on
Commit
0b6b66d
·
verified ·
1 Parent(s): f0b1fa1

Update ST Model Zoo

Files changed (1)
  1. README.md +2 -11
README.md CHANGED
@@ -1,10 +1,3 @@
1
- ---
2
- license: other
3
- license_name: sla0044
4
- license_link: >-
5
- https://github.com/STMicroelectronics/stm32aimodelzoo/speech_enhancement/LICENSE.md
6
- pipeline_tag: audio-to-audio
7
- ---
8
  # STFT-TCNN
9
 
10
  ## **Use case** : `speech enhancement`
@@ -62,7 +55,7 @@ Measures are done with default STEDGEAI configuration with enabled input / outpu
62
  ### Reference **NPU** memory footprint
63
  |Model | Dataset | Format | Resolution | Series | Internal RAM | External RAM | Weights Flash | STM32Cube.AI version | STEdgeAI Core version |
64
  |----------|------------------|--------|-------------|------------------|------------------|---------------------|-------|----------------------|-------------------------|
65
- | [STFT-TCNN Medium](https://github.com/STMicroelectronics/stm32ai-modelzoo/tree/main/speech_enhancement/stft_tcnn/ST_pretrainedmodel_public_dataset/valentini/stft_tcnn_medium_sigmoid_257x40_qdq_int8.onnx) | valentini | Int8 | 257x40 | STM32N6 | 100.09 | 0.0 | 1626.86 | 10.0.0 | 2.0.0 |
66
 
67
  ### Reference **NPU** inference time
68
 
@@ -75,7 +68,7 @@ You can expect significant improvements once this issue is resolved.
75
 
76
  | Model | Dataset | Format | Resolution | Board | Execution Engine | Inference time (ms) | Inf / sec | STM32Cube.AI version | STEdgeAI Core version |
77
  |--------|------------------|--------|-------------|------------------|------------------|---------------------|-------|----------------------|-------------------------|
78
- | [STFT-TCNN medium](https://github.com/STMicroelectronics/stm32ai-modelzoo/tree/main/speech_enhancement/stft_tcnn/ST_pretrainedmodel_public_dataset/valentini/stft_tcnn_medium_sigmoid_257x40_qdq_int8.onnx) | valentini | Int8 | 257x40 | STM32N6570-DK | NPU/MCU | 53.76 | 18.6 | 10.0.0 | 2.0.0 |
79
 
80
 
81
  ### Metrics on the Valentini dataset
@@ -101,5 +94,3 @@ The models provided here typically have trouble denoising speech at SNRs beyond
101
 
102
  Please refer to the stm32ai-modelzoo-services GitHub [here](https://github.com/STMicroelectronics/stm32ai-modelzoo-services)
103
 
104
-
105
-
 
1
  # STFT-TCNN
2
 
3
  ## **Use case** : `speech enhancement`
 
55
  ### Reference **NPU** memory footprint
56
  |Model | Dataset | Format | Resolution | Series | Internal RAM | External RAM | Weights Flash | STM32Cube.AI version | STEdgeAI Core version |
57
  |----------|------------------|--------|-------------|------------------|------------------|---------------------|-------|----------------------|-------------------------|
58
+ | [STFT-TCNN Medium](https://github.com/STMicroelectronics/stm32ai-modelzoo/tree/main/speech_enhancement/stft_tcnn/ST_pretrainedmodel_public_dataset/valentini/stft_tcnn_medium_sigmoid_257x40_qdq_int8.onnx) | valentini | Int8 | 257x40 | STM32N6 | 100.09 | 0.0 | 1599.39 | 10.2.0 | 2.2.0 |
59
 
60
  ### Reference **NPU** inference time
61
 
 
68
 
69
  | Model | Dataset | Format | Resolution | Board | Execution Engine | Inference time (ms) | Inf / sec | STM32Cube.AI version | STEdgeAI Core version |
70
  |--------|------------------|--------|-------------|------------------|------------------|---------------------|-------|----------------------|-------------------------|
71
+ | [STFT-TCNN medium](https://github.com/STMicroelectronics/stm32ai-modelzoo/tree/main/speech_enhancement/stft_tcnn/ST_pretrainedmodel_public_dataset/valentini/stft_tcnn_medium_sigmoid_257x40_qdq_int8.onnx) | valentini | Int8 | 257x40 | STM32N6570-DK | NPU/MCU | 52.09 | 19.19 | 10.2.0 | 2.2.0 |
72
 
73
 
74
  ### Metrics on the Valentini dataset
 
94
 
95
  Please refer to the stm32ai-modelzoo-services GitHub [here](https://github.com/STMicroelectronics/stm32ai-modelzoo-services)
96
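
The two updated table rows can be sanity-checked with a little arithmetic. The sketch below is illustrative only: it assumes the `Inf / sec` column is derived as 1000 divided by the inference time in milliseconds (the diff does not state this), and the helper names `inferences_per_second` and `relative_change_percent` are hypothetical. The input numbers are copied from the removed and added rows above.

```python
def inferences_per_second(latency_ms: float) -> float:
    """Throughput implied by a per-inference latency given in milliseconds."""
    return 1000.0 / latency_ms


def relative_change_percent(old: float, new: float) -> float:
    """Signed change from old to new, as a percentage of old."""
    return (new - old) / old * 100.0


# Values taken from the diff (STEdgeAI Core 2.0.0 row vs. 2.2.0 row).
old_latency_ms, new_latency_ms = 53.76, 52.09
old_weights_flash, new_weights_flash = 1626.86, 1599.39

print(f"old throughput: {inferences_per_second(old_latency_ms):.2f} inf/s")
print(f"new throughput: {inferences_per_second(new_latency_ms):.2f} inf/s")
print(f"latency change: {relative_change_percent(old_latency_ms, new_latency_ms):.2f} %")
print(f"weights flash change: {relative_change_percent(old_weights_flash, new_weights_flash):.2f} %")
```

Under that assumption, the computed throughput for 52.09 ms lands within about 0.01 of the 19.19 shown in the new row, so the published figure appears to be truncated rather than rounded.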