Update README.md
Browse files
README.md
CHANGED
@@ -1,5 +1,34 @@
|
|
1 |
-
---
|
2 |
-
license: other
|
3 |
-
license_name: aplux-model-farm-license
|
4 |
-
license_link: https://aiot.aidlux.com/api/v1/files/license/model_farm_license_en.pdf
|
5 |
-
---
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
---
|
2 |
+
license: other
|
3 |
+
license_name: aplux-model-farm-license
|
4 |
+
license_link: https://aiot.aidlux.com/api/v1/files/license/model_farm_license_en.pdf
|
5 |
+
---
|
6 |
+
|
7 |
+

|
8 |
+
|
9 |
+
## Whisper-Small-En: ASR
|
10 |
+
|
11 |
+
Whisper-Small-En, developed by OpenAI, is a mid-sized English speech recognition model based on the Transformer architecture, scaling up parameters (~307M) beyond Tiny and Base versions to enhance transcription accuracy and contextual comprehension. It enables high-precision real-time speech-to-text conversion, multilingual translation, and voice command analysis, trained on extensive multimodal data to handle accents, background noise, and domain-specific terminology. Ideal for scenarios demanding reliability, such as professional meetings, medical dictation, legal documentation, or live multilingual translation, it balances efficiency and performance on mid-tier GPUs or cloud platforms. Challenges include managing long audio sequences, minimizing real-time latency, and optimizing computational resource allocation.
|
12 |
+
|
13 |
+
### Source model
|
14 |
+
|
15 |
+
- Input shape: [1x80x3000],[[1x1],[1x1],[12x12x64x1500],[12x12x1500x64],[12x12x64x224],[12x12x224x64]]
|
16 |
+
- Number of parameters: 102M, 139M
|
17 |
+
- Model size: 390M, 531M
|
18 |
+
- Output shape: [[12x12x64x1500],[12x12x1500x64]],[[1x1x51864],[12x12x64x224],[12x12x224x64]]
|
19 |
+
|
20 |
+
The source model can be found [here](https://github.com/openai/whisper/tree/main)
|
21 |
+
|
22 |
+
## Performance Reference
|
23 |
+
|
24 |
+
Please search model by model name in [Model Farm](https://aiot.aidlux.com/en/models)
|
25 |
+
|
26 |
+
## Inference & Model Conversion
|
27 |
+
|
28 |
+
Please search model by model name in [Model Farm](https://aiot.aidlux.com/en/models)
|
29 |
+
|
30 |
+
## License
|
31 |
+
|
32 |
+
- Source Model: [MIT](https://github.com/openai/whisper/blob/main/LICENSE)
|
33 |
+
|
34 |
+
- Deployable Model: [APLUX-MODEL-FARM-LICENSE](https://aiot.aidlux.com/api/v1/files/license/model_farm_license_en.pdf)
|