---
license: other
license_name: aplux-model-farm-license
license_link: https://aiot.aidlux.com/api/v1/files/license/model_farm_license_en.pdf
pipeline_tag: automatic-speech-recognition
tags:
- AIoT
- QNN
---

![](https://aiot.aidlux.com/_next/image?url=%2Fapi%2Fv1%2Ffiles%2Fmodel%2Fcover%2F20250326113143__20250326191619.png&w=640&q=75)

## Whisper-Small-En: ASR

Whisper-Small-En, developed by OpenAI, is a mid-sized, English-only speech recognition model based on the Transformer encoder-decoder architecture. With roughly 244M parameters, it is larger than the Tiny and Base versions, which improves transcription accuracy and contextual comprehension. It supports high-accuracy English speech-to-text transcription and voice command analysis, and its training on a large, diverse audio corpus helps it handle accents, background noise, and domain-specific terminology. It suits scenarios that demand reliability, such as professional meetings, medical dictation, and legal documentation, and it balances efficiency and performance on mid-tier GPUs or cloud platforms. Challenges include managing long audio sequences, minimizing real-time latency, and optimizing computational resource allocation.

### Source model

- Input shape: encoder [1x80x3000]; decoder [[1x1],[1x1],[12x12x64x1500],[12x12x1500x64],[12x12x64x224],[12x12x224x64]]
- Number of parameters: 102M (encoder), 139M (decoder)
- Model size: 390M (encoder), 531M (decoder)
- Output shape: encoder [[12x12x64x1500],[12x12x1500x64]]; decoder [[1x1x51864],[12x12x64x224],[12x12x224x64]]
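
The shapes listed above can be cross-checked against a handful of model hyperparameters. The sketch below is illustrative only. It assumes the first group in each list belongs to the encoder and the second to the decoder, that the two `[1x1]` decoder inputs are a token ID and a position index, and that the 4-D tensors are key/value attention caches with keys stored transposed. The constants come from the public Whisper `small.en` configuration, except `MAX_DECODE_LEN`, which is simply read off the listed shapes. All function and variable names are hypothetical.

```python
# Hyperparameters are assumptions based on the public Whisper "small.en"
# configuration; they are not stated in this model card.
N_MELS = 80                   # mel-frequency bins per frame
N_FRAMES = 3000               # 30 s of audio at a 10 ms hop
N_AUDIO_CTX = N_FRAMES // 2   # encoder conv front end has stride 2 -> 1500
N_LAYERS = 12
N_HEADS = 12
HEAD_DIM = 768 // N_HEADS     # 64
N_VOCAB = 51864               # English-only vocabulary size
MAX_DECODE_LEN = 224          # read off the listed cache shapes (assumed decode window)


def kv_cache_shapes(seq_len):
    """Return [key, value] cache shapes; the key layout is assumed transposed."""
    key = (N_LAYERS, N_HEADS, HEAD_DIM, seq_len)
    value = (N_LAYERS, N_HEADS, seq_len, HEAD_DIM)
    return [key, value]


def encoder_io():
    """Encoder: log-mel spectrogram in, cross-attention cache out."""
    inputs = [(1, N_MELS, N_FRAMES)]
    outputs = kv_cache_shapes(N_AUDIO_CTX)
    return inputs, outputs


def decoder_io():
    """Decoder: one token per step, plus cross- and self-attention caches."""
    token, position = (1, 1), (1, 1)  # assumed meaning of the two [1x1] inputs
    cross_cache = kv_cache_shapes(N_AUDIO_CTX)
    self_cache = kv_cache_shapes(MAX_DECODE_LEN)
    inputs = [token, position] + cross_cache + self_cache
    outputs = [(1, 1, N_VOCAB)] + self_cache  # logits, then updated self cache
    return inputs, outputs


def fmt(shape):
    """Format a shape tuple the way the list above writes it, e.g. 1x80x3000."""
    return "x".join(str(dim) for dim in shape)


if __name__ == "__main__":
    for name, (inputs, outputs) in (("encoder", encoder_io()), ("decoder", decoder_io())):
        print(name, "inputs: ", [fmt(s) for s in inputs])
        print(name, "outputs:", [fmt(s) for s in outputs])
```

Under these assumptions, the decoder produces one token per call and carries its own key/value cache forward between calls, which would explain why the self-attention cache shapes appear in both its inputs and its outputs.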

The source model is available in the [OpenAI Whisper repository](https://github.com/openai/whisper/tree/main).

## Performance Reference

Search for this model by name in [Model Farm](https://aiot.aidlux.com/en/models).

## Inference & Model Conversion

Search for this model by name in [Model Farm](https://aiot.aidlux.com/en/models).

## License

- Source Model: [MIT](https://github.com/openai/whisper/blob/main/LICENSE)

- Deployable Model: [APLUX-MODEL-FARM-LICENSE](https://aiot.aidlux.com/api/v1/files/license/model_farm_license_en.pdf)