Abid Ali Awan commited on
Commit
3fa36e3
·
1 Parent(s): ca9beed

Update app.py to change model loading to use float16 for improved performance, and remove dynamic quantization. Update README.md to change the emoji and adjust color settings for better visual consistency.

Browse files
Files changed (2) hide show
  1. README.md +2 -5
  2. app.py +1 -3
README.md CHANGED
@@ -1,8 +1,8 @@
1
  ---
2
  title: Transcribed Urdu
3
- emoji: ☪️
4
  colorFrom: indigo
5
- colorTo: blue
6
  sdk: gradio
7
  sdk_version: 5.35.0
8
  app_file: app.py
@@ -14,6 +14,3 @@ short_description: The most accurate Urdu speech recognition app.
14
  # whisper-large-v3-turbo-urdu
15
 
16
  This model is a fine-tuned version of [openai/whisper-large-v3-turbo](https://huggingface.co/openai/whisper-large-v3-turbo) on the common_voice_17_0 dataset.
17
- It achieves the following results on the evaluation set:
18
- - Loss: 0.4630
19
- - Wer: 0.3826
 
1
  ---
2
  title: Transcribed Urdu
3
+ emoji: 🎙️
4
  colorFrom: indigo
5
+ colorTo: indigo
6
  sdk: gradio
7
  sdk_version: 5.35.0
8
  app_file: app.py
 
14
  # whisper-large-v3-turbo-urdu
15
 
16
  This model is a fine-tuned version of [openai/whisper-large-v3-turbo](https://huggingface.co/openai/whisper-large-v3-turbo) on the common_voice_17_0 dataset.
 
 
 
app.py CHANGED
@@ -27,11 +27,9 @@ model_id = "kingabzpro/whisper-large-v3-turbo-urdu"
27
  # Load in fp32 and quantize to int8
28
  model = AutoModelForSpeechSeq2Seq.from_pretrained(
29
  model_id,
30
- torch_dtype=torch.float32,
31
  use_safetensors=True,
32
  )
33
- model.eval()
34
- model = torch.quantization.quantize_dynamic(model, {torch.nn.Linear}, dtype=torch.qint8)
35
 
36
  processor = AutoProcessor.from_pretrained(model_id)
37
 
 
27
  # Load in fp32 and quantize to int8
28
  model = AutoModelForSpeechSeq2Seq.from_pretrained(
29
  model_id,
30
+ torch_dtype=torch.float16,
31
  use_safetensors=True,
32
  )
 
 
33
 
34
  processor = AutoProcessor.from_pretrained(model_id)
35