GavinHuang commited on
Commit
4efbce4
ยท
1 Parent(s): bb03f90

update README for real-time speech-to-text application and remove spaces.GPU decorator from load_model function

Browse files
Files changed (2) hide show
  1. README.md +29 -3
  2. app.py +0 -2
README.md CHANGED
@@ -1,6 +1,6 @@
1
  ---
2
- title: Asr Demo
3
- emoji: ๐Ÿš€
4
  colorFrom: indigo
5
  colorTo: gray
6
  sdk: gradio
@@ -9,4 +9,30 @@ app_file: app.py
9
  pinned: false
10
  ---
11
 
12
- Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
+ title: Real-time Speech-to-Text
3
+ emoji: ๐ŸŽ™๏ธ
4
  colorFrom: indigo
5
  colorTo: gray
6
  sdk: gradio
 
9
  pinned: false
10
  ---
11
 
12
+ # Real-time Speech-to-Text with NeMo
13
+
14
+ This is a real-time speech-to-text transcription application powered by NVIDIA NeMo and the parakeet-tdt-0.6b-v2 model.
15
+
16
+ ## Features
17
+
18
+ - ๐ŸŽ™๏ธ Web-based microphone input
19
+ - โšก Real-time transcription displayed in the browser
20
+ - ๐Ÿง  Fast inference with NeMo pre-trained model
21
+ - ๐Ÿ› ๏ธ Easy to use, no installations required
22
+
23
+ ## Tech Stack
24
+
25
+ - Python
26
+ - Gradio
27
+ - NVIDIA NeMo Toolkit for ASR
28
+
29
+ ## How to Use
30
+
31
+ 1. Click the microphone button to start recording
32
+ 2. Speak clearly into your microphone
33
+ 3. The transcription will appear in real-time
34
+ 4. Click 'Clear Transcript' to start a new transcription
35
+
36
+ ## Note
37
+
38
+ This application requires access to your microphone to function. The audio is processed in real-time and is not stored.
app.py CHANGED
@@ -4,7 +4,6 @@ import torch
4
  import nemo.collections.asr as nemo_asr
5
  from omegaconf import OmegaConf
6
  import time
7
- import spaces
8
 
9
  # Check if CUDA is available
10
  print(f"CUDA available: {torch.cuda.is_available()}")
@@ -12,7 +11,6 @@ if torch.cuda.is_available():
12
  print(f"CUDA device: {torch.cuda.get_device_name(0)}")
13
 
14
  # Initialize the ASR model - removed spaces.GPU decorator due to pickling issues
15
- @spaces.GPU
16
  def load_model():
17
  print("Loading ASR model...")
18
  # Load the NVIDIA NeMo ASR model
 
4
  import nemo.collections.asr as nemo_asr
5
  from omegaconf import OmegaConf
6
  import time
 
7
 
8
  # Check if CUDA is available
9
  print(f"CUDA available: {torch.cuda.is_available()}")
 
11
  print(f"CUDA device: {torch.cuda.get_device_name(0)}")
12
 
13
  # Initialize the ASR model - removed spaces.GPU decorator due to pickling issues
 
14
  def load_model():
15
  print("Loading ASR model...")
16
  # Load the NVIDIA NeMo ASR model