Rafii committed on
Commit
cb012cd
·
verified ·
1 Parent(s): 40749dc

Update README.md

Files changed (1)
  1. README.md +11 -2
README.md CHANGED
@@ -1,5 +1,5 @@
1
  ---
2
- title: Audio Processing
3
  emoji: 🏃
4
  colorFrom: gray
5
  colorTo: blue
@@ -9,4 +9,13 @@ app_file: app.py
9
  pinned: false
10
  ---
11
 
12
- Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference
1
  ---
2
+ title: Speech Segmenter (STT)
3
  emoji: 🏃
4
  colorFrom: gray
5
  colorTo: blue
 
9
  pinned: false
10
  ---
11
 
12
+ # What this app can do
13
+
14
+ This app is a **Speech-to-Text (STT)** pipeline extended with word-level alignment and speaker diarization:
15
+
16
+ - **STT (Speech-to-Text):** Converts spoken audio into written text (transcription).
17
+ - **Alignment:** Aligns words with their timestamps in the audio (word-level timing).
18
+ - **Speaker Diarization:** Detects and labels who spoke when, so each part of the transcript can be attributed to a speaker.
19
+ - **Post-processing:** Combines the transcription, word timings, and speaker labels into a single structured transcript.
20
+
21
+ Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference
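
The post-processing step described in the new README text can be illustrated with a small sketch. This is not the app's actual code, which the commit does not show. It assumes the alignment stage yields words with start and end times, and the diarization stage yields speaker turns with start and end times. The names `Word`, `Turn`, `assign_speakers`, and `group_segments` are hypothetical and chosen only for illustration. Each word is given the speaker whose turn overlaps it the most, and consecutive words from the same speaker are merged into one segment.

```python
from dataclasses import dataclass


@dataclass
class Word:
    """A transcribed word with its aligned timestamps (hypothetical shape)."""
    text: str
    start: float
    end: float


@dataclass
class Turn:
    """A diarization turn: one speaker talking over a time span (hypothetical shape)."""
    speaker: str
    start: float
    end: float


def overlap(a_start, a_end, b_start, b_end):
    """Return the length in seconds of the intersection of two intervals."""
    return max(0.0, min(a_end, b_end) - max(a_start, b_start))


def assign_speakers(words, turns):
    """Pair each word with the speaker whose turn overlaps it the most."""
    labeled = []
    for word in words:
        best = max(
            turns,
            key=lambda t: overlap(word.start, word.end, t.start, t.end),
            default=None,
        )
        # Fall back to a placeholder label when no turn overlaps the word.
        if best is None or overlap(word.start, word.end, best.start, best.end) == 0.0:
            speaker = "UNKNOWN"
        else:
            speaker = best.speaker
        labeled.append((speaker, word))
    return labeled


def group_segments(labeled):
    """Merge consecutive words from the same speaker into transcript segments."""
    segments = []
    for speaker, word in labeled:
        if segments and segments[-1]["speaker"] == speaker:
            segments[-1]["text"] += " " + word.text
            segments[-1]["end"] = word.end
        else:
            segments.append(
                {"speaker": speaker, "start": word.start, "end": word.end, "text": word.text}
            )
    return segments
```

Calling `group_segments(assign_speakers(words, turns))` would give a list of dictionaries, one per speaker segment, each with `speaker`, `start`, `end`, and `text` keys. Picking the speaker by maximum overlap is one simple choice; a real pipeline might handle overlapping speech or ties differently.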