Spaces: Rafii/SpeechSegmenter
Rafii committed 11 days ago

Commit cb012cd (verified)

1 Parent(s): 40749dc

Update README.md

Files changed (1)

README.md CHANGED

@@ -1,5 +1,5 @@
 ---
-title: Audio Processing
+title: Speech Segmenter (STT)
 emoji: 🏃
 colorFrom: gray
 colorTo: blue
@@ -9,4 +9,13 @@ app_file: app.py
 pinned: false
 ---
-Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference
+# What this app can do
+This app is an advanced **Speech-to-Text (STT)** pipeline enhanced with alignment and speaker diarization:
+- **STT (Speech-to-Text):** Converts spoken audio into written text (transcription).
+- **Alignment:** Aligns words with their timestamps in the audio (word-level timing).
+- **Speaker Diarization:** Detects and labels who spoke when, so each part of the transcript can be attributed to a speaker.
+- **Post-processing:** Combines the transcription, word timings, and speaker labels to produce a richer, structured transcript.
+Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference
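
The post-processing step described in the new README text can be illustrated with a small sketch. This is not the Space's actual `app.py` code, which is not shown in this commit. The `Word` and `Turn` types, the function names, the `SPEAKER_00`-style labels, and the "largest time overlap" rule are all assumptions chosen for illustration. The sketch takes word-level timestamps (from alignment) and speaker turns (from diarization), labels each word with the speaker whose turn overlaps it most, and merges consecutive words from the same speaker into segments.

```python
from dataclasses import dataclass


@dataclass
class Word:
    """Hypothetical aligned word: text plus start/end time in seconds."""
    text: str
    start: float
    end: float


@dataclass
class Turn:
    """Hypothetical diarization turn: one speaker active over a time span."""
    speaker: str
    start: float
    end: float


def overlap(a_start, a_end, b_start, b_end):
    """Length in seconds of the intersection of two intervals (0 if disjoint)."""
    return max(0.0, min(a_end, b_end) - max(a_start, b_start))


def assign_speaker(word, turns):
    """Pick the speaker whose turn overlaps the word the most."""
    best = max(
        turns,
        key=lambda t: overlap(word.start, word.end, t.start, t.end),
        default=None,
    )
    if best is None or overlap(word.start, word.end, best.start, best.end) == 0.0:
        return "UNKNOWN"  # assumed fallback label when no turn covers the word
    return best.speaker


def build_transcript(words, turns):
    """Merge consecutive same-speaker words into structured segments."""
    segments = []
    for word in words:
        speaker = assign_speaker(word, turns)
        if segments and segments[-1]["speaker"] == speaker:
            # Same speaker as the previous word: extend the open segment.
            segments[-1]["text"] += " " + word.text
            segments[-1]["end"] = word.end
        else:
            segments.append(
                {"speaker": speaker, "start": word.start,
                 "end": word.end, "text": word.text}
            )
    return segments


# Made-up example data, not output from the app.
words = [Word("hello", 0.0, 0.4), Word("there", 0.5, 0.9), Word("hi", 1.1, 1.3)]
turns = [Turn("SPEAKER_00", 0.0, 1.0), Turn("SPEAKER_01", 1.0, 2.0)]

for segment in build_transcript(words, turns):
    print(segment)
```

Assigning by largest overlap is one simple choice. A real pipeline may handle overlapping speech or words that straddle a turn boundary differently.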