Spaces:

langtech-innovation
/

WhisperLiveKitDiarization

Paused

qfuxa commited on Mar 1

Commit

48c209f

2 Parent(s): 6ad63d0 519ccaa

Merge branch 'main' of https://github.com/QuentinFuxa/whisper_streaming_web

Files changed (2) hide show

README.md CHANGED Viewed

@@ -25,7 +25,7 @@ This project is based on [Whisper Streaming](https://github.com/ufal/whisper_str
 #### 🚀 **Coming Soon**
-- **Faster Word Validation** – Accelerate real-time transcription by validating high-confidence words immediately upon first appearance for whisper backends that returns words & segment probabilities
 - **Enhanced Diarization Performance** – Optimize speaker identification by implementing longer steps for Diart processing and leveraging language-specific segmentation patterns to improve speaker boundary detection
@@ -78,6 +78,8 @@ This project is based on [Whisper Streaming](https://github.com/ufal/whisper_str
     diart
     ```
 3. **Run the FastAPI Server**:

 #### 🚀 **Coming Soon**
+- **Faster Word Validation** – Accelerate real-time transcription by validating high-confidence words immediately upon first appearance for whisper backends that return word & segment probabilities
 - **Enhanced Diarization Performance** – Optimize speaker identification by implementing longer steps for Diart processing and leveraging language-specific segmentation patterns to improve speaker boundary detection
     diart
     ```
+    Diart uses by default [pyannote.audio](https://github.com/pyannote/pyannote-audio) models from the _huggingface hub_. To use them, please follow the steps described [here](https://github.com/juanmc2005/diart?tab=readme-ov-file#get-access-to--pyannote-models).
 3. **Run the FastAPI Server**:

src/__init__.py DELETED Viewed

File without changes