qfuxa commited on
Commit
b3a32cd
Β·
1 Parent(s): 818897e

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +8 -8
README.md CHANGED
@@ -6,22 +6,22 @@ This project is based on [Whisper Streaming](https://github.com/ufal/whisper_str
6
  <img src="src/web/demo.png" alt="Demo Screenshot" width="600">
7
  </p>
8
 
9
- ### What's New?
10
 
11
  #### 🌐 **Web & API**
12
- - **Built-in Web UI** – No frontend setup needed, just open your browser and start transcribing.
13
- - **FastAPI WebSocket Server** – Real-time STT processing with async FFmpeg streaming.
14
- - **JavaScript Client** – A ready-to-use MediaRecorder implementation that can be copied on your client side.
15
 
16
  #### βš™οΈ **Core Improvements**
17
- - **Buffering Preview** – Displays unvalidated transcription segments for better feedback.
18
- - **Multi-User Support** – Handle multiple users simultaneously without conflicts.
19
  - **MLX Whisper Backend** – Optimized for Apple Silicon for faster local processing.
20
- - **Enhanced Sentence Segmentation** – Better buffer trimming for better accuracy across languages.
21
  - **Extended Logging** – More detailed logs to improve debugging and monitoring.
22
 
23
  #### πŸŽ™οΈ **Advanced Features**
24
- - **Real-Time Diarization** – Recognize different speakers in real time using [Diart](https://github.com/juanmc2005/diart).
25
 
26
 
27
  ## Installation
 
6
  <img src="src/web/demo.png" alt="Demo Screenshot" width="600">
7
  </p>
8
 
9
+ ### Differences from [Whisper Streaming](https://github.com/ufal/whisper_streaming)
10
 
11
  #### 🌐 **Web & API**
12
+ - **Built-in Web UI** – No frontend setup required, just open your browser and start transcribing.
13
+ - **FastAPI WebSocket Server** – Real-time speech-to-text processing with async FFmpeg streaming.
14
+ - **JavaScript Client** – Ready-to-use MediaRecorder implementation for seamless client-side integration.
15
 
16
  #### βš™οΈ **Core Improvements**
17
+ - **Buffering Preview** – Displays unvalidated transcription segments for immediate feedback.
18
+ - **Multi-User Support** – Handles multiple users simultaneously without conflicts.
19
  - **MLX Whisper Backend** – Optimized for Apple Silicon for faster local processing.
20
+ - **Enhanced Sentence Segmentation** – Improved buffer trimming for better accuracy across languages.
21
  - **Extended Logging** – More detailed logs to improve debugging and monitoring.
22
 
23
  #### πŸŽ™οΈ **Advanced Features**
24
+ - **Real-Time Diarization** – Identify different speakers in real time using [Diart](https://github.com/juanmc2005/diart).
25
 
26
 
27
  ## Installation