Update README.md
Browse files
README.md
CHANGED
@@ -6,22 +6,22 @@ This project is based on [Whisper Streaming](https://github.com/ufal/whisper_str
|
|
6 |
<img src="src/web/demo.png" alt="Demo Screenshot" width="600">
|
7 |
</p>
|
8 |
|
9 |
-
###
|
10 |
|
11 |
#### π **Web & API**
|
12 |
-
- **Built-in Web UI** β No frontend setup
|
13 |
-
- **FastAPI WebSocket Server** β Real-time
|
14 |
-
- **JavaScript Client** β
|
15 |
|
16 |
#### βοΈ **Core Improvements**
|
17 |
-
- **Buffering Preview** β Displays unvalidated transcription segments for
|
18 |
-
- **Multi-User Support** β
|
19 |
- **MLX Whisper Backend** β Optimized for Apple Silicon for faster local processing.
|
20 |
-
- **Enhanced Sentence Segmentation** β
|
21 |
- **Extended Logging** β More detailed logs to improve debugging and monitoring.
|
22 |
|
23 |
#### ποΈ **Advanced Features**
|
24 |
-
- **Real-Time Diarization** β
|
25 |
|
26 |
|
27 |
## Installation
|
|
|
6 |
<img src="src/web/demo.png" alt="Demo Screenshot" width="600">
|
7 |
</p>
|
8 |
|
9 |
+
### Differences from [Whisper Streaming](https://github.com/ufal/whisper_streaming)
|
10 |
|
11 |
#### π **Web & API**
|
12 |
+
- **Built-in Web UI** β No frontend setup required, just open your browser and start transcribing.
|
13 |
+
- **FastAPI WebSocket Server** β Real-time speech-to-text processing with async FFmpeg streaming.
|
14 |
+
- **JavaScript Client** β Ready-to-use MediaRecorder implementation for seamless client-side integration.
|
15 |
|
16 |
#### βοΈ **Core Improvements**
|
17 |
+
- **Buffering Preview** β Displays unvalidated transcription segments for immediate feedback.
|
18 |
+
- **Multi-User Support** β Handles multiple users simultaneously without conflicts.
|
19 |
- **MLX Whisper Backend** β Optimized for Apple Silicon for faster local processing.
|
20 |
+
- **Enhanced Sentence Segmentation** β Improved buffer trimming for better accuracy across languages.
|
21 |
- **Extended Logging** β More detailed logs to improve debugging and monitoring.
|
22 |
|
23 |
#### ποΈ **Advanced Features**
|
24 |
+
- **Real-Time Diarization** β Identify different speakers in real time using [Diart](https://github.com/juanmc2005/diart).
|
25 |
|
26 |
|
27 |
## Installation
|