add coming soon
Browse files
README.md
CHANGED
@@ -3,7 +3,7 @@
|
|
3 |
This project is based on [Whisper Streaming](https://github.com/ufal/whisper_streaming) and lets you transcribe audio directly from your browser. Simply launch the local server and grant microphone access. Everything runs locally on your machine β¨
|
4 |
|
5 |
<p align="center">
|
6 |
-
<img src="
|
7 |
</p>
|
8 |
|
9 |
### Differences from [Whisper Streaming](https://github.com/ufal/whisper_streaming)
|
@@ -23,6 +23,11 @@ This project is based on [Whisper Streaming](https://github.com/ufal/whisper_str
|
|
23 |
#### ποΈ **Advanced Features**
|
24 |
- **Real-Time Diarization** β Identify different speakers in real time using [Diart](https://github.com/juanmc2005/diart).
|
25 |
|
|
|
|
|
|
|
|
|
|
|
26 |
|
27 |
## Installation
|
28 |
|
|
|
3 |
This project is based on [Whisper Streaming](https://github.com/ufal/whisper_streaming) and lets you transcribe audio directly from your browser. Simply launch the local server and grant microphone access. Everything runs locally on your machine β¨
|
4 |
|
5 |
<p align="center">
|
6 |
+
<img src="web/demo.png" alt="Demo Screenshot" width="600">
|
7 |
</p>
|
8 |
|
9 |
### Differences from [Whisper Streaming](https://github.com/ufal/whisper_streaming)
|
|
|
23 |
#### ποΈ **Advanced Features**
|
24 |
- **Real-Time Diarization** β Identify different speakers in real time using [Diart](https://github.com/juanmc2005/diart).
|
25 |
|
26 |
+
#### π **Coming Soon**
|
27 |
+
|
28 |
+
- **Faster Word Validation** β Accelerate real-time transcription by validating high-confidence words immediately upon first appearance for whisper backends that returns words & segment probabilities
|
29 |
+
- **Enhanced Diarization Performance** β Optimize speaker identification by implementing longer steps for Diart processing and leveraging language-specific segmentation patterns to improve speaker boundary detection
|
30 |
+
|
31 |
|
32 |
## Installation
|
33 |
|