qfuxa commited on
Commit
ef38b4d
Β·
1 Parent(s): 3679fc2

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +4 -4
README.md CHANGED
@@ -1,13 +1,13 @@
1
- # Whisper Streaming with FastAPI & WebSocket Integration
2
 
3
- A feature-packed fork of [Whisper Streaming](https://github.com/ufal/whisper_streaming) with **real-time speech-to-text (STT) enhancements**, multi-user support, and a JavaScript client πŸŽ™οΈβœ¨
4
 
5
  ## What's New?
6
-
7
  βœ… **FastAPI Server with WebSocket Endpoint** – Enables real-time STT in browsers with async FFmpeg processing.
8
  βœ… **Buffering Preview** – Displays unvalidated buffer content for better streaming feedback.
9
  βœ… **Multiple Users Support** – The backend handles multiple users simultaneously without conflicts.
10
- βœ… **HTML - JavaScript Client Implementation** – A plug-and-play MediaRecorder setup for seamless client integration.
11
  βœ… **MLX Whisper Backend** – Optimized Apple Silicon support for faster local processing.
12
  βœ… **Enhanced sentence segmentation** – Improves buffer trimming and sentence boundaries in certain languages
13
  βœ… **Diarization (Beta)** – Real-time speaker labeling using [Diart](https://github.com/juanmc2005/diart).
 
1
+ # Whisper Streaming Web: Real-time Speech-to-Text with Web UI & FastAPI WebSocket
2
 
3
+ This fork of [Whisper Streaming](https://github.com/ufal/whisper_streaming) adds a ready-to-use HTML interface, making it super easy to start transcribing audio directly from your browser. Just launch the local server, allow microphone access, and start streaming. Everything runs locally on your machine πŸŽ™οΈβœ¨
4
 
5
  ## What's New?
6
+ βœ… **Built-in Web UI** – Just open your browser and start transcribing, no need to build a frontend.
7
  βœ… **FastAPI Server with WebSocket Endpoint** – Enables real-time STT in browsers with async FFmpeg processing.
8
  βœ… **Buffering Preview** – Displays unvalidated buffer content for better streaming feedback.
9
  βœ… **Multiple Users Support** – The backend handles multiple users simultaneously without conflicts.
10
+ βœ… **HTML - JavaScript Client Implementation** – A plug-and-play MediaRecorder setup for seamless client integration
11
  βœ… **MLX Whisper Backend** – Optimized Apple Silicon support for faster local processing.
12
  βœ… **Enhanced sentence segmentation** – Improves buffer trimming and sentence boundaries in certain languages
13
  βœ… **Diarization (Beta)** – Real-time speaker labeling using [Diart](https://github.com/juanmc2005/diart).