Spaces:
Running
Running
Update README.md
Browse files
README.md
CHANGED
@@ -14,12 +14,12 @@ license: mit
|
|
14 |
|
15 |
# Accent Analyzer
|
16 |
|
17 |
-
This is a Streamlit-based web application that analyzes the English accent in spoken videos. Users can provide a public video URL (MP4), receive a transcription of the speech, and ask follow-up questions based on the transcript using Gemma3:1b.
|
18 |
|
19 |
## What It Does
|
20 |
|
21 |
- Accepts a public **MP4 video URL**
|
22 |
-
- Extracts audio and transcribes it using **OpenAI Whisper
|
23 |
- Detects accent using a **Jzuluaga/accent-id-commonaccent_xlsr-en-english** model
|
24 |
- Lets users ask **follow-up questions** about the transcript using **Gemma3**
|
25 |
- Deploys easily on **Hugging Face Spaces** with CPU
|
@@ -29,7 +29,7 @@ This is a Streamlit-based web application that analyzes the English accent in sp
|
|
29 |
## Tech Stack
|
30 |
|
31 |
- **Streamlit** — UI
|
32 |
-
- **OpenAI Whisper (
|
33 |
- **Jzuluaga/accent-id-commonaccent_xlsr-en-english**: For English accent classification.
|
34 |
- **Gemma3:1b via Ollama**: For generating answers to follow-up questions using context from the transcript.
|
35 |
- **Docker** — containerized for deployment
|
|
|
14 |
|
15 |
# Accent Analyzer
|
16 |
|
17 |
+
This is a Streamlit-based web application that analyzes the English accent in spoken videos. Users can provide a public video URL (MP4), receive a transcription of the speech using Whisper Base, and ask follow-up questions based on the transcript using Gemma3:1b.
|
18 |
|
19 |
## What It Does
|
20 |
|
21 |
- Accepts a public **MP4 video URL**
|
22 |
+
- Extracts audio and transcribes it using **OpenAI Whisper Base**
|
23 |
- Detects accent using a **Jzuluaga/accent-id-commonaccent_xlsr-en-english** model
|
24 |
- Lets users ask **follow-up questions** about the transcript using **Gemma3**
|
25 |
- Deploys easily on **Hugging Face Spaces** with CPU
|
|
|
29 |
## Tech Stack
|
30 |
|
31 |
- **Streamlit** — UI
|
32 |
+
- **OpenAI Whisper (base)**: For speech-to-text transcription.
|
33 |
- **Jzuluaga/accent-id-commonaccent_xlsr-en-english**: For English accent classification.
|
34 |
- **Gemma3:1b via Ollama**: For generating answers to follow-up questions using context from the transcript.
|
35 |
- **Docker** — containerized for deployment
|