ash-171 committed on
Commit 43fea58 · verified · 1 Parent(s): 4cbb775

Update README.md

Files changed (1)
  1. README.md +3 -3
README.md CHANGED
@@ -14,12 +14,12 @@ license: mit
 
  # Accent Analyzer
 
- This is a Streamlit-based web application that analyzes the English accent in spoken videos. Users can provide a public video URL (MP4), receive a transcription of the speech, and ask follow-up questions based on the transcript using Gemma3:1b.
+ This is a Streamlit-based web application that analyzes the English accent in spoken videos. Users can provide a public video URL (MP4), receive a transcription of the speech using Whisper Base, and ask follow-up questions based on the transcript using Gemma3:1b.
 
  ## What It Does
 
  - Accepts a public **MP4 video URL**
- - Extracts audio and transcribes it using **OpenAI Whisper Medium**
+ - Extracts audio and transcribes it using **OpenAI Whisper Base**
  - Detects accent using a **Jzuluaga/accent-id-commonaccent_xlsr-en-english** model
  - Lets users ask **follow-up questions** about the transcript using **Gemma3**
  - Deploys easily on **Hugging Face Spaces** with CPU
@@ -29,7 +29,7 @@ This is a Streamlit-based web application that analyzes the English accent in sp
  ## Tech Stack
 
  - **Streamlit** — UI
- - **OpenAI Whisper (medium)**: For speech-to-text transcription.
+ - **OpenAI Whisper (base)**: For speech-to-text transcription.
  - **Jzuluaga/accent-id-commonaccent_xlsr-en-english**: For English accent classification.
  - **Gemma3:1b via Ollama**: For generating answers to follow-up questions using context from the transcript.
  - **Docker** — containerized for deployment
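
For context, the transcription step this commit touches might look roughly like the sketch below. It is not taken from the repository: the names `WHISPER_MODEL_SIZE`, `build_ffmpeg_command`, and `transcribe` are hypothetical, and it assumes the `openai-whisper` package and an `ffmpeg` binary are installed. Under those assumptions, switching from Whisper Medium to Whisper Base would come down to changing the model-size string passed to `whisper.load_model`.

```python
import subprocess

# Hypothetical setting: the README change corresponds to "medium" -> "base".
WHISPER_MODEL_SIZE = "base"


def build_ffmpeg_command(video_url: str, wav_path: str) -> list[str]:
    """Build an ffmpeg command that extracts mono 16 kHz audio from a video."""
    return [
        "ffmpeg",
        "-y",            # overwrite the output file if it already exists
        "-i", video_url,  # input: a public MP4 URL or a local path
        "-vn",           # drop the video stream
        "-ac", "1",      # downmix to one audio channel
        "-ar", "16000",  # resample to 16 kHz
        wav_path,
    ]


def transcribe(video_url: str, wav_path: str = "audio.wav",
               model_size: str = WHISPER_MODEL_SIZE) -> str:
    """Extract audio with ffmpeg, then transcribe it with Whisper."""
    # Imported lazily so the module loads even without openai-whisper installed.
    import whisper

    subprocess.run(build_ffmpeg_command(video_url, wav_path), check=True)
    model = whisper.load_model(model_size)
    result = model.transcribe(wav_path)
    return result["text"]
```

Calling `transcribe("https://example.com/clip.mp4")` with a placeholder URL like this one would download the model weights on first use. The smaller `base` checkpoint is generally faster and lighter on CPU than `medium`, at some cost in transcription accuracy.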