Spaces:

ash-171
/

accent-detection

Sleeping

ash-171 commited on May 30

Commit

43fea58

verified ·

1 Parent(s): 4cbb775

Update README.md

Files changed (1) hide show

README.md CHANGED Viewed

@@ -14,12 +14,12 @@ license: mit
 # Accent Analyzer
-This is a Streamlit-based web application that analyzes the English accent in spoken videos. Users can provide a public video URL (MP4), receive a transcription of the speech, and ask follow-up questions based on the transcript using Gemma3:1b.
 ## What It Does
 - Accepts a public **MP4 video URL**
-- Extracts audio and transcribes it using **OpenAI Whisper Medium**
 - Detects accent using a **Jzuluaga/accent-id-commonaccent_xlsr-en-english** model
 - Lets users ask **follow-up questions** about the transcript using **Gemma3**
 - Deploys easily on **Hugging Face Spaces** with CPU
@@ -29,7 +29,7 @@ This is a Streamlit-based web application that analyzes the English accent in sp
 ## Tech Stack
 - **Streamlit** — UI
-- **OpenAI Whisper (medium)**: For speech-to-text transcription.
 - **Jzuluaga/accent-id-commonaccent_xlsr-en-english**: For English accent classification.
 - **Gemma3:1b via Ollama**: For generating answers to follow-up questions using context from the transcript.
 - **Docker** — containerized for deployment

 # Accent Analyzer
+This is a Streamlit-based web application that analyzes the English accent in spoken videos. Users can provide a public video URL (MP4), receive a transcription of the speech using Whisper Base, and ask follow-up questions based on the transcript using Gemma3:1b.
 ## What It Does
 - Accepts a public **MP4 video URL**
+- Extracts audio and transcribes it using **OpenAI Whisper Base**
 - Detects accent using a **Jzuluaga/accent-id-commonaccent_xlsr-en-english** model
 - Lets users ask **follow-up questions** about the transcript using **Gemma3**
 - Deploys easily on **Hugging Face Spaces** with CPU
 ## Tech Stack
 - **Streamlit** — UI
+- **OpenAI Whisper (base)**: For speech-to-text transcription.
 - **Jzuluaga/accent-id-commonaccent_xlsr-en-english**: For English accent classification.
 - **Gemma3:1b via Ollama**: For generating answers to follow-up questions using context from the transcript.
 - **Docker** — containerized for deployment