metadata

title: Accent Classifier + Transcriber
emoji: 🎙️
colorFrom: indigo
colorTo: purple
sdk: gradio
sdk_version: 4.20.0
app_file: app.py
pinned: false

Accent Classifier + Speech Transcriber

This Gradio app allows you to:

How to Use

Option 1: Upload an audio file

Option 2: Upload a video file

Option 3: Paste a direct .mp4 video URL

Loom, YouTube, Dropbox, or other webpage links (they don't serve real video files)
Download the video manually and upload it if needed

Transcription:

Accent Classification:

To set this up and run locally, follow these steps:

git clone https://huggingface.co/spaces/usamaijaz-ai/accent-classifier
cd accent-classifier

python -m venv venv
source venv/bin/activate  # On Windows: venv\Scripts\activate

pip install -r requirements.txt

If there’s no requirements.txt, use:

pip install gradio==4.20.0 transformers torch moviepy==1.0.3 requests safetensors soundfile scipy

python app.py

Access in your browser
Visit http://localhost:7860 to use the app locally.

Audio is extracted (if input is a video)
Audio is converted to .wav and resampled to 16kHz
Speech is transcribed using Whisper
Accent is classified using a Wav2Vec2 model
Output includes:
- Top accent prediction
- Confidence score
- Top-5 accent list
- Full transcription