Spaces:
Sleeping
Sleeping
metadata
title: Accent Classifier
emoji: ποΈ
colorFrom: teal
colorTo: cyan
sdk: gradio
sdk_version: 3.38.1
app_file: app.py
pinned: false
Accent Classifier ποΈ
This app downloads a public YouTube or Vimeo video, extracts its audio, and classifies the speaker's accent (or rather, speaker ID as a proxy) using a Hugging Face model.
How it works
- You provide a video URL.
- The app downloads the audio using
yt-dlp
. - It extracts the audio in a format suitable for the model (
wav
, 16kHz, mono). - It runs the
superb/wav2vec2-base-superb-sid
model to classify the speaker. - Displays the predicted speaker ID and confidence.
Requirements
- Python 3.8+
yt-dlp
ffmpeg
installed on your system and accessible from the command line.gradio
for the UI.transformers
from Hugging Face.
Usage
Run the app:
python app.py