Spaces:
Running
Running
metadata
title: Audio Translator
emoji: π₯
colorFrom: pink
colorTo: purple
sdk: gradio
sdk_version: 5.31.0
app_file: app.py
pinned: false
license: apache-2.0
short_description: Audio Translator
π£οΈ Audio Translator
π Overview
Combine ASR, machine translation, and neural TTS into one seamless audio pipelineβ100 % CPU on free-tier HF Spaces.
Upload speech, auto-detect language, translate into English or Spanish, then hear it spoken back.
AI buzzwords:
β’ Automatic Speech Recognition (ASR) β’ Whisper Tiny β’ Neural Machine Translation β’ GoogleTranslator β’ Text-to-Speech β’ gTTS β’ Multi-modal AI β’ End-to-End Inference β’ Real-Time β’ Edge Deployment
β¨ Features
π Feature | π Description |
---|---|
ποΈ ASR: Whisper-Tiny | Lightning-fast, on-device speech transcription (all languages) |
π Translation | Bidirectional English β Spanish via Deep-Translator |
π£οΈ Neural TTS | High-quality audio playback via the free Google Translate TTS |
β‘ Zero-infra CPU | Runs on 2 vCPU / 16 GB RAMβno GPU or paid APIs needed |
π¨ Elegant UI | Intuitive Gradio Blocksβupload, buttons, transcripts, audio |
π§ Fully Modular | Swap models or add logging/analytics with minimal edits |
ποΈ Architecture & Workflow
- Audio Upload
User uploads any.wav
or.mp3
clip. - ASR
OpenAIβswhisper-tiny
decodes speech into text. - MT
deep-translator
βs GoogleTranslator converts text to chosen language. - TTS
gTTS
synthesizes the translated text into an.mp3
. - UI Rendering
Gradio presents the original transcript, the translation, and an audio player.
π οΈ Quick Start (Local Dev)
git clone https://github.com/<YOUR-USERNAME>/audio-translator.git
cd audio-translator
python3 -m venv venv && source venv/bin/activate
pip install -r requirements.txt
python app.py
## Latest Update
- Upgraded Whisper-Tiny model for faster ASR. - May 29, 2025 π
- Added support for additional audio formats. ποΈ - June 21, 2025 π
- Enhanced gTTS audio quality. - June 20, 2025 π
- Improved translation accuracy for Spanish. β‘ - June 19, 2025 π
- Upgraded Whisper-Tiny model for faster ASR. - June 18, 2025 π
- Optimized pipeline for lower latency. π - June 17, 2025 π
- Added support for additional audio formats. ποΈ - June 16, 2025 π
- Enhanced gTTS audio quality. - June 15, 2025 π
- Improved translation accuracy for Spanish. π£οΈ - June 14, 2025 π
- Upgraded Whisper-Tiny model for faster ASR. - June 13, 2025 π
- Optimized pipeline for lower latency. π₯ - June 12, 2025 π
- Added support for additional audio formats. β‘ - June 11, 2025 π
- Enhanced gTTS audio quality. - June 10, 2025 π
- Improved translation accuracy for Spanish. ποΈ - June 09, 2025 π
- Upgraded Whisper-Tiny model for faster ASR. - June 08, 2025 π
- Optimized pipeline for lower latency. π₯ - June 07, 2025 π
- Added support for additional audio formats. π - June 06, 2025 π
- Enhanced gTTS audio quality. π£οΈ - June 05, 2025 π
- Improved translation accuracy for Spanish. ποΈ - June 04, 2025 π
- Upgraded Whisper-Tiny model for faster ASR. π - June 03, 2025 π
- Optimized pipeline for lower latency. π₯ - June 02, 2025 π
- Added support for additional audio formats. π£οΈ - June 01, 2025 π
- Enhanced gTTS audio quality. - May 31, 2025 π
- Improved translation accuracy for Spanish. β‘ - May 30, 2025 π
**Website**: https://ghostainews.com/
**Discord**: https://discord.gg/BfA23aYz