Spaces:
Running
Running
A newer version of the Gradio SDK is available:
5.42.0
metadata
title: Audio Translator
emoji: π₯
colorFrom: pink
colorTo: purple
sdk: gradio
sdk_version: 5.31.0
app_file: app.py
pinned: false
license: apache-2.0
short_description: Audio Translator
π£οΈ Audio Translator
π Overview
Combine ASR, machine translation, and neural TTS into one seamless audio pipelineβ100 % CPU on free-tier HF Spaces.
Upload speech, auto-detect language, translate into English or Spanish, then hear it spoken back.
AI buzzwords:
β’ Automatic Speech Recognition (ASR) β’ Whisper Tiny β’ Neural Machine Translation β’ GoogleTranslator β’ Text-to-Speech β’ gTTS β’ Multi-modal AI β’ End-to-End Inference β’ Real-Time β’ Edge Deployment
β¨ Features
π Feature | π Description |
---|---|
ποΈ ASR: Whisper-Tiny | Lightning-fast, on-device speech transcription (all languages) |
π Translation | Bidirectional English β Spanish via Deep-Translator |
π£οΈ Neural TTS | High-quality audio playback via the free Google Translate TTS |
β‘ Zero-infra CPU | Runs on 2 vCPU / 16 GB RAMβno GPU or paid APIs needed |
π¨ Elegant UI | Intuitive Gradio Blocksβupload, buttons, transcripts, audio |
π§ Fully Modular | Swap models or add logging/analytics with minimal edits |
ποΈ Architecture & Workflow
- Audio Upload
User uploads any.wav
or.mp3
clip. - ASR
OpenAIβswhisper-tiny
decodes speech into text. - MT
deep-translator
βs GoogleTranslator converts text to chosen language. - TTS
gTTS
synthesizes the translated text into an.mp3
. - UI Rendering
Gradio presents the original transcript, the translation, and an audio player.
π οΈ Quick Start (Local Dev)
git clone https://github.com/<YOUR-USERNAME>/audio-translator.git
cd audio-translator
python3 -m venv venv && source venv/bin/activate
pip install -r requirements.txt
python app.py
## Latest Update
- Upgraded Whisper-Tiny model for faster ASR. - May 29, 2025 π
- Optimized pipeline for lower latency. - August 14, 2025 π
- Added support for additional audio formats. π£οΈ - August 13, 2025 π
- Enhanced gTTS audio quality. π - August 12, 2025 π
- Improved translation accuracy for Spanish. π₯ - August 11, 2025 π
- Upgraded Whisper-Tiny model for faster ASR. - August 10, 2025 π
- Optimized pipeline for lower latency. - August 09, 2025 π
- Added support for additional audio formats. π₯ - August 08, 2025 π
- Enhanced gTTS audio quality. ποΈ - August 07, 2025 π
- Improved translation accuracy for Spanish. β‘ - August 06, 2025 π
- Upgraded Whisper-Tiny model for faster ASR. π£οΈ - August 05, 2025 π
- Optimized pipeline for lower latency. - August 04, 2025 π
- Added support for additional audio formats. π - August 03, 2025 π
- Enhanced gTTS audio quality. - August 02, 2025 π
- Improved translation accuracy for Spanish. ποΈ - August 01, 2025 π
- Upgraded Whisper-Tiny model for faster ASR. - July 31, 2025 π
- Optimized pipeline for lower latency. π£οΈ - July 30, 2025 π
- Added support for additional audio formats. π₯ - July 29, 2025 π
- Enhanced gTTS audio quality. β‘ - July 28, 2025 π
- Improved translation accuracy for Spanish. - July 27, 2025 π
- Upgraded Whisper-Tiny model for faster ASR. - July 26, 2025 π
- Optimized pipeline for lower latency. - July 25, 2025 π
- Added support for additional audio formats. π - July 24, 2025 π
- Enhanced gTTS audio quality. - July 23, 2025 π
- Improved translation accuracy for Spanish. ποΈ - July 22, 2025 π
- Upgraded Whisper-Tiny model for faster ASR. - July 21, 2025 π
- Optimized pipeline for lower latency. π£οΈ - July 20, 2025 π
- Added support for additional audio formats. π₯ - July 19, 2025 π
- Enhanced gTTS audio quality. π - July 18, 2025 π
- Improved translation accuracy for Spanish. β‘ - July 17, 2025 π
- Upgraded Whisper-Tiny model for faster ASR. - July 16, 2025 π
- Optimized pipeline for lower latency. - July 15, 2025 π
- Added support for additional audio formats. π - July 11, 2025 π
- Enhanced gTTS audio quality. - July 10, 2025 π
- Improved translation accuracy for Spanish. - July 09, 2025 π
- Upgraded Whisper-Tiny model for faster ASR. β‘ - July 08, 2025 π
- Optimized pipeline for lower latency. π£οΈ - July 07, 2025 π
- Added support for additional audio formats. - July 06, 2025 π
- Enhanced gTTS audio quality. π₯ - July 05, 2025 π
- Improved translation accuracy for Spanish. ποΈ - July 04, 2025 π
- Upgraded Whisper-Tiny model for faster ASR. - July 03, 2025 π
- Optimized pipeline for lower latency. - July 02, 2025 π
- Added support for additional audio formats. - July 01, 2025 π
- Enhanced gTTS audio quality. - June 30, 2025 π
- Improved translation accuracy for Spanish. β‘ - June 29, 2025 π
- Upgraded Whisper-Tiny model for faster ASR. - June 28, 2025 π
- Optimized pipeline for lower latency. - June 27, 2025 π
- Added support for additional audio formats. - June 26, 2025 π
- Enhanced gTTS audio quality. π - June 25, 2025 π
- Improved translation accuracy for Spanish. - June 24, 2025 π
- Upgraded Whisper-Tiny model for faster ASR. π£οΈ - June 23, 2025 π
- Optimized pipeline for lower latency. π₯ - June 22, 2025 π
- Added support for additional audio formats. ποΈ - June 21, 2025 π
- Enhanced gTTS audio quality. - June 20, 2025 π
- Improved translation accuracy for Spanish. β‘ - June 19, 2025 π
- Upgraded Whisper-Tiny model for faster ASR. - June 18, 2025 π
- Optimized pipeline for lower latency. π - June 17, 2025 π
- Added support for additional audio formats. ποΈ - June 16, 2025 π
- Enhanced gTTS audio quality. - June 15, 2025 π
- Improved translation accuracy for Spanish. π£οΈ - June 14, 2025 π
- Upgraded Whisper-Tiny model for faster ASR. - June 13, 2025 π
- Optimized pipeline for lower latency. π₯ - June 12, 2025 π
- Added support for additional audio formats. β‘ - June 11, 2025 π
- Enhanced gTTS audio quality. - June 10, 2025 π
- Improved translation accuracy for Spanish. ποΈ - June 09, 2025 π
- Upgraded Whisper-Tiny model for faster ASR. - June 08, 2025 π
- Optimized pipeline for lower latency. π₯ - June 07, 2025 π
- Added support for additional audio formats. π - June 06, 2025 π
- Enhanced gTTS audio quality. π£οΈ - June 05, 2025 π
- Improved translation accuracy for Spanish. ποΈ - June 04, 2025 π
- Upgraded Whisper-Tiny model for faster ASR. π - June 03, 2025 π
- Optimized pipeline for lower latency. π₯ - June 02, 2025 π
- Added support for additional audio formats. π£οΈ - June 01, 2025 π
- Enhanced gTTS audio quality. - May 31, 2025 π
- Improved translation accuracy for Spanish. β‘ - May 30, 2025 π
**Website**: https://ghostainews.com/
**Discord**: https://discord.gg/BfA23aYz