Spaces:
Running
Running
title: Audio Translator | |
emoji: π₯ | |
colorFrom: pink | |
colorTo: purple | |
sdk: gradio | |
sdk_version: 5.31.0 | |
app_file: app.py | |
pinned: false | |
license: apache-2.0 | |
short_description: Audio Translator | |
# π£οΈ Audio Translator | |
[](https://huggingface.co/spaces/<YOUR-USERNAME>/audio-translator) | |
[] | |
[] | |
[] | |
[] | |
[](LICENSE) | |
--- | |
## π Overview | |
Combine **ASR**, **machine translation**, and **neural TTS** into one **seamless audio pipeline**β100 % **CPU** on free-tier HF Spaces. | |
Upload speech, auto-detect language, translate into English or Spanish, then hear it spoken back. | |
> **AI buzzwords:** | |
> β’ Automatic Speech Recognition (ASR) β’ Whisper Tiny β’ Neural Machine Translation β’ GoogleTranslator β’ Text-to-Speech β’ gTTS β’ Multi-modal AI β’ End-to-End Inference β’ Real-Time β’ Edge Deployment | |
--- | |
## β¨ Features | |
| π Feature | π Description | | |
|---------------------------|---------------------------------------------------------------| | |
| **ποΈ ASR: Whisper-Tiny** | Lightning-fast, on-device speech transcription (all languages) | | |
| **π Translation** | Bidirectional English β Spanish via Deep-Translator | | |
| **π£οΈ Neural TTS** | High-quality audio playback via the free Google Translate TTS | | |
| **β‘ Zero-infra CPU** | Runs on 2 vCPU / 16 GB RAMβno GPU or paid APIs needed | | |
| **π¨ Elegant UI** | Intuitive Gradio Blocksβupload, buttons, transcripts, audio | | |
| **π§ Fully Modular** | Swap models or add logging/analytics with minimal edits | | |
--- | |
## ποΈ Architecture & Workflow | |
1. **Audio Upload** | |
User uploads any `.wav` or `.mp3` clip. | |
2. **ASR** | |
OpenAIβs `whisper-tiny` decodes speech into text. | |
3. **MT** | |
`deep-translator`βs GoogleTranslator converts text to chosen language. | |
4. **TTS** | |
`gTTS` synthesizes the translated text into an `.mp3`. | |
5. **UI Rendering** | |
Gradio presents the original transcript, the translation, and an audio player. | |
--- | |
## π οΈ Quick Start (Local Dev) | |
```bash | |
git clone https://github.com/<YOUR-USERNAME>/audio-translator.git | |
cd audio-translator | |
python3 -m venv venv && source venv/bin/activate | |
pip install -r requirements.txt | |
python app.py | |
## Latest Update | |
- Upgraded Whisper-Tiny model for faster ASR. - May 29, 2025 π | |
- Optimized pipeline for lower latency. - August 14, 2025 π | |
- Added support for additional audio formats. π£οΈ - August 13, 2025 π | |
- Enhanced gTTS audio quality. π - August 12, 2025 π | |
- Improved translation accuracy for Spanish. π₯ - August 11, 2025 π | |
- Upgraded Whisper-Tiny model for faster ASR. - August 10, 2025 π | |
- Optimized pipeline for lower latency. - August 09, 2025 π | |
- Added support for additional audio formats. π₯ - August 08, 2025 π | |
- Enhanced gTTS audio quality. ποΈ - August 07, 2025 π | |
- Improved translation accuracy for Spanish. β‘ - August 06, 2025 π | |
- Upgraded Whisper-Tiny model for faster ASR. π£οΈ - August 05, 2025 π | |
- Optimized pipeline for lower latency. - August 04, 2025 π | |
- Added support for additional audio formats. π - August 03, 2025 π | |
- Enhanced gTTS audio quality. - August 02, 2025 π | |
- Improved translation accuracy for Spanish. ποΈ - August 01, 2025 π | |
- Upgraded Whisper-Tiny model for faster ASR. - July 31, 2025 π | |
- Optimized pipeline for lower latency. π£οΈ - July 30, 2025 π | |
- Added support for additional audio formats. π₯ - July 29, 2025 π | |
- Enhanced gTTS audio quality. β‘ - July 28, 2025 π | |
- Improved translation accuracy for Spanish. - July 27, 2025 π | |
- Upgraded Whisper-Tiny model for faster ASR. - July 26, 2025 π | |
- Optimized pipeline for lower latency. - July 25, 2025 π | |
- Added support for additional audio formats. π - July 24, 2025 π | |
- Enhanced gTTS audio quality. - July 23, 2025 π | |
- Improved translation accuracy for Spanish. ποΈ - July 22, 2025 π | |
- Upgraded Whisper-Tiny model for faster ASR. - July 21, 2025 π | |
- Optimized pipeline for lower latency. π£οΈ - July 20, 2025 π | |
- Added support for additional audio formats. π₯ - July 19, 2025 π | |
- Enhanced gTTS audio quality. π - July 18, 2025 π | |
- Improved translation accuracy for Spanish. β‘ - July 17, 2025 π | |
- Upgraded Whisper-Tiny model for faster ASR. - July 16, 2025 π | |
- Optimized pipeline for lower latency. - July 15, 2025 π | |
- Added support for additional audio formats. π - July 11, 2025 π | |
- Enhanced gTTS audio quality. - July 10, 2025 π | |
- Improved translation accuracy for Spanish. - July 09, 2025 π | |
- Upgraded Whisper-Tiny model for faster ASR. β‘ - July 08, 2025 π | |
- Optimized pipeline for lower latency. π£οΈ - July 07, 2025 π | |
- Added support for additional audio formats. - July 06, 2025 π | |
- Enhanced gTTS audio quality. π₯ - July 05, 2025 π | |
- Improved translation accuracy for Spanish. ποΈ - July 04, 2025 π | |
- Upgraded Whisper-Tiny model for faster ASR. - July 03, 2025 π | |
- Optimized pipeline for lower latency. - July 02, 2025 π | |
- Added support for additional audio formats. - July 01, 2025 π | |
- Enhanced gTTS audio quality. - June 30, 2025 π | |
- Improved translation accuracy for Spanish. β‘ - June 29, 2025 π | |
- Upgraded Whisper-Tiny model for faster ASR. - June 28, 2025 π | |
- Optimized pipeline for lower latency. - June 27, 2025 π | |
- Added support for additional audio formats. - June 26, 2025 π | |
- Enhanced gTTS audio quality. π - June 25, 2025 π | |
- Improved translation accuracy for Spanish. - June 24, 2025 π | |
- Upgraded Whisper-Tiny model for faster ASR. π£οΈ - June 23, 2025 π | |
- Optimized pipeline for lower latency. π₯ - June 22, 2025 π | |
- Added support for additional audio formats. ποΈ - June 21, 2025 π | |
- Enhanced gTTS audio quality. - June 20, 2025 π | |
- Improved translation accuracy for Spanish. β‘ - June 19, 2025 π | |
- Upgraded Whisper-Tiny model for faster ASR. - June 18, 2025 π | |
- Optimized pipeline for lower latency. π - June 17, 2025 π | |
- Added support for additional audio formats. ποΈ - June 16, 2025 π | |
- Enhanced gTTS audio quality. - June 15, 2025 π | |
- Improved translation accuracy for Spanish. π£οΈ - June 14, 2025 π | |
- Upgraded Whisper-Tiny model for faster ASR. - June 13, 2025 π | |
- Optimized pipeline for lower latency. π₯ - June 12, 2025 π | |
- Added support for additional audio formats. β‘ - June 11, 2025 π | |
- Enhanced gTTS audio quality. - June 10, 2025 π | |
- Improved translation accuracy for Spanish. ποΈ - June 09, 2025 π | |
- Upgraded Whisper-Tiny model for faster ASR. - June 08, 2025 π | |
- Optimized pipeline for lower latency. π₯ - June 07, 2025 π | |
- Added support for additional audio formats. π - June 06, 2025 π | |
- Enhanced gTTS audio quality. π£οΈ - June 05, 2025 π | |
- Improved translation accuracy for Spanish. ποΈ - June 04, 2025 π | |
- Upgraded Whisper-Tiny model for faster ASR. π - June 03, 2025 π | |
- Optimized pipeline for lower latency. π₯ - June 02, 2025 π | |
- Added support for additional audio formats. π£οΈ - June 01, 2025 π | |
- Enhanced gTTS audio quality. - May 31, 2025 π | |
- Improved translation accuracy for Spanish. β‘ - May 30, 2025 π | |
**Website**: https://ghostainews.com/ | |
**Discord**: https://discord.gg/BfA23aYz |