Audio-Translator / README.md
ghostai1's picture
Updated status on August 14, 2025
7dcbf4d verified
---
title: Audio Translator
emoji: πŸ”₯
colorFrom: pink
colorTo: purple
sdk: gradio
sdk_version: 5.31.0
app_file: app.py
pinned: false
license: apache-2.0
short_description: Audio Translator
---
# πŸ—£οΈ Audio Translator
[![Hugging Face Space](https://img.shields.io/badge/HuggingFace-Spaces-blue?logo=huggingface)](https://huggingface.co/spaces/<YOUR-USERNAME>/audio-translator)
[![Gradio UI](https://img.shields.io/badge/Gradio-5.31.0-brightgreen?logo=gradio)]
[![Model: Whisper Tiny](https://img.shields.io/badge/ASR-Whisper--tiny-orange)]
[![Translator: Deep-Translator](https://img.shields.io/badge/Translator-GoogleTranslator-blue)]
[![TTS: gTTS](https://img.shields.io/badge/TTS-gTTS-yellow)]
[![License](https://img.shields.io/badge/License-MIT-lightgrey)](LICENSE)
---
## πŸš€ Overview
Combine **ASR**, **machine translation**, and **neural TTS** into one **seamless audio pipeline**β€”100 % **CPU** on free-tier HF Spaces.
Upload speech, auto-detect language, translate into English or Spanish, then hear it spoken back.
> **AI buzzwords:**
> β€’ Automatic Speech Recognition (ASR) β€’ Whisper Tiny β€’ Neural Machine Translation β€’ GoogleTranslator β€’ Text-to-Speech β€’ gTTS β€’ Multi-modal AI β€’ End-to-End Inference β€’ Real-Time β€’ Edge Deployment
---
## ✨ Features
| πŸ”‘ Feature | πŸ” Description |
|---------------------------|---------------------------------------------------------------|
| **πŸŽ™οΈ ASR: Whisper-Tiny** | Lightning-fast, on-device speech transcription (all languages) |
| **🌐 Translation** | Bidirectional English ↔ Spanish via Deep-Translator |
| **πŸ—£οΈ Neural TTS** | High-quality audio playback via the free Google Translate TTS |
| **⚑ Zero-infra CPU** | Runs on 2 vCPU / 16 GB RAMβ€”no GPU or paid APIs needed |
| **🎨 Elegant UI** | Intuitive Gradio Blocksβ€”upload, buttons, transcripts, audio |
| **πŸ”§ Fully Modular** | Swap models or add logging/analytics with minimal edits |
---
## πŸ—οΈ Architecture & Workflow
1. **Audio Upload**
User uploads any `.wav` or `.mp3` clip.
2. **ASR**
OpenAI’s `whisper-tiny` decodes speech into text.
3. **MT**
`deep-translator`’s GoogleTranslator converts text to chosen language.
4. **TTS**
`gTTS` synthesizes the translated text into an `.mp3`.
5. **UI Rendering**
Gradio presents the original transcript, the translation, and an audio player.
---
## πŸ› οΈ Quick Start (Local Dev)
```bash
git clone https://github.com/<YOUR-USERNAME>/audio-translator.git
cd audio-translator
python3 -m venv venv && source venv/bin/activate
pip install -r requirements.txt
python app.py
## Latest Update
- Upgraded Whisper-Tiny model for faster ASR. - May 29, 2025 πŸ“
- Optimized pipeline for lower latency. - August 14, 2025 πŸ“
- Added support for additional audio formats. πŸ—£οΈ - August 13, 2025 πŸ“
- Enhanced gTTS audio quality. 🌐 - August 12, 2025 πŸ“
- Improved translation accuracy for Spanish. πŸ”₯ - August 11, 2025 πŸ“
- Upgraded Whisper-Tiny model for faster ASR. - August 10, 2025 πŸ“
- Optimized pipeline for lower latency. - August 09, 2025 πŸ“
- Added support for additional audio formats. πŸ”₯ - August 08, 2025 πŸ“
- Enhanced gTTS audio quality. πŸŽ™οΈ - August 07, 2025 πŸ“
- Improved translation accuracy for Spanish. ⚑ - August 06, 2025 πŸ“
- Upgraded Whisper-Tiny model for faster ASR. πŸ—£οΈ - August 05, 2025 πŸ“
- Optimized pipeline for lower latency. - August 04, 2025 πŸ“
- Added support for additional audio formats. 🌐 - August 03, 2025 πŸ“
- Enhanced gTTS audio quality. - August 02, 2025 πŸ“
- Improved translation accuracy for Spanish. πŸŽ™οΈ - August 01, 2025 πŸ“
- Upgraded Whisper-Tiny model for faster ASR. - July 31, 2025 πŸ“
- Optimized pipeline for lower latency. πŸ—£οΈ - July 30, 2025 πŸ“
- Added support for additional audio formats. πŸ”₯ - July 29, 2025 πŸ“
- Enhanced gTTS audio quality. ⚑ - July 28, 2025 πŸ“
- Improved translation accuracy for Spanish. - July 27, 2025 πŸ“
- Upgraded Whisper-Tiny model for faster ASR. - July 26, 2025 πŸ“
- Optimized pipeline for lower latency. - July 25, 2025 πŸ“
- Added support for additional audio formats. 🌐 - July 24, 2025 πŸ“
- Enhanced gTTS audio quality. - July 23, 2025 πŸ“
- Improved translation accuracy for Spanish. πŸŽ™οΈ - July 22, 2025 πŸ“
- Upgraded Whisper-Tiny model for faster ASR. - July 21, 2025 πŸ“
- Optimized pipeline for lower latency. πŸ—£οΈ - July 20, 2025 πŸ“
- Added support for additional audio formats. πŸ”₯ - July 19, 2025 πŸ“
- Enhanced gTTS audio quality. 🌐 - July 18, 2025 πŸ“
- Improved translation accuracy for Spanish. ⚑ - July 17, 2025 πŸ“
- Upgraded Whisper-Tiny model for faster ASR. - July 16, 2025 πŸ“
- Optimized pipeline for lower latency. - July 15, 2025 πŸ“
- Added support for additional audio formats. 🌐 - July 11, 2025 πŸ“
- Enhanced gTTS audio quality. - July 10, 2025 πŸ“
- Improved translation accuracy for Spanish. - July 09, 2025 πŸ“
- Upgraded Whisper-Tiny model for faster ASR. ⚑ - July 08, 2025 πŸ“
- Optimized pipeline for lower latency. πŸ—£οΈ - July 07, 2025 πŸ“
- Added support for additional audio formats. - July 06, 2025 πŸ“
- Enhanced gTTS audio quality. πŸ”₯ - July 05, 2025 πŸ“
- Improved translation accuracy for Spanish. πŸŽ™οΈ - July 04, 2025 πŸ“
- Upgraded Whisper-Tiny model for faster ASR. - July 03, 2025 πŸ“
- Optimized pipeline for lower latency. - July 02, 2025 πŸ“
- Added support for additional audio formats. - July 01, 2025 πŸ“
- Enhanced gTTS audio quality. - June 30, 2025 πŸ“
- Improved translation accuracy for Spanish. ⚑ - June 29, 2025 πŸ“
- Upgraded Whisper-Tiny model for faster ASR. - June 28, 2025 πŸ“
- Optimized pipeline for lower latency. - June 27, 2025 πŸ“
- Added support for additional audio formats. - June 26, 2025 πŸ“
- Enhanced gTTS audio quality. 🌐 - June 25, 2025 πŸ“
- Improved translation accuracy for Spanish. - June 24, 2025 πŸ“
- Upgraded Whisper-Tiny model for faster ASR. πŸ—£οΈ - June 23, 2025 πŸ“
- Optimized pipeline for lower latency. πŸ”₯ - June 22, 2025 πŸ“
- Added support for additional audio formats. πŸŽ™οΈ - June 21, 2025 πŸ“
- Enhanced gTTS audio quality. - June 20, 2025 πŸ“
- Improved translation accuracy for Spanish. ⚑ - June 19, 2025 πŸ“
- Upgraded Whisper-Tiny model for faster ASR. - June 18, 2025 πŸ“
- Optimized pipeline for lower latency. 🌐 - June 17, 2025 πŸ“
- Added support for additional audio formats. πŸŽ™οΈ - June 16, 2025 πŸ“
- Enhanced gTTS audio quality. - June 15, 2025 πŸ“
- Improved translation accuracy for Spanish. πŸ—£οΈ - June 14, 2025 πŸ“
- Upgraded Whisper-Tiny model for faster ASR. - June 13, 2025 πŸ“
- Optimized pipeline for lower latency. πŸ”₯ - June 12, 2025 πŸ“
- Added support for additional audio formats. ⚑ - June 11, 2025 πŸ“
- Enhanced gTTS audio quality. - June 10, 2025 πŸ“
- Improved translation accuracy for Spanish. πŸŽ™οΈ - June 09, 2025 πŸ“
- Upgraded Whisper-Tiny model for faster ASR. - June 08, 2025 πŸ“
- Optimized pipeline for lower latency. πŸ”₯ - June 07, 2025 πŸ“
- Added support for additional audio formats. 🌐 - June 06, 2025 πŸ“
- Enhanced gTTS audio quality. πŸ—£οΈ - June 05, 2025 πŸ“
- Improved translation accuracy for Spanish. πŸŽ™οΈ - June 04, 2025 πŸ“
- Upgraded Whisper-Tiny model for faster ASR. 🌐 - June 03, 2025 πŸ“
- Optimized pipeline for lower latency. πŸ”₯ - June 02, 2025 πŸ“
- Added support for additional audio formats. πŸ—£οΈ - June 01, 2025 πŸ“
- Enhanced gTTS audio quality. - May 31, 2025 πŸ“
- Improved translation accuracy for Spanish. ⚑ - May 30, 2025 πŸ“
**Website**: https://ghostainews.com/
**Discord**: https://discord.gg/BfA23aYz