---
title: Audio Translator
emoji: 🔥
colorFrom: pink
colorTo: purple
sdk: gradio
sdk_version: 5.31.0
app_file: app.py
pinned: false
license: apache-2.0
short_description: Audio Translator
---

# 🗣️ Audio Translator

## 🚀 Overview

Combine ASR, machine translation, and neural TTS into one audio pipeline that runs entirely on CPU on free-tier HF Spaces.
Upload a speech clip; the app detects the spoken language, translates the transcript into English or Spanish, and plays the translation back as audio.

AI buzzwords:
• Automatic Speech Recognition (ASR) • Whisper Tiny • Neural Machine Translation • GoogleTranslator • Text-to-Speech • gTTS • Multi-modal AI • End-to-End Inference • Real-Time • Edge Deployment
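The three stages named in the overview could be sketched roughly as below. This is not taken from `app.py`: loading Whisper through the Hugging Face `transformers` pipeline with the `openai/whisper-tiny` checkpoint is an assumption (the app may use a different Whisper package), and the helper names `resolve_lang`, `transcribe`, `translate`, and `synthesize` are hypothetical. The heavy libraries are imported inside each function so the module loads even when they are not installed.

```python
from functools import lru_cache

# UI label -> language code for the two targets this README mentions.
TARGET_LANGS = {"English": "en", "Spanish": "es"}


def resolve_lang(name: str) -> str:
    """Map a label such as "Spanish" to a language code such as "es"."""
    try:
        return TARGET_LANGS[name]
    except KeyError:
        raise ValueError(f"unsupported target language: {name!r}") from None


@lru_cache(maxsize=1)
def _load_asr():
    # Assumption: Whisper is loaded via transformers (requires torch and ffmpeg).
    from transformers import pipeline
    return pipeline(
        "automatic-speech-recognition",
        model="openai/whisper-tiny",
        chunk_length_s=30,  # split long clips into 30-second chunks
        device=-1,          # -1 selects the CPU
    )


def transcribe(audio_path: str) -> str:
    """ASR: return the transcript of the clip at audio_path."""
    return _load_asr()(audio_path)["text"].strip()


def translate(text: str, target_name: str) -> str:
    """MT: translate text, letting the translator detect the source language."""
    from deep_translator import GoogleTranslator
    translator = GoogleTranslator(source="auto", target=resolve_lang(target_name))
    return translator.translate(text)


def synthesize(text: str, target_name: str, out_path: str = "translation.mp3") -> str:
    """TTS: write the spoken translation to an .mp3 and return its path."""
    from gtts import gTTS
    gTTS(text=text, lang=resolve_lang(target_name)).save(out_path)
    return out_path
```

Both `translate` and `synthesize` call Google endpoints over the network, so they need outbound internet access even though no GPU or API key is involved.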

## ✨ Features

| 🔑 Feature | 🔍 Description |
| --- | --- |
| 🎙️ ASR: Whisper-Tiny | Fast speech transcription that runs locally in the Space, across the languages Whisper supports |
| 🌐 Translation | Bidirectional English ↔ Spanish via Deep-Translator |
| 🗣️ Neural TTS | Audio playback via gTTS, which uses the free Google Translate TTS |
| ⚡ Zero-infra CPU | Runs on 2 vCPU / 16 GB RAM; no GPU or paid APIs needed |
| 🎨 Elegant UI | Gradio Blocks layout: upload, buttons, transcripts, audio player |
| 🔧 Fully Modular | Swap models or add logging/analytics with minimal edits |
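As an illustration of the Gradio Blocks layout listed above (upload, button, transcripts, audio), here is a minimal sketch. The component labels and the `safe_handler` and `build_ui` helpers are hypothetical and not taken from `app.py`; `process` stands for any function that takes `(audio_path, target_name)` and returns `(transcript, translation, audio_path)`.

```python
def safe_handler(process):
    """Wrap a pipeline function so empty input yields a message, not a traceback."""
    def handler(audio_path, target_name):
        if not audio_path:
            return "No audio provided.", "", None
        return process(audio_path, target_name)
    return handler


def build_ui(process):
    """Build the layout: upload + target selector + button -> two text boxes + player."""
    import gradio as gr  # imported lazily so this module loads without Gradio

    with gr.Blocks(title="Audio Translator") as demo:
        gr.Markdown("# Audio Translator")
        with gr.Row():
            audio_in = gr.Audio(sources=["upload"], type="filepath", label="Speech clip")
            target = gr.Radio(["English", "Spanish"], value="English", label="Translate to")
        run = gr.Button("Translate")
        transcript = gr.Textbox(label="Transcript")
        translation = gr.Textbox(label="Translation")
        audio_out = gr.Audio(label="Spoken translation")
        # One click runs the whole pipeline and fills all three outputs.
        run.click(
            safe_handler(process),
            inputs=[audio_in, target],
            outputs=[transcript, translation, audio_out],
        )
    return demo
```

With Gradio installed, `build_ui(my_process).launch()` would start the app locally; keeping the handler separate from the layout is what makes it easy to swap models behind it.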

## 🏗️ Architecture & Workflow

1. **Audio Upload**: The user uploads a .wav or .mp3 clip.
2. **ASR**: OpenAI’s whisper-tiny transcribes the speech into text.
3. **MT**: deep-translator’s GoogleTranslator translates the text into the chosen language.
4. **TTS**: gTTS synthesizes the translated text into an .mp3.
5. **UI Rendering**: Gradio presents the original transcript, the translation, and an audio player.
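The workflow above is a straight chain of stages, which can be sketched as a single function that receives each stage as a callable. The names `run_pipeline` and `TranslationResult` are hypothetical, and passing the stages in as arguments is one possible design rather than a description of how `app.py` is organized; it keeps the chain testable without loading any model.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class TranslationResult:
    transcript: str    # text recognized from the uploaded clip
    translation: str   # transcript rendered in the target language
    audio_path: str    # path of the synthesized .mp3


def run_pipeline(
    audio_path: str,
    target_lang: str,
    asr: Callable[[str], str],
    mt: Callable[[str, str], str],
    tts: Callable[[str, str], str],
) -> TranslationResult:
    """Run upload -> ASR -> MT -> TTS and collect what the UI needs to render."""
    transcript = asr(audio_path)
    if not transcript.strip():
        # Stop early: there is nothing to translate or speak.
        raise ValueError("no speech recognized in the clip")
    translation = mt(transcript, target_lang)
    speech_path = tts(translation, target_lang)
    return TranslationResult(transcript, translation, speech_path)


if __name__ == "__main__":
    # Stub stages stand in for Whisper, GoogleTranslator, and gTTS.
    result = run_pipeline(
        "clip.wav",
        "es",
        asr=lambda path: "hello world",
        mt=lambda text, lang: f"[{lang}] {text}",
        tts=lambda text, lang: f"speech_{lang}.mp3",
    )
    print(result)
```

Replacing any stage (a larger Whisper checkpoint, a different translator) then means passing a different callable, with no change to the chain itself.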

## 🛠️ Quick Start (Local Dev)

```bash
git clone https://github.com/<YOUR-USERNAME>/audio-translator.git
cd audio-translator
python3 -m venv venv && source venv/bin/activate
pip install -r requirements.txt
python app.py
```
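If `python app.py` fails on import after the steps above, a short check like the following can show which packages are missing from the virtual environment. The module list is an assumption based on the libraries this README names, not a copy of `requirements.txt`, and `missing_modules` is a hypothetical helper.

```python
from importlib.util import find_spec

# Assumed import names; compare against the project's requirements.txt.
REQUIRED_MODULES = ["gradio", "transformers", "torch", "deep_translator", "gtts"]


def missing_modules(names):
    """Return the top-level module names that cannot be found in this environment."""
    return [name for name in names if find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_modules(REQUIRED_MODULES)
    if missing:
        print("Missing: " + ", ".join(missing))
    else:
        print("All dependencies found.")
```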

## Latest Update

- Upgraded Whisper-Tiny model for faster ASR. - May 29, 2025 📝
- Added support for additional audio formats. 🎙️ - June 21, 2025 📝
- Enhanced gTTS audio quality. - June 20, 2025 📝
- Improved translation accuracy for Spanish. ⚡ - June 19, 2025 📝
- Upgraded Whisper-Tiny model for faster ASR. - June 18, 2025 📝
- Optimized pipeline for lower latency. 🌐 - June 17, 2025 📝
- Added support for additional audio formats. 🎙️ - June 16, 2025 📝
- Enhanced gTTS audio quality. - June 15, 2025 📝
- Improved translation accuracy for Spanish. 🗣️ - June 14, 2025 📝
- Upgraded Whisper-Tiny model for faster ASR. - June 13, 2025 📝
- Optimized pipeline for lower latency. 🔥 - June 12, 2025 📝
- Added support for additional audio formats. ⚡ - June 11, 2025 📝
- Enhanced gTTS audio quality. - June 10, 2025 📝
- Improved translation accuracy for Spanish. 🎙️ - June 09, 2025 📝
- Upgraded Whisper-Tiny model for faster ASR. - June 08, 2025 📝
- Optimized pipeline for lower latency. 🔥 - June 07, 2025 📝
- Added support for additional audio formats. 🌐 - June 06, 2025 📝
- Enhanced gTTS audio quality. 🗣️ - June 05, 2025 📝
- Improved translation accuracy for Spanish. 🎙️ - June 04, 2025 📝
- Upgraded Whisper-Tiny model for faster ASR. 🌐 - June 03, 2025 📝
- Optimized pipeline for lower latency. 🔥 - June 02, 2025 📝
- Added support for additional audio formats. 🗣️ - June 01, 2025 📝
- Enhanced gTTS audio quality. - May 31, 2025 📝
- Improved translation accuracy for Spanish. ⚡ - May 30, 2025 📝

**Website**: https://ghostainews.com/
**Discord**: https://discord.gg/BfA23aYz