Audio-Translator / README.md
ghostai1's picture
Updated status on August 14, 2025
7dcbf4d verified

A newer version of the Gradio SDK is available: 5.42.0

Upgrade
metadata
title: Audio Translator
emoji: πŸ”₯
colorFrom: pink
colorTo: purple
sdk: gradio
sdk_version: 5.31.0
app_file: app.py
pinned: false
license: apache-2.0
short_description: Audio Translator

πŸ—£οΈ Audio Translator

Hugging Face Space
[Gradio UI]
[Model: Whisper Tiny]
[Translator: Deep-Translator]
[TTS: gTTS]
License


πŸš€ Overview

Combine ASR, machine translation, and neural TTS into one seamless audio pipelineβ€”100 % CPU on free-tier HF Spaces.
Upload speech, auto-detect language, translate into English or Spanish, then hear it spoken back.

AI buzzwords:
β€’ Automatic Speech Recognition (ASR) β€’ Whisper Tiny β€’ Neural Machine Translation β€’ GoogleTranslator β€’ Text-to-Speech β€’ gTTS β€’ Multi-modal AI β€’ End-to-End Inference β€’ Real-Time β€’ Edge Deployment


✨ Features

πŸ”‘ Feature πŸ” Description
πŸŽ™οΈ ASR: Whisper-Tiny Lightning-fast, on-device speech transcription (all languages)
🌐 Translation Bidirectional English ↔ Spanish via Deep-Translator
πŸ—£οΈ Neural TTS High-quality audio playback via the free Google Translate TTS
⚑ Zero-infra CPU Runs on 2 vCPU / 16 GB RAMβ€”no GPU or paid APIs needed
🎨 Elegant UI Intuitive Gradio Blocksβ€”upload, buttons, transcripts, audio
πŸ”§ Fully Modular Swap models or add logging/analytics with minimal edits

πŸ—οΈ Architecture & Workflow

  1. Audio Upload
    User uploads any .wav or .mp3 clip.
  2. ASR
    OpenAI’s whisper-tiny decodes speech into text.
  3. MT
    deep-translator’s GoogleTranslator converts text to chosen language.
  4. TTS
    gTTS synthesizes the translated text into an .mp3.
  5. UI Rendering
    Gradio presents the original transcript, the translation, and an audio player.

πŸ› οΈ Quick Start (Local Dev)

git clone https://github.com/<YOUR-USERNAME>/audio-translator.git
cd audio-translator
python3 -m venv venv && source venv/bin/activate
pip install -r requirements.txt
python app.py

## Latest Update

- Upgraded Whisper-Tiny model for faster ASR. - May 29, 2025 πŸ“
- Optimized pipeline for lower latency. - August 14, 2025 πŸ“
- Added support for additional audio formats. πŸ—£οΈ - August 13, 2025 πŸ“
- Enhanced gTTS audio quality. 🌐 - August 12, 2025 πŸ“
- Improved translation accuracy for Spanish. πŸ”₯ - August 11, 2025 πŸ“
- Upgraded Whisper-Tiny model for faster ASR. - August 10, 2025 πŸ“
- Optimized pipeline for lower latency. - August 09, 2025 πŸ“
- Added support for additional audio formats. πŸ”₯ - August 08, 2025 πŸ“
- Enhanced gTTS audio quality. πŸŽ™οΈ - August 07, 2025 πŸ“
- Improved translation accuracy for Spanish. ⚑ - August 06, 2025 πŸ“
- Upgraded Whisper-Tiny model for faster ASR. πŸ—£οΈ - August 05, 2025 πŸ“
- Optimized pipeline for lower latency. - August 04, 2025 πŸ“
- Added support for additional audio formats. 🌐 - August 03, 2025 πŸ“
- Enhanced gTTS audio quality. - August 02, 2025 πŸ“
- Improved translation accuracy for Spanish. πŸŽ™οΈ - August 01, 2025 πŸ“
- Upgraded Whisper-Tiny model for faster ASR. - July 31, 2025 πŸ“
- Optimized pipeline for lower latency. πŸ—£οΈ - July 30, 2025 πŸ“
- Added support for additional audio formats. πŸ”₯ - July 29, 2025 πŸ“
- Enhanced gTTS audio quality. ⚑ - July 28, 2025 πŸ“
- Improved translation accuracy for Spanish. - July 27, 2025 πŸ“
- Upgraded Whisper-Tiny model for faster ASR. - July 26, 2025 πŸ“
- Optimized pipeline for lower latency. - July 25, 2025 πŸ“
- Added support for additional audio formats. 🌐 - July 24, 2025 πŸ“
- Enhanced gTTS audio quality. - July 23, 2025 πŸ“
- Improved translation accuracy for Spanish. πŸŽ™οΈ - July 22, 2025 πŸ“
- Upgraded Whisper-Tiny model for faster ASR. - July 21, 2025 πŸ“
- Optimized pipeline for lower latency. πŸ—£οΈ - July 20, 2025 πŸ“
- Added support for additional audio formats. πŸ”₯ - July 19, 2025 πŸ“
- Enhanced gTTS audio quality. 🌐 - July 18, 2025 πŸ“
- Improved translation accuracy for Spanish. ⚑ - July 17, 2025 πŸ“
- Upgraded Whisper-Tiny model for faster ASR. - July 16, 2025 πŸ“
- Optimized pipeline for lower latency. - July 15, 2025 πŸ“
- Added support for additional audio formats. 🌐 - July 11, 2025 πŸ“
- Enhanced gTTS audio quality. - July 10, 2025 πŸ“
- Improved translation accuracy for Spanish. - July 09, 2025 πŸ“
- Upgraded Whisper-Tiny model for faster ASR. ⚑ - July 08, 2025 πŸ“
- Optimized pipeline for lower latency. πŸ—£οΈ - July 07, 2025 πŸ“
- Added support for additional audio formats. - July 06, 2025 πŸ“
- Enhanced gTTS audio quality. πŸ”₯ - July 05, 2025 πŸ“
- Improved translation accuracy for Spanish. πŸŽ™οΈ - July 04, 2025 πŸ“
- Upgraded Whisper-Tiny model for faster ASR. - July 03, 2025 πŸ“
- Optimized pipeline for lower latency. - July 02, 2025 πŸ“
- Added support for additional audio formats. - July 01, 2025 πŸ“
- Enhanced gTTS audio quality. - June 30, 2025 πŸ“
- Improved translation accuracy for Spanish. ⚑ - June 29, 2025 πŸ“
- Upgraded Whisper-Tiny model for faster ASR. - June 28, 2025 πŸ“
- Optimized pipeline for lower latency. - June 27, 2025 πŸ“
- Added support for additional audio formats. - June 26, 2025 πŸ“
- Enhanced gTTS audio quality. 🌐 - June 25, 2025 πŸ“
- Improved translation accuracy for Spanish. - June 24, 2025 πŸ“
- Upgraded Whisper-Tiny model for faster ASR. πŸ—£οΈ - June 23, 2025 πŸ“
- Optimized pipeline for lower latency. πŸ”₯ - June 22, 2025 πŸ“
- Added support for additional audio formats. πŸŽ™οΈ - June 21, 2025 πŸ“
- Enhanced gTTS audio quality. - June 20, 2025 πŸ“
- Improved translation accuracy for Spanish. ⚑ - June 19, 2025 πŸ“
- Upgraded Whisper-Tiny model for faster ASR. - June 18, 2025 πŸ“
- Optimized pipeline for lower latency. 🌐 - June 17, 2025 πŸ“
- Added support for additional audio formats. πŸŽ™οΈ - June 16, 2025 πŸ“
- Enhanced gTTS audio quality. - June 15, 2025 πŸ“
- Improved translation accuracy for Spanish. πŸ—£οΈ - June 14, 2025 πŸ“
- Upgraded Whisper-Tiny model for faster ASR. - June 13, 2025 πŸ“
- Optimized pipeline for lower latency. πŸ”₯ - June 12, 2025 πŸ“
- Added support for additional audio formats. ⚑ - June 11, 2025 πŸ“
- Enhanced gTTS audio quality. - June 10, 2025 πŸ“
- Improved translation accuracy for Spanish. πŸŽ™οΈ - June 09, 2025 πŸ“
- Upgraded Whisper-Tiny model for faster ASR. - June 08, 2025 πŸ“
- Optimized pipeline for lower latency. πŸ”₯ - June 07, 2025 πŸ“
- Added support for additional audio formats. 🌐 - June 06, 2025 πŸ“
- Enhanced gTTS audio quality. πŸ—£οΈ - June 05, 2025 πŸ“
- Improved translation accuracy for Spanish. πŸŽ™οΈ - June 04, 2025 πŸ“
- Upgraded Whisper-Tiny model for faster ASR. 🌐 - June 03, 2025 πŸ“
- Optimized pipeline for lower latency. πŸ”₯ - June 02, 2025 πŸ“
- Added support for additional audio formats. πŸ—£οΈ - June 01, 2025 πŸ“
- Enhanced gTTS audio quality. - May 31, 2025 πŸ“
- Improved translation accuracy for Spanish. ⚑ - May 30, 2025 πŸ“

**Website**: https://ghostainews.com/
**Discord**: https://discord.gg/BfA23aYz