Audio-Translator / README.md
ghostai1's picture
Updated status on June 21, 2025
651c442 verified
|
raw
history blame
4.48 kB
metadata
title: Audio Translator
emoji: πŸ”₯
colorFrom: pink
colorTo: purple
sdk: gradio
sdk_version: 5.31.0
app_file: app.py
pinned: false
license: apache-2.0
short_description: Audio Translator

πŸ—£οΈ Audio Translator

Hugging Face Space
[Gradio UI]
[Model: Whisper Tiny]
[Translator: Deep-Translator]
[TTS: gTTS]
License


πŸš€ Overview

Combine ASR, machine translation, and neural TTS into one seamless audio pipelineβ€”100 % CPU on free-tier HF Spaces.
Upload speech, auto-detect language, translate into English or Spanish, then hear it spoken back.

AI buzzwords:
β€’ Automatic Speech Recognition (ASR) β€’ Whisper Tiny β€’ Neural Machine Translation β€’ GoogleTranslator β€’ Text-to-Speech β€’ gTTS β€’ Multi-modal AI β€’ End-to-End Inference β€’ Real-Time β€’ Edge Deployment


✨ Features

πŸ”‘ Feature πŸ” Description
πŸŽ™οΈ ASR: Whisper-Tiny Lightning-fast, on-device speech transcription (all languages)
🌐 Translation Bidirectional English ↔ Spanish via Deep-Translator
πŸ—£οΈ Neural TTS High-quality audio playback via the free Google Translate TTS
⚑ Zero-infra CPU Runs on 2 vCPU / 16 GB RAMβ€”no GPU or paid APIs needed
🎨 Elegant UI Intuitive Gradio Blocksβ€”upload, buttons, transcripts, audio
πŸ”§ Fully Modular Swap models or add logging/analytics with minimal edits

πŸ—οΈ Architecture & Workflow

  1. Audio Upload
    User uploads any .wav or .mp3 clip.
  2. ASR
    OpenAI’s whisper-tiny decodes speech into text.
  3. MT
    deep-translator’s GoogleTranslator converts text to chosen language.
  4. TTS
    gTTS synthesizes the translated text into an .mp3.
  5. UI Rendering
    Gradio presents the original transcript, the translation, and an audio player.

πŸ› οΈ Quick Start (Local Dev)

git clone https://github.com/<YOUR-USERNAME>/audio-translator.git
cd audio-translator
python3 -m venv venv && source venv/bin/activate
pip install -r requirements.txt
python app.py

## Latest Update

- Upgraded Whisper-Tiny model for faster ASR. - May 29, 2025 πŸ“
- Added support for additional audio formats. πŸŽ™οΈ - June 21, 2025 πŸ“
- Enhanced gTTS audio quality. - June 20, 2025 πŸ“
- Improved translation accuracy for Spanish. ⚑ - June 19, 2025 πŸ“
- Upgraded Whisper-Tiny model for faster ASR. - June 18, 2025 πŸ“
- Optimized pipeline for lower latency. 🌐 - June 17, 2025 πŸ“
- Added support for additional audio formats. πŸŽ™οΈ - June 16, 2025 πŸ“
- Enhanced gTTS audio quality. - June 15, 2025 πŸ“
- Improved translation accuracy for Spanish. πŸ—£οΈ - June 14, 2025 πŸ“
- Upgraded Whisper-Tiny model for faster ASR. - June 13, 2025 πŸ“
- Optimized pipeline for lower latency. πŸ”₯ - June 12, 2025 πŸ“
- Added support for additional audio formats. ⚑ - June 11, 2025 πŸ“
- Enhanced gTTS audio quality. - June 10, 2025 πŸ“
- Improved translation accuracy for Spanish. πŸŽ™οΈ - June 09, 2025 πŸ“
- Upgraded Whisper-Tiny model for faster ASR. - June 08, 2025 πŸ“
- Optimized pipeline for lower latency. πŸ”₯ - June 07, 2025 πŸ“
- Added support for additional audio formats. 🌐 - June 06, 2025 πŸ“
- Enhanced gTTS audio quality. πŸ—£οΈ - June 05, 2025 πŸ“
- Improved translation accuracy for Spanish. πŸŽ™οΈ - June 04, 2025 πŸ“
- Upgraded Whisper-Tiny model for faster ASR. 🌐 - June 03, 2025 πŸ“
- Optimized pipeline for lower latency. πŸ”₯ - June 02, 2025 πŸ“
- Added support for additional audio formats. πŸ—£οΈ - June 01, 2025 πŸ“
- Enhanced gTTS audio quality. - May 31, 2025 πŸ“
- Improved translation accuracy for Spanish. ⚑ - May 30, 2025 πŸ“

**Website**: https://ghostainews.com/
**Discord**: https://discord.gg/BfA23aYz