---
title: Audio Translator
emoji: 🔥
colorFrom: pink
colorTo: purple
sdk: gradio
sdk_version: 5.31.0
app_file: app.py
pinned: false
license: apache-2.0
short_description: Audio Translator
---

# 🗣️ Audio Translator

Hugging Face Space | Gradio UI | Model: Whisper Tiny | Translator: Deep-Translator | TTS: gTTS | License: apache-2.0


## 🚀 Overview

Combine ASR, machine translation, and neural TTS into one audio pipeline that runs entirely on CPU on free-tier HF Spaces.
Upload speech, let the app auto-detect the source language, translate the transcript into English or Spanish, then hear the translation spoken back.

AI buzzwords:
Automatic Speech Recognition (ASR) • Whisper Tiny • Neural Machine Translation • GoogleTranslator • Text-to-Speech • gTTS • Multi-modal AI • End-to-End Inference • Real-Time • Edge Deployment


## ✨ Features

| 🔑 Feature | 🔍 Description |
|---|---|
| 🎙️ ASR: Whisper-Tiny | Fast, on-device speech transcription across the languages Whisper supports |
| 🌐 Translation | Bidirectional English ↔ Spanish via Deep-Translator |
| 🗣️ Neural TTS | High-quality audio playback via the free Google Translate TTS |
| ⚡ Zero-infra CPU | Runs on 2 vCPU / 16 GB RAM; no GPU or paid APIs needed |
| 🎨 Elegant UI | Intuitive Gradio Blocks layout: upload, buttons, transcripts, audio |
| 🔧 Fully Modular | Swap models or add logging/analytics with minimal edits |
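
As a rough illustration of the Gradio Blocks layout described in the table (upload, button, transcripts, audio), here is a minimal sketch. It is not the code from `app.py`: the names `LANGUAGES`, `make_handler`, `build_demo`, and the `process` callable are hypothetical, and `process` is assumed to take an audio path plus a target language code and return the transcript, the translation, and the path to the generated .mp3.

```python
from typing import Callable, Optional, Tuple

# Display label -> language code (assumed mapping; the app may use other labels).
LANGUAGES = {"English": "en", "Spanish": "es"}

# Hypothetical pipeline signature: (audio path, target code)
# -> (transcript, translation, path to the spoken .mp3).
Process = Callable[[str, str], Tuple[str, str, str]]


def make_handler(process: Process):
    """Wrap the pipeline so it can be used as a Gradio click callback."""
    def handler(audio_path: Optional[str], target_label: str):
        # Gradio passes None when the user has not uploaded a clip yet.
        if not audio_path:
            return "", "", None
        return process(audio_path, LANGUAGES[target_label])
    return handler


def build_demo(process: Process):
    """Assemble the Blocks UI; gradio is imported lazily so the helpers
    above stay importable without it."""
    import gradio as gr

    with gr.Blocks(title="Audio Translator") as demo:
        audio_in = gr.Audio(sources=["upload"], type="filepath", label="Speech clip")
        target = gr.Radio(choices=list(LANGUAGES), value="Spanish", label="Translate to")
        run = gr.Button("Translate")
        transcript = gr.Textbox(label="Transcript")
        translation = gr.Textbox(label="Translation")
        audio_out = gr.Audio(label="Spoken translation")
        # One click runs the whole pipeline and fills all three outputs.
        run.click(
            fn=make_handler(process),
            inputs=[audio_in, target],
            outputs=[transcript, translation, audio_out],
        )
    return demo
```

With a real pipeline function in hand, `build_demo(my_process).launch()` would start the app locally. Keeping the handler separate from the layout is one way to make the "swap models with minimal edits" goal practical, since only `process` changes.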

## 🏗️ Architecture & Workflow

  1. Audio Upload
    User uploads any .wav or .mp3 clip.
  2. ASR
    OpenAI's whisper-tiny decodes speech into text.
  3. MT
    deep-translator's GoogleTranslator converts the transcript into the chosen target language.
  4. TTS
    gTTS synthesizes the translated text into an .mp3.
  5. UI Rendering
    Gradio presents the original transcript, the translation, and an audio player.
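
The five steps above can be sketched as a small function that chains three pluggable stages. This is an illustrative outline, not the contents of `app.py`: `translate_audio`, `TranslationResult`, and the adapter functions are hypothetical names, the Hub model id `openai/whisper-tiny` and the `source="auto"` setting are assumptions about how the app loads Whisper and detects the language, and the output filename is a placeholder.

```python
from dataclasses import dataclass
from typing import Callable

# Stage signatures (hypothetical, chosen for this sketch).
Transcriber = Callable[[str], str]        # audio path -> transcript
Translator = Callable[[str, str], str]    # (text, target code) -> translation
Synthesizer = Callable[[str, str], str]   # (text, language code) -> .mp3 path


@dataclass
class TranslationResult:
    transcript: str
    translation: str
    audio_path: str


def translate_audio(audio_path: str, target: str,
                    transcribe: Transcriber,
                    translate: Translator,
                    synthesize: Synthesizer) -> TranslationResult:
    """Run ASR -> MT -> TTS and return everything the UI needs to render."""
    transcript = transcribe(audio_path).strip()
    if not transcript:
        raise ValueError("no speech recognized in the uploaded clip")
    translation = translate(transcript, target)
    speech_path = synthesize(translation, target)
    return TranslationResult(transcript, translation, speech_path)


# Adapters for the libraries named in this README. Imports are lazy so the
# orchestration above can be exercised without the heavy dependencies.
def whisper_transcriber() -> Transcriber:
    from transformers import pipeline
    # Assumption: whisper-tiny is loaded from the Hub under this model id.
    asr = pipeline("automatic-speech-recognition", model="openai/whisper-tiny")
    return lambda path: asr(path)["text"]


def google_translate(text: str, target: str) -> str:
    from deep_translator import GoogleTranslator
    # Assumption: source language detection is left to the translator.
    return GoogleTranslator(source="auto", target=target).translate(text)


def gtts_synthesize(text: str, lang: str, out_path: str = "translation.mp3") -> str:
    from gtts import gTTS
    gTTS(text=text, lang=lang).save(out_path)
    return out_path
```

A call such as `translate_audio("clip.wav", "es", whisper_transcriber(), google_translate, gtts_synthesize)` would then run the full chain. Passing the stages in as arguments keeps each one replaceable, for example to try a larger Whisper checkpoint or a different TTS backend.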

## 🛠️ Quick Start (Local Dev)

    git clone https://github.com/<YOUR-USERNAME>/audio-translator.git
    cd audio-translator
    python3 -m venv venv && source venv/bin/activate
    pip install -r requirements.txt
    python app.py

## Latest Update

- May 29, 2025: Upgraded Whisper-Tiny model for faster ASR. 📝
- May 30, 2025: Improved translation accuracy for Spanish. ⚡ 📝

**Website**: https://ghostainews.com/
**Discord**: https://discord.gg/BfA23aYz