Spaces:
Running
Running
File size: 7,936 Bytes
2f4675d 16959a9 525f362 19e1430 7dcbf4d afdb2c8 39696d3 211e964 39f35ed 1137ef7 63605f4 ac07636 b11dc80 935279b ad49cce 5ab807c f0eea6b 189f146 6f12f73 7e16736 fa6ecbe d1275f1 3a5ced3 e84b228 a23c8ff fc5b579 934e5e2 92cdb25 7e35c4d 8022b55 953bc00 c3d12fe 31bda4f feb233c 4248ee6 2807021 ebe810e fb2e4b5 6b7577b 9a72501 fa648f8 086e8ed 3b8f4fd 3cc2412 f4ae7d6 6e23d0a 82e494a 50eb112 cc73688 0a9fd9f 2767337 50fb7a3 bfbba7b bdd61bb 145e779 651c442 5b6115f 8a88a1d 0f25a71 7361941 d47dce0 c3f8e54 90b0844 c80827e e1dfcbb 039cd1a e674d91 f66de24 92e9176 ddd2301 81843dc de35a85 a15e1c1 a7f66e9 229634c e5f9753 5de3ba1 1ad0bb5 525f362 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150 |
---
title: Audio Translator
emoji: π₯
colorFrom: pink
colorTo: purple
sdk: gradio
sdk_version: 5.31.0
app_file: app.py
pinned: false
license: apache-2.0
short_description: Audio Translator
---
# π£οΈ Audio Translator
[](https://huggingface.co/spaces/<YOUR-USERNAME>/audio-translator)
[]
[]
[]
[]
[](LICENSE)
---
## π Overview
Combine **ASR**, **machine translation**, and **neural TTS** into one **seamless audio pipeline**β100 % **CPU** on free-tier HF Spaces.
Upload speech, auto-detect language, translate into English or Spanish, then hear it spoken back.
> **AI buzzwords:**
> β’ Automatic Speech Recognition (ASR) β’ Whisper Tiny β’ Neural Machine Translation β’ GoogleTranslator β’ Text-to-Speech β’ gTTS β’ Multi-modal AI β’ End-to-End Inference β’ Real-Time β’ Edge Deployment
---
## β¨ Features
| π Feature | π Description |
|---------------------------|---------------------------------------------------------------|
| **ποΈ ASR: Whisper-Tiny** | Lightning-fast, on-device speech transcription (all languages) |
| **π Translation** | Bidirectional English β Spanish via Deep-Translator |
| **π£οΈ Neural TTS** | High-quality audio playback via the free Google Translate TTS |
| **β‘ Zero-infra CPU** | Runs on 2 vCPU / 16 GB RAMβno GPU or paid APIs needed |
| **π¨ Elegant UI** | Intuitive Gradio Blocksβupload, buttons, transcripts, audio |
| **π§ Fully Modular** | Swap models or add logging/analytics with minimal edits |
---
## ποΈ Architecture & Workflow
1. **Audio Upload**
User uploads any `.wav` or `.mp3` clip.
2. **ASR**
OpenAIβs `whisper-tiny` decodes speech into text.
3. **MT**
`deep-translator`βs GoogleTranslator converts text to chosen language.
4. **TTS**
`gTTS` synthesizes the translated text into an `.mp3`.
5. **UI Rendering**
Gradio presents the original transcript, the translation, and an audio player.
---
## π οΈ Quick Start (Local Dev)
```bash
git clone https://github.com/<YOUR-USERNAME>/audio-translator.git
cd audio-translator
python3 -m venv venv && source venv/bin/activate
pip install -r requirements.txt
python app.py
## Latest Update
- Upgraded Whisper-Tiny model for faster ASR. - May 29, 2025 π
- Upgraded Whisper-Tiny model for faster ASR. ποΈ - August 15, 2025 π
- Optimized pipeline for lower latency. - August 14, 2025 π
- Added support for additional audio formats. π£οΈ - August 13, 2025 π
- Enhanced gTTS audio quality. π - August 12, 2025 π
- Improved translation accuracy for Spanish. π₯ - August 11, 2025 π
- Upgraded Whisper-Tiny model for faster ASR. - August 10, 2025 π
- Optimized pipeline for lower latency. - August 09, 2025 π
- Added support for additional audio formats. π₯ - August 08, 2025 π
- Enhanced gTTS audio quality. ποΈ - August 07, 2025 π
- Improved translation accuracy for Spanish. β‘ - August 06, 2025 π
- Upgraded Whisper-Tiny model for faster ASR. π£οΈ - August 05, 2025 π
- Optimized pipeline for lower latency. - August 04, 2025 π
- Added support for additional audio formats. π - August 03, 2025 π
- Enhanced gTTS audio quality. - August 02, 2025 π
- Improved translation accuracy for Spanish. ποΈ - August 01, 2025 π
- Upgraded Whisper-Tiny model for faster ASR. - July 31, 2025 π
- Optimized pipeline for lower latency. π£οΈ - July 30, 2025 π
- Added support for additional audio formats. π₯ - July 29, 2025 π
- Enhanced gTTS audio quality. β‘ - July 28, 2025 π
- Improved translation accuracy for Spanish. - July 27, 2025 π
- Upgraded Whisper-Tiny model for faster ASR. - July 26, 2025 π
- Optimized pipeline for lower latency. - July 25, 2025 π
- Added support for additional audio formats. π - July 24, 2025 π
- Enhanced gTTS audio quality. - July 23, 2025 π
- Improved translation accuracy for Spanish. ποΈ - July 22, 2025 π
- Upgraded Whisper-Tiny model for faster ASR. - July 21, 2025 π
- Optimized pipeline for lower latency. π£οΈ - July 20, 2025 π
- Added support for additional audio formats. π₯ - July 19, 2025 π
- Enhanced gTTS audio quality. π - July 18, 2025 π
- Improved translation accuracy for Spanish. β‘ - July 17, 2025 π
- Upgraded Whisper-Tiny model for faster ASR. - July 16, 2025 π
- Optimized pipeline for lower latency. - July 15, 2025 π
- Added support for additional audio formats. π - July 11, 2025 π
- Enhanced gTTS audio quality. - July 10, 2025 π
- Improved translation accuracy for Spanish. - July 09, 2025 π
- Upgraded Whisper-Tiny model for faster ASR. β‘ - July 08, 2025 π
- Optimized pipeline for lower latency. π£οΈ - July 07, 2025 π
- Added support for additional audio formats. - July 06, 2025 π
- Enhanced gTTS audio quality. π₯ - July 05, 2025 π
- Improved translation accuracy for Spanish. ποΈ - July 04, 2025 π
- Upgraded Whisper-Tiny model for faster ASR. - July 03, 2025 π
- Optimized pipeline for lower latency. - July 02, 2025 π
- Added support for additional audio formats. - July 01, 2025 π
- Enhanced gTTS audio quality. - June 30, 2025 π
- Improved translation accuracy for Spanish. β‘ - June 29, 2025 π
- Upgraded Whisper-Tiny model for faster ASR. - June 28, 2025 π
- Optimized pipeline for lower latency. - June 27, 2025 π
- Added support for additional audio formats. - June 26, 2025 π
- Enhanced gTTS audio quality. π - June 25, 2025 π
- Improved translation accuracy for Spanish. - June 24, 2025 π
- Upgraded Whisper-Tiny model for faster ASR. π£οΈ - June 23, 2025 π
- Optimized pipeline for lower latency. π₯ - June 22, 2025 π
- Added support for additional audio formats. ποΈ - June 21, 2025 π
- Enhanced gTTS audio quality. - June 20, 2025 π
- Improved translation accuracy for Spanish. β‘ - June 19, 2025 π
- Upgraded Whisper-Tiny model for faster ASR. - June 18, 2025 π
- Optimized pipeline for lower latency. π - June 17, 2025 π
- Added support for additional audio formats. ποΈ - June 16, 2025 π
- Enhanced gTTS audio quality. - June 15, 2025 π
- Improved translation accuracy for Spanish. π£οΈ - June 14, 2025 π
- Upgraded Whisper-Tiny model for faster ASR. - June 13, 2025 π
- Optimized pipeline for lower latency. π₯ - June 12, 2025 π
- Added support for additional audio formats. β‘ - June 11, 2025 π
- Enhanced gTTS audio quality. - June 10, 2025 π
- Improved translation accuracy for Spanish. ποΈ - June 09, 2025 π
- Upgraded Whisper-Tiny model for faster ASR. - June 08, 2025 π
- Optimized pipeline for lower latency. π₯ - June 07, 2025 π
- Added support for additional audio formats. π - June 06, 2025 π
- Enhanced gTTS audio quality. π£οΈ - June 05, 2025 π
- Improved translation accuracy for Spanish. ποΈ - June 04, 2025 π
- Upgraded Whisper-Tiny model for faster ASR. π - June 03, 2025 π
- Optimized pipeline for lower latency. π₯ - June 02, 2025 π
- Added support for additional audio formats. π£οΈ - June 01, 2025 π
- Enhanced gTTS audio quality. - May 31, 2025 π
- Improved translation accuracy for Spanish. β‘ - May 30, 2025 π
**Website**: https://ghostainews.com/
**Discord**: https://discord.gg/BfA23aYz |