Spaces:

ghostai1
/

speechtranslate

Sleeping

App Files Files Community

ghostai1 commited on May 28

Commit

67a1108

verified ·

1 Parent(s): e94cef8

Update README.md

Browse files

Files changed (1) hide show

README.md +36 -25

README.md CHANGED Viewed

@@ -10,46 +10,57 @@ pinned: false
 license: apache-2.0
 short_description: text2speech+translate
 ---
-# 🌐💬  Instant Translator & Text-to-Speech
-[![HF Space](https://img.shields.io/badge/HuggingFace-Spaces-blue?logo=huggingface)](https://huggingface.co/spaces/<your-space>)
-[![Gradio UI](https://img.shields.io/badge/Gradio-5.31-green?logo=gradio)](https://gradio.app)
-[![Model Card](https://img.shields.io/badge/Models-Opus-MT%20%7C%20Coqui%20TTS-orange)](#models)
-[![License](https://img.shields.io/github/license/<you>/translator-tts)](LICENSE)
 ---
-## 🚀  What this Space does
-Type a sentence in **English or Spanish**, pick the target language, click **Translate & Speak** —
-and instantly hear the translated sentence spoken back to you.
-Runs 100 % on **free CPU** hardware (no GPUs, no paid APIs).
 ---
-## ✨  Why you’ll love it
-| ⭐ | Feature | Why it matters |
-|----|---------|----------------|
-| 🔄 | **Bidirectional translation** | English ↔ Spanish with top-tier Opus-MT models. |
-| 🔊 | **Natural speech** | Coqui TTS Tacotron2 voices for both languages. |
-| ⚡ | **Zero latency overhead** | < 1 s average translation+TTS on free Space CPU. |
-| 🖱️ | **One-click UI** | Gradio Blocks layout; no install, no login needed. |
-| 💸 | **Cost-free** | All models < 200 MB each; fits HF free-tier RAM & disk. |
-| 🔧 | **Easily extensible** | Swap models to add new languages or voices in minutes. |
 ---
-## 🖼️  Live Demo
-Open the Space &ndash; talk to it!
-➡ `https://<username>-translator-tts.hf.space`
 ---
-## 🏗️  Quick Start (Local Dev)
 ```bash
-git clone https://github.com/<you>/translator-tts.git
-cd translator-tts
-python -m venv venv && source venv/bin/activate
 pip install -r requirements.txt
 python app.py

 license: apache-2.0
 short_description: text2speech+translate
 ---
+# 🌐💬 Translate & Speak + Session Log
+[![Hugging Face Space](https://img.shields.io/badge/HuggingFace-Spaces-blue?logo=huggingface)](https://huggingface.co/spaces/your-username/translate-speak-log)
+[![Gradio UI](https://img.shields.io/badge/Gradio-5.31.0-brightgreen?logo=gradio)](https://gradio.app)
+[![Python](https://img.shields.io/badge/Python-3.10-yellow?logo=python)](https://www.python.org/)
+[![License](https://img.shields.io/badge/License-MIT-lightgrey)](LICENSE)
 ---
+## 🚀 Overview
+Harness the power of **real-time NLP**, **on-the-fly translation**, and **neural TTS** in one elegant, CPU-only pipeline. This Space transforms user text into spoken audio—any English or Spanish input gets auto-detected, translated, and voiced back—while maintaining a live session log for data-driven insights.
+**Key AI buzzwords:**
+> Natural Language Processing (NLP) • Neural Text-to-Speech • Zero-shot language detection • Real-time inference • Session state management • Cloud-native deployment • User-centric design • Cost-efficient CPU runtime
 ---
+## ✨ Features
+| 🔑 Feature                     | 🔍 Description                                                                                             |
+|--------------------------------|-------------------------------------------------------------------------------------------------------------|
+| **🔄 Bidirectional Translation** | English ↔ Spanish via `deep-translator`’s GoogleTranslator (auto-detect source language)                    |
+| **🗣️ Neural TTS**               | High-fidelity speech generation with `gTTS` (Google Translate TTS)                                          |
+| **🕒 Real-Time Processing**      | Sub-second response on free CPU tier—no GPUs, no paid APIs                                                  |
+| **📊 Session Logging**          | Data-driven UX: every input, translation, and audio event recorded in an interactive DataFrame               |
+| **🎨 Interactive UI**           | Sleek Gradio Blocks interface with controls for text input, language selector, and playback                  |
+| **🔧 Zero-Config Dev**          | Drop-in `app.py` + `requirements.txt`—Spaces auto-builds and deploys                                          |
+| **💡 Extensible Architecture**   | Modular pipelines—swap translators, TTS engines, or add analytics with minimal code changes                  |
 ---
+## 🏗️ Architecture & Workflow
+1. **User Input**
+   - Free-form text in any language (auto-detects English/Spanish).
+2. **Translation**
+   - `deep-translator` → Google Translate API wrapper → high-accuracy text conversion.
+3. **Text-to-Speech**
+   - `gTTS` → neural waveform synthesis → MP3 output.
+4. **Session Log**
+   - Maintains a rolling table of `[Input, Target Language, Translated Text]` for audit trails and usage analytics.
+5. **UI Rendering**
+   - Gradio Blocks orchestrates inputs, buttons, outputs, and state, delivering a seamless end-to-end experience.
 ---
+## 🛠️ Quick Start (Local Development)
 ```bash
+git clone https://github.com/your-username/translate-speak-log.git
+cd translate-speak-log
+python3 -m venv venv && source venv/bin/activate
 pip install -r requirements.txt
 python app.py