Spaces:

ghostai1
/

speechtranslate

Sleeping

App Files Files Community

speechtranslate / README.md

ghostai1

Update README.md

67a1108 verified 3 months ago

preview code

raw

history blame contribute delete

3.5 kB

	---
	title: Speechtranslate
	emoji: 🏆
	colorFrom: red
	colorTo: yellow
	sdk: gradio
	sdk_version: 5.31.0
	app_file: app.py
	pinned: false
	license: apache-2.0
	short_description: text2speech+translate
	---

	# 🌐💬 Translate & Speak + Session Log
	[![Hugging Face Space](https://img.shields.io/badge/HuggingFace-Spaces-blue?logo=huggingface)](https://huggingface.co/spaces/your-username/translate-speak-log)
	[![Gradio UI](https://img.shields.io/badge/Gradio-5.31.0-brightgreen?logo=gradio)](https://gradio.app)
	[![Python](https://img.shields.io/badge/Python-3.10-yellow?logo=python)](https://www.python.org/)
	[![License](https://img.shields.io/badge/License-MIT-lightgrey)](LICENSE)

	---

	## 🚀 Overview
	Harness the power of real-time NLP, on-the-fly translation, and neural TTS in one elegant, CPU-only pipeline. This Space transforms user text into spoken audio—any English or Spanish input gets auto-detected, translated, and voiced back—while maintaining a live session log for data-driven insights.

	Key AI buzzwords:
	> Natural Language Processing (NLP) • Neural Text-to-Speech • Zero-shot language detection • Real-time inference • Session state management • Cloud-native deployment • User-centric design • Cost-efficient CPU runtime

	---

	## ✨ Features

	\| 🔑 Feature \| 🔍 Description \|
	\|--------------------------------\|-------------------------------------------------------------------------------------------------------------\|
	\| 🔄 Bidirectional Translation \| English ↔ Spanish via `deep-translator`’s GoogleTranslator (auto-detect source language) \|
	\| 🗣️ Neural TTS \| High-fidelity speech generation with `gTTS` (Google Translate TTS) \|
	\| 🕒 Real-Time Processing \| Sub-second response on free CPU tier—no GPUs, no paid APIs \|
	\| 📊 Session Logging \| Data-driven UX: every input, translation, and audio event recorded in an interactive DataFrame \|
	\| 🎨 Interactive UI \| Sleek Gradio Blocks interface with controls for text input, language selector, and playback \|
	\| 🔧 Zero-Config Dev \| Drop-in `app.py` + `requirements.txt`—Spaces auto-builds and deploys \|
	\| 💡 Extensible Architecture \| Modular pipelines—swap translators, TTS engines, or add analytics with minimal code changes \|

	---

	## 🏗️ Architecture & Workflow

	1. User Input
	- Free-form text in any language (auto-detects English/Spanish).
	2. Translation
	- `deep-translator` → Google Translate API wrapper → high-accuracy text conversion.
	3. Text-to-Speech
	- `gTTS` → neural waveform synthesis → MP3 output.
	4. Session Log
	- Maintains a rolling table of `[Input, Target Language, Translated Text]` for audit trails and usage analytics.
	5. UI Rendering
	- Gradio Blocks orchestrates inputs, buttons, outputs, and state, delivering a seamless end-to-end experience.

	---

	## 🛠️ Quick Start (Local Development)

	```bash
	git clone https://github.com/your-username/translate-speak-log.git
	cd translate-speak-log
	python3 -m venv venv && source venv/bin/activate
	pip install -r requirements.txt
	python app.py