Spaces:

Nick021402
/

PodXplainClone

Sleeping

App Files Files Community

Nick021402 commited on May 24

Commit

c0743bb

verified ·

1 Parent(s): 0bdd1cb

Update README.md

Browse files

Files changed (1) hide show

README.md +26 -18

README.md CHANGED Viewed

@@ -1,34 +1,39 @@
 ---
 license: mit
-title: 🎙️ PodXplain
 sdk: gradio
 emoji: 📚
 colorFrom: red
 colorTo: blue
 pinned: true
-short_description: PodXplain is a Hugging Face-hosted application that converts
 ---
-# 🎙️ PodXplain
-**From script to story — voice it like never before.**
-PodXplain is a Hugging Face-hosted application that converts long-form text into engaging multi-speaker podcast-style audio. Simply input your script, and get a professional-sounding MP3 podcast with automatic speaker detection and assignment.
 ## ✨ Features
-- **📝 Long-form Support**: Handle up to 50,000 characters of text
-- **🎭 Multi-speaker Audio**: Automatic speaker detection and assignment
-- **🔄 Smart Segmentation**: Intelligent text splitting with progress tracking
-- **🎵 High-quality Output**: MP3 format for optimal file size and compatibility
-- **🚀 Real-time Progress**: Live updates during generation
-- **🎨 Modern UI**: Clean, intuitive Gradio interface
 ## 🛠️ Tech Stack
-- **Frontend**: Gradio for interactive web interface
-- **TTS Engine**: Nari DIA 1.6B for natural voice synthesis (currently mocked)
-- **Audio Processing**: pydub for audio manipulation and MP3 conversion
-- **Hosting**: Hugging Face Spaces with GPU support
 ## 📋 How to Use
@@ -46,11 +51,14 @@ PodXplain is a Hugging Face-hosted application that converts long-form text into
 ```bash
 # Clone the repository
-git clone [https://github.com/yourusername/podxplain.git](https://github.com/yourusername/podxplain.git) # Replace with your actual repo URL
-cd podxplain
 # Install dependencies
 pip install -r requirements.txt
 # Run the application
-python app.py

 ---
 license: mit
+title: 🎙️ PodXplainClone
 sdk: gradio
 emoji: 📚
 colorFrom: red
 colorTo: blue
 pinned: true
+short_description: ' A CPU-friendly AI podcast generator using SpeechT5.'
 ---
+---
+# 🎙️ PodXplainClone
+**From script to story — voice it like never before, even on CPU.**
+**PodXplainClone** is an experimental Hugging Face Space designed to demonstrate the core functionality of an AI-powered podcast generator, specifically optimized to run efficiently on **CPU hardware**. It allows users to transform written dialogue or narrative text into a natural-sounding audio podcast with multiple distinct voices.
+This space serves as a **CPU-friendly alternative and development sandbox** to the main PodXplain project (which is awaiting GPU resources for a more advanced model).
 ## ✨ Features
+-   **📝 Long-form Support**: Handle up to 50,000 characters of text
+-   **🎭 Multi-speaker Audio**: Automatic speaker detection and assignment with distinct voices
+-   **🔄 Smart Segmentation**: Intelligent text splitting with progress tracking
+-   **🎵 High-quality Output**: MP3 format for optimal file size and compatibility
+-   **🚀 Real-time Progress**: Live updates during generation
+-   **🎨 Modern UI**: Clean, intuitive Gradio interface
 ## 🛠️ Tech Stack
+-   **Frontend**: Gradio for interactive web interface
+-   **TTS Engine**: `microsoft/speecht5_tts` for natural, multi-speaker voice synthesis (optimized for CPU)
+-   **Audio Processing**: `pydub` for audio manipulation and MP3 conversion
+-   **Hosting**: Hugging Face Spaces (currently configured for CPU Basic tier)
 ## 📋 How to Use
 ```bash
 # Clone the repository
+git clone [https://huggingface.co/spaces/Nick021402/PodXplainClone](https://huggingface.co/spaces/Nick021402/PodXplainClone) # Clone this specific Space
+cd PodXplainClone
 # Install dependencies
 pip install -r requirements.txt
 # Run the application
+python app.py
+Note: This space, PodXplainClone, is a separate effort to demonstrate the core features of the PodXplain project. It was specifically created to utilize Text-to-Speech models that do not require paid GPU hardware, thereby ensuring accessibility and functionality within Hugging Face's free CPU tiers. The original PodXplain project aims to integrate larger, more advanced TTS models like Nari DIA 1.6B when dedicated GPU access becomes available.
+Developed by: Nick021402