Nick021402 commited on
Commit
c0743bb
Β·
verified Β·
1 Parent(s): 0bdd1cb

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +26 -18
README.md CHANGED
@@ -1,34 +1,39 @@
1
  ---
2
  license: mit
3
- title: πŸŽ™οΈ PodXplain
4
  sdk: gradio
5
  emoji: πŸ“š
6
  colorFrom: red
7
  colorTo: blue
8
  pinned: true
9
- short_description: PodXplain is a Hugging Face-hosted application that converts
10
  ---
11
- # πŸŽ™οΈ PodXplain
12
 
13
- **From script to story β€” voice it like never before.**
14
 
15
- PodXplain is a Hugging Face-hosted application that converts long-form text into engaging multi-speaker podcast-style audio. Simply input your script, and get a professional-sounding MP3 podcast with automatic speaker detection and assignment.
 
 
 
 
 
 
 
16
 
17
  ## ✨ Features
18
 
19
- - **πŸ“ Long-form Support**: Handle up to 50,000 characters of text
20
- - **🎭 Multi-speaker Audio**: Automatic speaker detection and assignment
21
- - **πŸ”„ Smart Segmentation**: Intelligent text splitting with progress tracking
22
- - **🎡 High-quality Output**: MP3 format for optimal file size and compatibility
23
- - **πŸš€ Real-time Progress**: Live updates during generation
24
- - **🎨 Modern UI**: Clean, intuitive Gradio interface
25
 
26
  ## πŸ› οΈ Tech Stack
27
 
28
- - **Frontend**: Gradio for interactive web interface
29
- - **TTS Engine**: Nari DIA 1.6B for natural voice synthesis (currently mocked)
30
- - **Audio Processing**: pydub for audio manipulation and MP3 conversion
31
- - **Hosting**: Hugging Face Spaces with GPU support
32
 
33
  ## πŸ“‹ How to Use
34
 
@@ -46,11 +51,14 @@ PodXplain is a Hugging Face-hosted application that converts long-form text into
46
 
47
  ```bash
48
  # Clone the repository
49
- git clone [https://github.com/yourusername/podxplain.git](https://github.com/yourusername/podxplain.git) # Replace with your actual repo URL
50
- cd podxplain
51
 
52
  # Install dependencies
53
  pip install -r requirements.txt
54
 
55
  # Run the application
56
- python app.py
 
 
 
 
1
  ---
2
  license: mit
3
+ title: πŸŽ™οΈ PodXplainClone
4
  sdk: gradio
5
  emoji: πŸ“š
6
  colorFrom: red
7
  colorTo: blue
8
  pinned: true
9
+ short_description: ' A CPU-friendly AI podcast generator using SpeechT5.'
10
  ---
 
11
 
 
12
 
13
+ ---
14
+ # πŸŽ™οΈ PodXplainClone
15
+
16
+ **From script to story β€” voice it like never before, even on CPU.**
17
+
18
+ **PodXplainClone** is an experimental Hugging Face Space designed to demonstrate the core functionality of an AI-powered podcast generator, specifically optimized to run efficiently on **CPU hardware**. It allows users to transform written dialogue or narrative text into a natural-sounding audio podcast with multiple distinct voices.
19
+
20
+ This space serves as a **CPU-friendly alternative and development sandbox** to the main PodXplain project (which is awaiting GPU resources for a more advanced model).
21
 
22
  ## ✨ Features
23
 
24
+ - **πŸ“ Long-form Support**: Handle up to 50,000 characters of text
25
+ - **🎭 Multi-speaker Audio**: Automatic speaker detection and assignment with distinct voices
26
+ - **πŸ”„ Smart Segmentation**: Intelligent text splitting with progress tracking
27
+ - **🎡 High-quality Output**: MP3 format for optimal file size and compatibility
28
+ - **πŸš€ Real-time Progress**: Live updates during generation
29
+ - **🎨 Modern UI**: Clean, intuitive Gradio interface
30
 
31
  ## πŸ› οΈ Tech Stack
32
 
33
+ - **Frontend**: Gradio for interactive web interface
34
+ - **TTS Engine**: `microsoft/speecht5_tts` for natural, multi-speaker voice synthesis (optimized for CPU)
35
+ - **Audio Processing**: `pydub` for audio manipulation and MP3 conversion
36
+ - **Hosting**: Hugging Face Spaces (currently configured for CPU Basic tier)
37
 
38
  ## πŸ“‹ How to Use
39
 
 
51
 
52
  ```bash
53
  # Clone the repository
54
+ git clone [https://huggingface.co/spaces/Nick021402/PodXplainClone](https://huggingface.co/spaces/Nick021402/PodXplainClone) # Clone this specific Space
55
+ cd PodXplainClone
56
 
57
  # Install dependencies
58
  pip install -r requirements.txt
59
 
60
  # Run the application
61
+ python app.py
62
+
63
+ Note: This space, PodXplainClone, is a separate effort to demonstrate the core features of the PodXplain project. It was specifically created to utilize Text-to-Speech models that do not require paid GPU hardware, thereby ensuring accessibility and functionality within Hugging Face's free CPU tiers. The original PodXplain project aims to integrate larger, more advanced TTS models like Nari DIA 1.6B when dedicated GPU access becomes available.
64
+ Developed by: Nick021402