ghostai1 commited on
Commit
f70eb7d
Β·
verified Β·
1 Parent(s): f74d6c0

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +36 -14
README.md CHANGED
@@ -6,22 +6,44 @@ colorTo: indigo
6
  sdk: static
7
  pinned: false
8
  license: apache-2.0
9
- short_description: My First Binrary tree into AI
10
  ---
11
 
12
- # Nerfies
13
 
14
- This is the repository that contains source code for the [Nerfies website](https://nerfies.github.io).
15
 
16
- If you find Nerfies useful for your work please cite:
17
- ```
18
- @article{park2021nerfies
19
- author = {Park, Keunhong and Sinha, Utkarsh and Barron, Jonathan T. and Bouaziz, Sofien and Goldman, Dan B and Seitz, Steven M. and Martin-Brualla, Ricardo},
20
- title = {Nerfies: Deformable Neural Radiance Fields},
21
- journal = {ICCV},
22
- year = {2021},
23
- }
24
- ```
25
 
26
- # Website License
27
- <a rel="license" href="http://creativecommons.org/licenses/by-sa/4.0/"><img alt="Creative Commons License" style="border-width:0" src="https://i.creativecommons.org/l/by-sa/4.0/88x31.png" /></a><br />This work is licensed under a <a rel="license" href="http://creativecommons.org/licenses/by-sa/4.0/">Creative Commons Attribution-ShareAlike 4.0 International License</a>.
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
6
  sdk: static
7
  pinned: false
8
  license: apache-2.0
9
+ short_description: My First Binary Tree into AI
10
  ---
11
 
12
+ # GHOSTVOICECBR
13
 
14
+ **GHOSTVOICECBR** is a real-time voice cloning framework built using a novel Case-Based Reasoning (CBR) binary tree and quadrant-based tone mapping system. It enables low-latency, agentic voice synthesis using emotional quads β€” pitch, timbre, speed, and mood β€” mapped onto cloned neural speech vectors.
15
 
16
+ This system is the voice synthesis engine behind the **GhostVoice AI project**, capable of generating cloned speech in real time, optimized for 8–12 GB VRAM and deployable via Hugging Face Spaces, Gradio, or Twitch integrations.
 
 
 
 
 
 
 
 
17
 
18
+ ---
19
+
20
+ ## ✨ Features
21
+
22
+ - βœ… **CBR Tree Traversal** for selecting voice tones over time
23
+ - βœ… **Quad Mapping Engine** for emotion-driven synthesis
24
+ - βœ… **Fast Inference** on local hardware (no cloud GPU required)
25
+ - βœ… **Speaker Embedding Support** for personalized cloning
26
+ - βœ… **Live Gradio UI** with console, waveform, and quad controls
27
+ - βœ… **Twitch VoiceBot Ready** (optional module)
28
+
29
+ ---
30
+
31
+ ## 🧠 Core Concepts
32
+
33
+ - **CBR Binary Tree**: Stores and retrieves historical tone vectors efficiently.
34
+ - **Quad Mapping**: Each speech sample is mapped using a 4D vector:
35
+ - `pitch`
36
+ - `speed`
37
+ - `timbre`
38
+ - `emotion`
39
+ - **Voice Matching**: Nearest neighbor match + synthetic generation
40
+ - **Open Format**: Easily extensible to other TTS models or APIs (Bark, MusicGen, etc.)
41
+
42
+ ---
43
+
44
+ ## πŸš€ How to Use
45
+
46
+ 1. Clone or fork this repo
47
+ 2. Install dependencies:
48
+ ```bash
49
+ pip install -r requirements.txt