Spaces: kevinwang676/11Labs-OpenVoice-v2

kevinwang676 committed on May 8, 2024

Commit 0d6d64d (verified) · 1 Parent(s): d64a75e

Update README.md

README.md +12 -74

@@ -1,74 +1,12 @@
-<div align="center">
-  <div>&nbsp;</div>
-  <img src="resources/openvoicelogo.jpg" width="400"/>
-[Paper](https://arxiv.org/abs/2312.01479) |
-[Website](https://research.myshell.ai/open-voice)
-</div>
-## Introduction
-### OpenVoice V1
-As we detailed in our [paper](https://arxiv.org/abs/2312.01479) and [website](https://research.myshell.ai/open-voice), the advantages of OpenVoice are three-fold:
-**1. Accurate Tone Color Cloning.**
-OpenVoice can accurately clone the reference tone color and generate speech in multiple languages and accents.
-**2. Flexible Voice Style Control.**
-OpenVoice enables granular control over voice styles, such as emotion and accent, as well as other style parameters including rhythm, pauses, and intonation.
-**3. Zero-shot Cross-lingual Voice Cloning.**
-Neither of the language of the generated speech nor the language of the reference speech needs to be presented in the massive-speaker multi-lingual training dataset.
-### OpenVoice V2
-In April 2024, we released OpenVoice V2, which includes all features in V1 and has:
-**1. Better Audio Quality.**
-OpenVoice V2 adopts a different training strategy that delivers better audio quality.
-**2. Native Multi-lingual Support.**
-English, Spanish, French, Chinese, Japanese and Korean are natively supported in OpenVoice V2.
-**3. Free Commercial Use.**
-Starting from April 2024, both V2 and V1 are released under MIT License. Free for commercial use.
-[Video](https://github.com/myshell-ai/OpenVoice/assets/40556743/3cba936f-82bf-476c-9e52-09f0f417bb2f)
-OpenVoice has been powering the instant voice cloning capability of [myshell.ai](https://app.myshell.ai/explore) since May 2023. Until Nov 2023, the voice cloning model has been used tens of millions of times by users worldwide, and witnessed the explosive user growth on the platform.
-## Main Contributors
-- [Zengyi Qin](https://www.qinzy.tech) at MIT and MyShell
-- [Wenliang Zhao](https://wl-zhao.github.io) at Tsinghua University
-- [Xumin Yu](https://yuxumin.github.io) at Tsinghua University
-- [Ethan Sun](https://twitter.com/ethan_myshell) at MyShell
-## How to Use
-Please see [usage](docs/USAGE.md) for detailed instructions.
-## Common Issues
-Please see [QA](docs/QA.md) for common questions and answers. We will regularly update the question and answer list.
-## Join Our Community
-Join our [Discord community](https://discord.gg/myshell) and select the `Developer` role upon joining to gain exclusive access to our developer-only channel! Don't miss out on valuable discussions and collaboration opportunities.
-## Citation
-```
-@article{qin2023openvoice,
-  title={OpenVoice: Versatile Instant Voice Cloning},
-  author={Qin, Zengyi and Zhao, Wenliang and Yu, Xumin and Sun, Xin},
-  journal={arXiv preprint arXiv:2312.01479},
-  year={2023}
-}
-```
-## License
-OpenVoice V1 and V2 are MIT Licensed. Free for both commercial and research use.
-## Acknowledgements
-This implementation is based on several excellent projects, [TTS](https://github.com/coqui-ai/TTS), [VITS](https://github.com/jaywalnut310/vits), and [VITS2](https://github.com/daniilrobnikov/vits2). Thanks for their awesome work!

+---
+title: 11Labs OpenVoice V2
+emoji: 🐨
+colorFrom: purple
+colorTo: blue
+sdk: gradio
+sdk_version: 4.29.0
+app_file: app.py
+pinned: false
+license: mit
+---
+Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference
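
The added lines are the Space's configuration block. As a rough illustration of how such a block can be read back, here is a minimal Python sketch. It assumes the block sits at the top of README.md between `---` delimiters, as the Spaces configuration reference describes, and that every entry is a flat `key: value` pair. `parse_front_matter` is a hypothetical helper written for this example, not part of any Hugging Face library, and it is not a general YAML parser.

```python
def parse_front_matter(readme_text):
    """Return the flat `key: value` pairs from a leading `---` block.

    Hypothetical helper for illustration only: it handles simple scalar
    entries and keeps every value as a string.
    """
    lines = readme_text.splitlines()
    # The block must start on the very first line of the file.
    if not lines or lines[0].strip() != "---":
        return {}
    config = {}
    for line in lines[1:]:
        if line.strip() == "---":
            # Closing delimiter reached: the block is complete.
            return config
        key, sep, value = line.partition(":")
        if sep:
            config[key.strip()] = value.strip()
    # No closing delimiter was found, so treat the block as absent.
    return {}


# README text mirroring the lines added in this commit; the `---`
# delimiters are an assumption about the file's layout.
README = """---
title: 11Labs OpenVoice V2
emoji: 🐨
colorFrom: purple
colorTo: blue
sdk: gradio
sdk_version: 4.29.0
app_file: app.py
pinned: false
license: mit
---
Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference
"""

config = parse_front_matter(README)
print(config["sdk"], config["sdk_version"], config["app_file"])
```

Because the sketch keeps values as plain strings, `pinned` comes back as the text `false` rather than a boolean; a full YAML parser would be the safer choice for anything beyond a quick check.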