# Seed-VC

[Hugging Face demo](https://huggingface.co/spaces/Plachta/Seed-VC) [arXiv paper](https://arxiv.org/abs/2411.09943)

*[English](README.md) | [简体中文](README-ZH.md) | 日本語*

[real-time-demo.webm](https://github.com/user-attachments/assets/86325c5e-f7f6-4a04-8695-97275a5d046c)

*(Note: This document was produced by machine translation. We have tried to keep it accurate, but if anything is unclear, please refer to the English version. Suggestions for improving the translation are welcome as PRs.)*


The currently released models support *zero-shot voice conversion* 🔊, *zero-shot real-time voice conversion* 🗣️, and *zero-shot singing voice conversion* 🎶. Without any training, they can clone a voice from 1 to 30 seconds of reference speech.

Further fine-tuning on custom data is supported to improve performance for a specific speaker or group of speakers. The data requirement is extremely low (**minimum 1 utterance per speaker**) and training is very fast (**minimum 100 steps, 2 minutes on a T4**)!

**Real-time voice conversion** is supported, with an algorithm delay of about 300 ms and a device-side delay of about 100 ms, which makes it suitable for online meetings, gaming, and live streaming.

For demos and comparisons with previous voice conversion models, see the [demo page](https://plachtaa.github.io/seed-vc/)🌐 and the [evaluation](EVAL.md)📊.

We are continuously improving model quality and adding features.

## Evaluation📊

See [EVAL.md](EVAL.md) for objective evaluation results and comparisons with other baselines.

## Installation📥

Python 3.10 on Windows or Linux is recommended.

```bash
pip install -r requirements.txt
```

## Usage🛠️

Three models are released for different purposes:

| Version | Name | Purpose | Sampling Rate | Content Encoder | Vocoder | Hidden Dim | N Layers | Params | Remarks |
|---------|------|---------|---------------|-----------------|---------|------------|----------|--------|---------|
| v1.0 | seed-uvit-tat-xlsr-tiny ([🤗](https://huggingface.co/Plachta/Seed-VC/blob/main/DiT_uvit_tat_xlsr_ema.pth)[📄](configs/presets/config_dit_mel_seed_uvit_xlsr_tiny.yml)) | Voice Conversion (VC) | 22050 | XLSR-large | HIFT | 384 | 9 | 25M | Suitable for real-time voice conversion |
| v1.0 | seed-uvit-whisper-small-wavenet ([🤗](https://huggingface.co/Plachta/Seed-VC/blob/main/DiT_seed_v2_uvit_whisper_small_wavenet_bigvgan_pruned.pth)[📄](configs/presets/config_dit_mel_seed_uvit_whisper_small_wavenet.yml)) | Voice Conversion (VC) | 22050 | Whisper-small | BigVGAN | 512 | 13 | 98M | Suitable for offline voice conversion |
| v1.0 | seed-uvit-whisper-base ([🤗](https://huggingface.co/Plachta/Seed-VC/blob/main/DiT_seed_v2_uvit_whisper_base_f0_44k_bigvgan_pruned_ft_ema.pth)[📄](configs/presets/config_dit_mel_seed_uvit_whisper_base_f0_44k.yml)) | Singing Voice Conversion (SVC) | 44100 | Whisper-small | BigVGAN | 768 | 17 | 200M | Strong zero-shot performance, singing voice conversion |

Checkpoints for the latest model release are downloaded automatically the first time inference is run.

If you cannot reach huggingface because of network restrictions, use a mirror by prefixing every command with `HF_ENDPOINT=https://hf-mirror.com`.

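When the commands are launched from a Python script rather than typed into a shell, the same mirror setting can be passed through the child process environment. The following is a minimal sketch under that assumption; `mirror_env` and `run_with_mirror` are hypothetical helper names and are not part of this repository.

```python
import os
import subprocess
import sys

HF_MIRROR = "https://hf-mirror.com"  # mirror endpoint quoted in the text above


def mirror_env(endpoint=HF_MIRROR, base_env=None):
    """Return a copy of the environment with HF_ENDPOINT set to the mirror."""
    env = dict(os.environ if base_env is None else base_env)
    env["HF_ENDPOINT"] = endpoint
    return env


def run_with_mirror(script_args, endpoint=HF_MIRROR):
    """Run `python <script_args...>` with HF_ENDPOINT set for that process only.

    This has the same effect as prefixing the shell command with
    HF_ENDPOINT=<endpoint>.
    """
    return subprocess.run(
        [sys.executable, *script_args],
        env=mirror_env(endpoint),
        check=True,
    )
```

For example, `run_with_mirror(["app.py"])` would start the integrated Web UI with the mirror enabled, leaving the environment of the calling process unchanged.
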

Command-line inference:

```bash
# For singing voice conversion, use --diffusion-steps 30-50 and --f0-condition True.
# --auto-f0-adjust True shifts the source pitch to the target's pitch level
# (normally not used for singing voice conversion).
# --semi-tone-shift is the pitch shift in semitones for singing voice conversion.
python inference.py --source <source-wav> \
    --target <reference-wav> \
    --output <output-dir> \
    --diffusion-steps 25 \
    --length-adjust 1.0 \
    --inference-cfg-rate 0.7 \
    --f0-condition False \
    --auto-f0-adjust False \
    --semi-tone-shift 0 \
    --checkpoint <path-to-checkpoint> \
    --config <path-to-config> \
    --fp16 True
```

Parameters:

- `source` is the path to the audio file to convert
- `target` is the path to the reference audio file
- `output` is the path to the output directory
- `diffusion-steps` is the number of diffusion steps; default is 25, use 30-50 for best quality and 4-10 for fastest inference
- `length-adjust` is the length adjustment factor; default is 1.0, values <1.0 shorten the speech and values >1.0 lengthen it
- `inference-cfg-rate` subtly changes the output; default is 0.7
- `f0-condition` is a flag that conditions the output on the pitch of the source audio; default is False, set to True for singing voice conversion
- `auto-f0-adjust` is a flag that automatically adjusts the source pitch to the target's pitch level; default is False, normally not used for singing voice conversion
- `semi-tone-shift` is the pitch shift in semitones for singing voice conversion; default is 0
- `checkpoint` is the path to the model checkpoint if you have trained or fine-tuned your own model; leave it blank to auto-download the default model from huggingface (`seed-uvit-whisper-small-wavenet` if `f0-condition` is `False`, otherwise `seed-uvit-whisper-base`)
- `config` is the path to the model config if you have trained or fine-tuned your own model; leave it blank to auto-download the default config from huggingface
- `fp16` is a flag that enables float16 inference; default is True

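The recommendations above differ between voice conversion and singing voice conversion, so a small helper can keep the two presets consistent when the script is called programmatically. This is only a sketch: `build_inference_args` is a hypothetical name, the file names are placeholders, and the only flags used are the ones documented above. `--checkpoint` and `--config` are left out so that the default model is downloaded automatically.

```python
import shlex


def build_inference_args(source, target, output, singing=False,
                         diffusion_steps=None, semi_tone_shift=0):
    """Assemble an argument list for inference.py from the documented flags.

    singing=True turns on f0 conditioning and raises the default step count
    to 30, the low end of the 30-50 range recommended for singing.
    """
    if diffusion_steps is None:
        diffusion_steps = 30 if singing else 25
    return [
        "inference.py",
        "--source", str(source),
        "--target", str(target),
        "--output", str(output),
        "--diffusion-steps", str(diffusion_steps),
        "--length-adjust", "1.0",
        "--inference-cfg-rate", "0.7",
        "--f0-condition", "True" if singing else "False",
        "--auto-f0-adjust", "False",
        "--semi-tone-shift", str(semi_tone_shift),
        "--fp16", "True",
    ]


if __name__ == "__main__":
    # Placeholder file names, for illustration only.
    args = build_inference_args("song.wav", "reference.wav", "out", singing=True)
    print("python " + shlex.join(args))
```

The resulting list can be handed to `subprocess.run([sys.executable, *args])`, or printed as above and pasted into a shell.
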

Voice conversion Web UI:

```bash
python app_vc.py --checkpoint <path-to-checkpoint> --config <path-to-config> --fp16 True
```

- `checkpoint` is the path to the model checkpoint if you have trained or fine-tuned your own model; leave it blank to auto-download the default model from huggingface (`seed-uvit-whisper-small-wavenet`)
- `config` is the path to the model config if you have trained or fine-tuned your own model; leave it blank to auto-download the default config from huggingface

Then open `http://localhost:7860/` in your browser to use the web interface.

Singing voice conversion Web UI:

```bash
python app_svc.py --checkpoint <path-to-checkpoint> --config <path-to-config> --fp16 True
```

- `checkpoint` is the path to the model checkpoint if you have trained or fine-tuned your own model; leave it blank to auto-download the default model from huggingface (`seed-uvit-whisper-base`)
- `config` is the path to the model config if you have trained or fine-tuned your own model; leave it blank to auto-download the default config from huggingface

Integrated Web UI:

```bash
python app.py
```

This loads only the pretrained models for zero-shot inference. To use a custom checkpoint, run `app_vc.py` or `app_svc.py` as described above.


Real-time voice conversion GUI:

```bash
python real-time-gui.py --checkpoint-path <path-to-checkpoint> --config-path <path-to-config>
```

- `checkpoint-path` is the path to the model checkpoint if you have trained or fine-tuned your own model; leave it blank to auto-download the default model from huggingface (`seed-uvit-tat-xlsr-tiny`)
- `config-path` is the path to the model config if you have trained or fine-tuned your own model; leave it blank to auto-download the default config from huggingface

IMPORTANT: Using a GPU is strongly recommended for real-time voice conversion.

Some performance tests were run on an NVIDIA RTX 3060 laptop GPU. The results and recommended parameter settings are listed below:

| Model Configuration | Diffusion Steps | Inference CFG Rate | Max Prompt Length | Block Time (s) | Crossfade Length (s) | Extra context (left) (s) | Extra context (right) (s) | Latency (ms) | Inference Time per Chunk (ms) |
|---------------------|-----------------|--------------------|-------------------|----------------|----------------------|--------------------------|---------------------------|--------------|-------------------------------|
| seed-uvit-xlsr-tiny | 10 | 0.7 | 3.0 | 0.18 | 0.04 | 2.5 | 0.02 | 430 | 150 |

You can adjust the parameters in the GUI to match the performance of your own device. The voice conversion stream should work correctly as long as the inference time is shorter than the block time.

Note that inference speed may drop if other GPU-intensive tasks (e.g. gaming, watching videos) are running.

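The condition above, inference time shorter than block time, can be checked with a few lines once both numbers are known. The sketch below uses the values from the RTX 3060 row of the table; `realtime_headroom` is a hypothetical helper name, not something provided by the GUI.

```python
def realtime_headroom(inference_ms, block_time_s):
    """Return (keeps_up, load) for one audio chunk.

    load is inference time divided by block time; the stream keeps up
    only while load stays below 1.0.
    """
    block_ms = block_time_s * 1000.0
    load = inference_ms / block_ms
    return load < 1.0, load


# Values from the table above: 150 ms per chunk, 0.18 s block time.
keeps_up, load = realtime_headroom(inference_ms=150, block_time_s=0.18)
print(f"keeps up: {keeps_up}, load: {load:.2f}")
```

A load close to 1.0 leaves little margin for other GPU work, so it may be worth raising the block time or lowering the diffusion steps before that point.
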

Real-time voice conversion GUI parameters:

- `Diffusion Steps` is the number of diffusion steps; for real-time conversion it is usually set to 4~10 for fastest inference
- `Inference CFG Rate` subtly changes the output; default is 0.7, and setting it to 0.0 gives about a 1.5x inference speed-up
- `Max Prompt Length` is the maximum length of the prompt audio; a lower value speeds up inference but may reduce similarity to the prompt speech
- `Block Time` is the duration of each audio chunk used for inference; the higher the value, the higher the latency. Note that this value must be greater than the inference time per block, so set it according to your hardware
- `Crossfade Length` is the duration of the crossfade between audio chunks; normally it does not need to be changed
- `Extra context (left)` is the duration of extra history context used for inference; a higher value increases inference time but improves stability
- `Extra context (right)` is the duration of extra future context used for inference; a higher value increases inference time and latency but improves stability

The algorithm delay is approximately `Block Time * 2 + Extra context (right)`, and the device-side delay is usually around 100 ms. The overall delay is the sum of the two.

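The delay formula can be turned into a quick estimate when tuning the GUI sliders. The following sketch applies it to the settings from the performance table; the function names are hypothetical, and the 100 ms device-side delay is the rough figure quoted above, not a measured value.

```python
def algorithm_delay_ms(block_time_s, extra_context_right_s):
    """Algorithm delay in ms: Block Time * 2 + Extra context (right)."""
    return (block_time_s * 2 + extra_context_right_s) * 1000.0


def overall_delay_ms(block_time_s, extra_context_right_s, device_delay_ms=100.0):
    """Algorithm delay plus the device-side delay (assumed ~100 ms by default)."""
    return algorithm_delay_ms(block_time_s, extra_context_right_s) + device_delay_ms


# Block Time 0.18 s and Extra context (right) 0.02 s, as in the table above.
print(round(algorithm_delay_ms(0.18, 0.02)))
print(round(overall_delay_ms(0.18, 0.02)))
```

Because the formula is approximate and the device-side delay varies between setups, a measured latency such as the one reported in the table need not match this estimate exactly.
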

You can use [VB-CABLE](https://vb-audio.com/Cable/) to route the GUI output stream to a virtual microphone.

*(The GUI and audio chunking logic are adapted from [RVC](https://github.com/RVC-Project/Retrieval-based-Voice-Conversion-WebUI). Thanks for their brilliant implementation!)*


## Training🏋️

Fine-tuning on custom data lets the model clone a voice more accurately. It greatly improves speaker similarity for the specific speaker, but may slightly increase WER.

A Colab tutorial walks through the steps: [Open in Colab](https://colab.research.google.com/drive/1R1BJTqMsTXZzYAVx3j1BiemFXog9pbQG?usp=sharing)

1. Prepare your own dataset. It must satisfy the following:
    - File structure does not matter
    - Each audio file must be between 1 and 30 seconds long; files outside this range are ignored
    - All audio files must be in one of the following formats: `.wav` `.flac` `.mp3` `.m4a` `.opus` `.ogg`
    - Speaker labels are not required, but make sure each speaker has at least 1 utterance
    - Of course, the more data you have, the better the model performs
    - Training data should be as clean as possible; BGM and noise are undesirable

2. Choose a model configuration file from `configs/presets/` for fine-tuning, or create your own configuration to train from scratch.
    - For fine-tuning, choose one of the following:
        - `./configs/presets/config_dit_mel_seed_uvit_xlsr_tiny.yml` for real-time voice conversion
        - `./configs/presets/config_dit_mel_seed_uvit_whisper_small_wavenet.yml` for offline voice conversion
        - `./configs/presets/config_dit_mel_seed_uvit_whisper_base_f0_44k.yml` for singing voice conversion

3. Start training with the following command:

```bash
python train.py \
    --config <path-to-config> \
    --dataset-dir <path-to-data> \
    --run-name <run-name> \
    --batch-size 2 \
    --max-steps 1000 \
    --max-epochs 1000 \
    --save-every 500 \
    --num-workers 0
```

Parameters:

- `config` is the path to the model config; choose one of the presets above for fine-tuning, or create your own to train from scratch
- `dataset-dir` is the path to the dataset directory, which must be a folder containing all the audio files
- `run-name` is the name of the run, used when saving model checkpoints and logs
- `batch-size` is the training batch size; choose it according to your GPU memory
- `max-steps` is the maximum number of training steps; choose it according to your dataset size and training time
- `max-epochs` is the maximum number of training epochs; choose it according to your dataset size and training time
- `save-every` is the number of steps between model checkpoint saves
- `num-workers` is the number of data loading workers; set it to 0 on Windows

4. If training stops unexpectedly, run the same command again to resume from the last checkpoint (make sure the `run-name` and `config` arguments are unchanged so that the latest checkpoint can be found).

5. After training, use the trained model for inference by specifying the paths to the checkpoint and config file.
    - Both are located under `./runs/<run-name>/`; the checkpoint is named `ft_model.pth` and the config file has the same name as the training config file.
    - At inference time you still need to specify a reference audio file of the speaker you want to use, just as in zero-shot usage.

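Before starting a run, it can help to pre-check a dataset folder against the requirements in step 1. The sketch below is not the training script's own loader. It relies on the Python standard library only, so it reads durations for PCM `.wav` files and lists every other supported format as unchecked, since decoding those would need a third-party library. The function and variable names are hypothetical.

```python
import wave
from pathlib import Path

SUPPORTED_EXTENSIONS = {".wav", ".flac", ".mp3", ".m4a", ".opus", ".ogg"}
MIN_SECONDS, MAX_SECONDS = 1.0, 30.0


def wav_duration_seconds(path):
    """Duration of a PCM .wav file, computed from its header."""
    with wave.open(str(path), "rb") as wav_file:
        return wav_file.getnframes() / wav_file.getframerate()


def scan_dataset(dataset_dir):
    """Sort audio files into usable, out-of-range, and unchecked lists.

    Files with unsupported extensions are skipped. Only .wav durations
    are verified; other supported formats end up in `unchecked`.
    """
    usable, out_of_range, unchecked = [], [], []
    for path in sorted(Path(dataset_dir).rglob("*")):
        suffix = path.suffix.lower()
        if not path.is_file() or suffix not in SUPPORTED_EXTENSIONS:
            continue
        if suffix != ".wav":
            unchecked.append(path)
            continue
        try:
            duration = wav_duration_seconds(path)
        except (wave.Error, EOFError):
            # Not a PCM wav the stdlib can parse; leave it for manual review.
            unchecked.append(path)
            continue
        if MIN_SECONDS <= duration <= MAX_SECONDS:
            usable.append(path)
        else:
            out_of_range.append(path)
    return usable, out_of_range, unchecked
```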
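The file layout described in step 5 is enough to derive both paths needed for inference from the run name and the training config. A minimal sketch, assuming the default `runs` directory; `finetuned_model_paths` and the run name `my-run` are hypothetical.

```python
from pathlib import Path


def finetuned_model_paths(run_name, training_config, runs_root="runs"):
    """Return (checkpoint, config) paths for a finished fine-tuning run.

    Follows the layout described above: both files sit in
    ./runs/<run-name>/, the checkpoint is ft_model.pth, and the config
    keeps the file name of the training config.
    """
    run_dir = Path(runs_root) / run_name
    return run_dir / "ft_model.pth", run_dir / Path(training_config).name


checkpoint, config = finetuned_model_paths(
    "my-run",  # hypothetical run name
    "configs/presets/config_dit_mel_seed_uvit_whisper_small_wavenet.yml",
)
print("--checkpoint", checkpoint, "--config", config)
```

The two values can then be passed to `inference.py`, `app_vc.py`, or `app_svc.py` through their `--checkpoint` and `--config` flags, together with a reference audio file for the target speaker.
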

## TODO📝

- [x] Release code
- [x] Release pretrained models: [Hugging Face model](https://huggingface.co/Plachta/Seed-VC)
- [x] Huggingface space demo: [Hugging Face Space](https://huggingface.co/spaces/Plachta/Seed-VC)
- [x] HTML demo page: [Demo](https://plachtaa.github.io/seed-vc/)
- [x] Streaming inference
- [x] Reduce streaming inference latency
- [x] Demo video for real-time voice conversion
- [x] Singing voice conversion
- [x] Noise resiliency for source audio
- [ ] Potential architecture improvements
    - [x] U-ViT style skip connections
    - [x] Changed input to OpenAI Whisper
    - [x] Time as Token
- [x] Code for training on custom data
- [x] Few-shot/one-shot speaker fine-tuning
- [x] Changed to BigVGAN from NVIDIA for singing voice decoding
- [x] Whisper version model for singing voice conversion
- [x] Objective evaluation and comparison with RVC/SoVITS for singing voice conversion
- [x] Improve audio quality
- [ ] NSF vocoder for better singing voice conversion
- [x] Fix real-time voice conversion artifacts while not talking (done by adding a VAD model)
- [x] Colab notebook for the fine-tuning example
- [ ] Replace Whisper with a more advanced semantic extractor
- [ ] More to be added

## Changelog🗒️

- 2024-11-26:
    - Updated the v1.0 tiny version pretrained model, optimized for real-time voice conversion
    - Added support for one-shot/few-shot single/multi-speaker fine-tuning
    - Added support for custom checkpoints in the web UI and the real-time GUI
- 2024-11-19:
    - arXiv paper released
- 2024-10-28:
    - Updated the fine-tuned 44k singing voice conversion model with better audio quality
- 2024-10-27:
    - Added the real-time voice conversion GUI
- 2024-10-25:
    - Added comprehensive evaluation results and comparisons with RVCv2 for singing voice conversion
- 2024-10-24:
    - Updated the 44kHz singing voice conversion model, which uses OpenAI Whisper as the speech content input
- 2024-10-07:
    - Updated the v0.3 pretrained model, changing the speech content encoder to OpenAI Whisper
    - Added objective evaluation results for the v0.3 pretrained model
- 2024-09-22:
    - Updated the singing voice conversion model to use BigVGAN from NVIDIA, greatly improving high-pitched singing voices
    - Added support for chunking and streaming output for long audio files in the Web UI
- 2024-09-18:
    - Updated the f0-conditioned model for singing voice conversion
- 2024-09-14:
    - Updated the v0.2 pretrained model, with a smaller size and fewer diffusion steps to reach the same quality, plus the added ability to control prosody preservation
    - Added the command-line inference script
    - Added installation and usage instructions