This guide will show you how to use Google Colab's free computing power to train high-quality RVC voice conversion models. For the best results, we standardize the process into three stages:
Model quality depends about 90% on dataset quality. Make sure your training audio is a clean, dry voice recording with no background music (BGM) and no reverberation (reverb/echo).
vocals.wav.
Vocals separated by this application usually still contain spatial reverberation, which will cause model training to fail (the voice sounds muddy). Make sure to run them through UVR5 (Ultimate Vocal Remover) for secondary processing.
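320 (Default) or 512
10
Name the files with simple numbers (001.wav, 002.wav), avoiding special symbols.
Create a folder named dataset.
Put all the .wav files into this folder.
Compress the folder into a dataset.zip file.
Structure Example: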
dataset.zip
└── dataset/
    ├── 001.wav
    ├── 002.wav
    └── ...
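The packaging steps above can be sketched in Python using only the standard library. This is a minimal illustration, not part of any RVC tool: the function name `build_dataset_zip` and the source folder `raw_clips` are hypothetical, and it assumes you are happy to have the clips renumbered in sorted filename order.

```python
import zipfile
from pathlib import Path


def build_dataset_zip(src_dir, zip_path="dataset.zip", folder="dataset"):
    """Pack every .wav in src_dir into zip_path under a top-level folder.

    Files are stored as 001.wav, 002.wav, ... (sorted by original name),
    so the archive contains no special symbols in its file names.
    Returns the list of archive member names that were written.
    """
    wavs = sorted(
        p for p in Path(src_dir).iterdir()
        if p.is_file() and p.suffix.lower() == ".wav"
    )
    if not wavs:
        raise ValueError(f"no .wav files found in {src_dir}")

    # Use at least three digits; widen automatically for 1000+ clips.
    width = max(3, len(str(len(wavs))))
    members = []
    with zipfile.ZipFile(zip_path, "w", zipfile.ZIP_DEFLATED) as zf:
        for i, wav in enumerate(wavs, start=1):
            arcname = f"{folder}/{i:0{width}d}.wav"
            zf.write(wav, arcname)  # original files on disk are left untouched
            members.append(arcname)
    return members
```

Calling `build_dataset_zip("raw_clips")` writes `dataset.zip` in the current directory with the layout shown in the structure example. The original files are not renamed or moved, so the step can be repeated safely after adding more clips.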
We recommend using Applio Colab (currently the most powerful RVC fork).
Click the play button (▶) on the left of each block in order:
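When a public link appears (ending in gradio.live), click it to enter the graphical interface.
Model name: any name you like (e.g. my_voice).
Sample rate: 40k or 48k.
Dataset: dataset.zip.
Pitch extraction method: rmvpe (best results).
Epochs: 100 ~ 300 (train more rounds if you have less data).
Batch size: 8 ~ 12.
Once training is complete, download the .pth model file and the .index index file from Colab.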
models/RVC/.
My_AI_Voice).