This is a text-to-speech Gradio webui for RVC models, using edge-tts.
Requirements: Tested for Python 3.10 on Windows 11.
git clone https://github.com/litagin02/rvc-tts-webui.git
cd rvc-tts-webui
# Download models
curl -L -O https://huggingface.co/lj1995/VoiceConversionWebUI/resolve/main/hubert_base.pt
curl -L -O https://huggingface.co/lj1995/VoiceConversionWebUI/resolve/main/rmvpe.pt
# Make virtual environment
python -m venv venv
# Activate venv (for Windows)
venv\Scripts\activate
# Install PyTorch manually if you want to use NVIDIA GPU (Windows)
# See https://pytorch.org/get-started/locally/ for more details
pip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu118
# Install requirements
pip install -r requirements.txt
Locate your RVC models in weights/
directory as follows:
weights
├── model1
│ ├── my_model1.pth
│ └── my_index_file_for_model1.index
└── model2
├── my_model2.pth
└── my_index_file_for_model2.index
...
Each model directory should contain exactly one .pth
file and at most one .index
file.
# Activate venv (for Windows)
venv\Scripts\activate
python app.py