NeuralFalconYT/Kokoro-82M-WebUI

Kokoro-TTS

Note: This is not the official repository.
The written code is not well-organized.

Alternative ways to use Kokoro-TTS: kokoro-onnx, Kokoro-FastAPI, kokoro, kokoro-web, Kokoro-Custom-Voice

Open In Colab
HuggingFace Space Demo


Installation Tutorial

My Python version is 3.10.9.

1. Clone the GitHub Repository:

git clone https://github.com/NeuralFalconYT/Kokoro-82M-WebUI.git
cd Kokoro-82M-WebUI

2. Create a Python Virtual Environment:

python -m venv myenv

This command creates a new Python virtual environment named myenv for isolating dependencies.

3. Activate the Virtual Environment:

  • For Windows:
    myenv\Scripts\activate
  • For Linux:
    source myenv/bin/activate

This activates the virtual environment, so that dependencies are installed and run in isolation from the system Python.
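To confirm that activation worked before installing anything, a short check like the following can help. It is a minimal sketch using only the standard library; the function names `in_virtualenv` and `environment_summary` are made up for this example and are not part of this repository.

```python
import sys


def in_virtualenv() -> bool:
    """Return True when the interpreter runs inside a venv-style environment."""
    # Inside a venv, sys.prefix points at the environment directory,
    # while sys.base_prefix still points at the base Python installation.
    return sys.prefix != sys.base_prefix


def environment_summary() -> str:
    """Describe the running Python version and whether a venv is active."""
    version = ".".join(str(part) for part in sys.version_info[:3])
    status = "active" if in_virtualenv() else "not active"
    return f"Python {version}, virtual environment {status} ({sys.prefix})"


if __name__ == "__main__":
    print(environment_summary())
```

If the summary reports that the virtual environment is not active, repeat the activation command for your platform and run the check again.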

4. Install PyTorch:

  • For GPU (CUDA-enabled installation):
    • Check CUDA Version (for GPU setup):

      nvcc --version

      Note your CUDA version in the output, for example 11.8.

    • Visit the PyTorch Get Started page and install the build compatible with your CUDA setup:

      • For CUDA 11.8:
        pip install torch --index-url https://download.pytorch.org/whl/cu118

      • For CUDA 12.1:
        pip install torch --index-url https://download.pytorch.org/whl/cu121

      • For CUDA 12.4:
        pip install torch --index-url https://download.pytorch.org/whl/cu124
      
  • For CPU (if not using GPU):
    pip install torch
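    On Windows this installs the CPU-only build of PyTorch. On Linux the default wheel bundles CUDA libraries but still runs on CPU; adding --index-url https://download.pytorch.org/whl/cpu selects the smaller CPU-only build.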
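The version lookup above can also be scripted. The sketch below parses the `release X.Y` field from the `nvcc --version` output and maps it to one of the three index URLs listed in this step. The helper names are hypothetical, and falling back to the plain `pip install torch` command for any other version (or when `nvcc` is missing) is an assumption of this example, not a recommendation from PyTorch.

```python
import re
import shutil
import subprocess
from typing import Optional

# Wheel indexes copied from the commands in this step; other CUDA versions are not covered.
INDEX_URLS = {
    "11.8": "https://download.pytorch.org/whl/cu118",
    "12.1": "https://download.pytorch.org/whl/cu121",
    "12.4": "https://download.pytorch.org/whl/cu124",
}


def parse_cuda_release(nvcc_output: str) -> Optional[str]:
    """Extract the 'release X.Y' version from `nvcc --version` output."""
    match = re.search(r"release (\d+\.\d+)", nvcc_output)
    return match.group(1) if match else None


def detect_cuda_release() -> Optional[str]:
    """Run `nvcc --version` if nvcc is on PATH; return None otherwise."""
    if shutil.which("nvcc") is None:
        return None
    result = subprocess.run(
        ["nvcc", "--version"], capture_output=True, text=True, check=False
    )
    return parse_cuda_release(result.stdout)


def pip_command(cuda_release: Optional[str]) -> str:
    """Build the pip command for a CUDA release listed in INDEX_URLS."""
    url = INDEX_URLS.get(cuda_release or "")
    if url is None:
        # Assumption: use the plain command when the version is unknown or absent.
        return "pip install torch"
    return f"pip install torch --index-url {url}"


if __name__ == "__main__":
    print(pip_command(detect_cuda_release()))
```

The script only prints the suggested command, so you can review it before running it yourself.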

5. Install Required Dependencies:

pip install -r requirements.txt

This installs all the required Python libraries listed in the requirements.txt file.

6. Download Model and Get Latest VoicePack:

python download_model.py

7. Install eSpeak NG:

  • For Windows:

    1. Download the latest eSpeak NG release from the eSpeak NG GitHub Releases.
    2. Locate and download the file named espeak-ng-X64.msi.
    3. Run the installer and follow the installation steps. Ensure that you install eSpeak NG in the default directory:
      C:\Program Files\eSpeak NG
      

      Note: This default path is required for the application to locate eSpeak NG properly.

  • For Linux:

    1. Open your terminal.
    2. Install eSpeak NG using the following command:
      sudo apt-get -qq -y install espeak-ng > /dev/null 2>&1

      Note: This command suppresses all output, including error messages. If the installation seems to fail, rerun it without the "> /dev/null 2>&1" redirection.
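Because the application depends on finding eSpeak NG, it can be useful to verify the installation before launching. The following is a minimal sketch, not the lookup logic used by this repository: it searches PATH for `espeak-ng` and then checks the default Windows directory named above. The function name `find_espeak_ng` and the assumption that the installer places `espeak-ng.exe` directly in that directory are illustrative.

```python
import os
import shutil
from typing import Optional

# Default Windows install directory named in the steps above.
WINDOWS_DEFAULT_DIR = r"C:\Program Files\eSpeak NG"


def find_espeak_ng(windows_dir: str = WINDOWS_DEFAULT_DIR) -> Optional[str]:
    """Return the path to the espeak-ng executable, or None if it is not found."""
    # First look on PATH, which covers a typical Linux package install.
    on_path = shutil.which("espeak-ng")
    if on_path:
        return on_path
    # Assumption: the Windows installer puts espeak-ng.exe directly in the install directory.
    candidate = os.path.join(windows_dir, "espeak-ng.exe")
    return candidate if os.path.isfile(candidate) else None


if __name__ == "__main__":
    location = find_espeak_ng()
    print(location if location else "eSpeak NG was not found")
```

If nothing is found on Windows, reinstall eSpeak NG into the default directory, since the application expects that path.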


8. Install FFmpeg (Linux only):

Skip this step if you are using Windows. FFmpeg is only needed for the subtitle dubbing feature, so you can also skip this step if you only want to use Kokoro TTS.

  sudo apt-get update
  sudo apt-get install -y ffmpeg

9. Run the Gradio App:

To run the Gradio app, follow these steps:

  1. Activate the Virtual Environment:

    • For Windows:
      myenv\Scripts\activate
    • For Linux:
      source myenv/bin/activate
  2. Run the Application:

    python app.py

    Alternatively, on Windows, double-click on run_app.bat to start the application.


License

The Kokoro model is licensed under the Apache License 2.0.
The inference code adapted from StyleTTS2 is MIT licensed.

Credits

Model: Kokoro HuggingFace

Podcast Generation Inspiration: E2-F5-TTS

Voice Mix Feature: Make Custom Voices With KokoroTTS

AI Assistance:
ChatGPT
Google AI Studio
GitHub Copilot
