Skip to content
View w-okada's full-sized avatar

Sponsors

@unsolublesugar
@grimoire-vc

Block or report w-okada

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

Python 4,954 480 Updated Dec 26, 2024

zero-shot voice conversion & singing voice conversion, with real-time support

Python 988 122 Updated Jan 17, 2025

Versatile Evaluation of Speech and Audio

Python 156 13 Updated Feb 8, 2025

声質変換 VST

C++ 26 2 Updated Jan 19, 2025

StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models

Python 5,417 483 Updated Aug 10, 2024
Python 6 Updated Dec 6, 2024

Multilingual Voice Understanding Model

Python 4,309 379 Updated Jan 8, 2025

Python interface to the WebRTC Voice Activity Detector

C 2,137 410 Updated Jul 4, 2024

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 40,085 4,493 Updated Jan 18, 2025

The lean application framework for Python. Build sophisticated user interfaces with a simple Python API. Run your apps in the terminal and a web browser.

Python 27,245 844 Updated Feb 9, 2025

Library for building powerful interactive command line applications in Python

Python 9,521 726 Updated Jan 21, 2025

a lightweight voice conversion

Python 79 12 Updated Sep 2, 2024

Using joint training speaker encoder with consistency loss to achieve cross-lingual voice conversion and expressive voice conversion

Python 140 22 Updated Oct 16, 2023

vits2 backbone with multilingual-bert

Python 8,218 1,168 Updated Feb 3, 2025

Faster Whisper transcription with CTranslate2

Python 13,975 1,168 Updated Jan 1, 2025
Python 389 33 Updated Nov 6, 2023

App showcasing multiple real-time diffusion models pipelines with Diffusers

Python 887 104 Updated Jun 21, 2024

prompt2model - Generate Deployable Models from Natural Language Instructions

Python 1,980 178 Updated Dec 29, 2024

Resample audio in node or browser using a web assembly port of libsamplerate.

JavaScript 39 12 Updated Feb 7, 2025

リアルタイムボイスチェンジャー Realtime Voice Changer

Python 17,106 1,874 Updated Nov 14, 2024

ChatGPT plugin for Zotero

TypeScript 236 11 Updated Nov 18, 2024
Python 120 22 Updated Oct 18, 2024
Python 61 12 Updated Oct 3, 2023

44100Hz日本語音源に対応した MB-iSTFT-VITS: Lightweight and High-Fidelity End-to-End Text-to-Speech with Multi-Band Generation and Inverse Short-Time Fourier Transformです。

Python 36 3 Updated Jun 2, 2023

The official GitHub page for the survey paper "A Survey of Large Language Models".

Python 10,954 853 Updated Aug 20, 2024

Speech AI training and inference tools

Python 36 2 Updated Jun 25, 2023

The missing star history graph of GitHub repos - https://star-history.com

TypeScript 6,950 267 Updated Feb 5, 2025
Python 434 61 Updated Feb 7, 2025

Easily train a good VC model with voice data <= 10 mins!

Python 26,776 3,832 Updated Nov 24, 2024
Next