Highlights
- Pro
Stars
A Python wrapper for the high-quality vocoder "World"
Godot Engine – Multi-platform 2D and 3D game engine
[InterSpeech 24] FreeV: Free Lunch For Vocoders Through Pseudo Inversed Mel Filter
Self-supervised learning for fast pitch estimation
A python package to analyze and compare voices with deep learning
Implementation of vocoders empowered with pytorch lightning
UT-Sarulab MOS prediction system using SSL models
Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis
Using joint training speaker encoder with consistency loss to achieve cross-lingual voice conversion and expressive voice conversion
A high-quality speech analysis, manipulation and synthesis system
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
Simplified implementation of the RVC (Retrieval-based Voice Conversion) evaluation for easy integration into other projects. Removes unnecessary features and provides a sample CLI for real-time con…
A multi-speaker, multilingual speech generation tool
Easily train a good VC model with voice data <= 10 mins!
Real-time end-to-end singing voice conversion system based on DDSP (Differentiable Digital Signal Processing)
Official implementation of SawSing (ISMIR'22)
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
Official repository for the paper "Chunked Autoregressive GAN for Conditional Waveform Synthesis"
AIを使ったリアルタイムボイスチェンジャー(Trainer)
iSTFTNet : Fast and Lightweight Mel-spectrogram Vocoder Incorporating Inverse Short-time Fourier Transform
QuickVC: Any-to-many Voice Conversion Using Inverse Short-time Fourier Transform for Faster Conversion
Experiments from the paper "Sinusoidal Frequency Estimation by Gradient Descent"