-
Tsinghua University
- Beijing
-
13:14
(UTC +08:00) - @Amy31784799
Starred repositories
MineStudio: A Streamlined Package for Minecraft AI Agent Development
Fast implementation of the edit distance(Levenshtein distance)
Computes the Mel-Cepstral Distance of two WAV files based on the paper "Mel-Cepstral Distance Measure for Objective Speech Quality Assessment" by Robert F. Kubichek.
😝 TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, German and Easy to adapt for other languages)
A multi-voice TTS system trained with an emphasis on quality
💻 🤖 A summary on our attempts at using Deep Learning approaches for Emotional Text to Speech 🔈
Python implementation of performance metrics in Loizou's Speech Enhancement book
This is an open-source implementation of the ITU P.808 standard for "Subjective evaluation of speech quality with a crowdsourcing approach" (see https://www.itu.int/rec/T-REC-P.808/en). It uses Ama…
Code for DeCoAR (ICASSP 2020) and BERTphone (Odyssey 2020)
VQ-VAE for Acoustic Unit Discovery and Voice Conversion
Vector-Quantized Contrastive Predictive Coding for Acoustic Unit Discovery and Voice Conversion
Official implementation of VQMIVC: One-shot (any-to-any) Voice Conversion @ Interspeech 2021 + Online playing demo!
Implementation of "Duration Informed Attention Network for Multimodal Synthesis" paper in PyTorch.
Python interface to the WebRTC Voice Activity Detector
A Python wrapper for the high-quality vocoder "World"
Voice Conversion pipeline consisting of GE2E speaker encoder, AutoVC conversion model and MelGAN vocoder.
Speech Toolkit for Malaysian language, https://malaya-speech.readthedocs.io/
Code and slides of my YouTube series called "Audio Signal Proessing for Machine Learning"
A quickstart and benchmark for pytorch distributed training.
You can find the speech algorithms you want here
Variational Autoencoder in the mel-spectrogram domain for one-shot audio synthesis