inconnu11

Follow

🥝

digging

inconnu11

🥝

digging

Follow

TTS, Voice conversion(VC), Speech representation learning

175 followers · 1.2k following

Tsinghua University
Beijing
13:14 (UTC +08:00)
@Amy31784799

Achievements

Achievements

Organizations

Starred repositories

BytedanceSpeech / seed-tts-eval

Python 1,173 111 Updated Jun 14, 2024

CraftJarvis / MineStudio

MineStudio: A Streamlined Package for Minecraft AI Agent Development

Python 190 8 Updated Feb 25, 2025

state-spaces / mamba

Mamba SSM architecture

Python 14,073 1,227 Updated Jan 18, 2025

roy-ht / editdistance

Fast implementation of the edit distance(Levenshtein distance)

C++ 677 63 Updated Feb 16, 2024

Montinger / Transformer-Workbench

Playground for Transformers

Python 48 16 Updated Dec 16, 2023

rhasspy / piper

A fast, local neural text to speech system

C++ 7,980 592 Updated Oct 21, 2024

jasminsternkopf / mel_cepstral_distance

Computes the Mel-Cepstral Distance of two WAV files based on the paper "Mel-Cepstral Distance Measure for Objective Speech Quality Assessment" by Robert F. Kubichek.

Python 51 10 Updated Dec 11, 2024

TensorSpeech / TensorFlowTTS

😝 TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, German and Easy to adapt for other languages)

Python 3,898 815 Updated Jul 5, 2024

neonbjb / tortoise-tts

A multi-voice TTS system trained with an emphasis on quality

Jupyter Notebook 13,739 1,908 Updated Nov 19, 2024

Emotional-Text-to-Speech / dl-for-emo-tts

💻 🤖 A summary on our attempts at using Deep Learning approaches for Emotional Text to Speech 🔈

Jupyter Notebook 442 45 Updated Jun 26, 2024

schmiph2 / pysepm

Python implementation of performance metrics in Loizou's Speech Enhancement book

Python 405 89 Updated Feb 15, 2025

microsoft / P.808

This is an open-source implementation of the ITU P.808 standard for "Subjective evaluation of speech quality with a crowdsourcing approach" (see https://www.itu.int/rec/T-REC-P.808/en). It uses Ama…

HTML 212 58 Updated May 23, 2024

YannickJadoul / Parselmouth

Praat in Python, the Pythonic way

C++ 1,096 119 Updated Feb 12, 2025

awslabs / speech-representations

Code for DeCoAR (ICASSP 2020) and BERTphone (Odyssey 2020)

Python 103 14 Updated Nov 26, 2022

lmcinnes / umap

Uniform Manifold Approximation and Projection

Python 7,646 821 Updated Nov 29, 2024

bshall / ZeroSpeech

VQ-VAE for Acoustic Unit Discovery and Voice Conversion

Python 333 45 Updated Jul 6, 2023

bshall / VectorQuantizedCPC

Vector-Quantized Contrastive Predictive Coding for Acoustic Unit Discovery and Voice Conversion

Python 142 23 Updated Sep 1, 2020

Wendison / VQMIVC

Official implementation of VQMIVC: One-shot (any-to-any) Voice Conversion @ Interspeech 2021 + Online playing demo!

Jupyter Notebook 344 56 Updated Apr 27, 2022

ivanvovk / durian-pytorch

Implementation of "Duration Informed Attention Network for Multimodal Synthesis" paper in PyTorch.

Python 183 48 Updated Aug 12, 2020

wiseman / py-webrtcvad

Python interface to the WebRTC Voice Activity Detector

C 2,149 410 Updated Jul 4, 2024

JeremyCCHsu / Python-Wrapper-for-World-Vocoder

A Python wrapper for the high-quality vocoder "World"

Cython 741 121 Updated Jan 21, 2025

google / REAPER

C++ 397 93 Updated Nov 30, 2021

himajin2045 / voice-conversion

Voice Conversion pipeline consisting of GE2E speaker encoder, AutoVC conversion model and MelGAN vocoder.

Python 23 3 Updated Jan 24, 2021

mesolitica / malaya-speech

Speech Toolkit for Malaysian language, https://malaya-speech.readthedocs.io/

Jupyter Notebook 246 43 Updated Feb 3, 2025

musikalkemist / AudioSignalProcessingForML

Code and slides of my YouTube series called "Audio Signal Proessing for Machine Learning"

Jupyter Notebook 1,166 402 Updated Oct 31, 2020

summanlp / textrank

TextRank implementation for Python 3.

Python 1,253 259 Updated Mar 28, 2023

liusongxiang / ppg-vc

PPG-Based Voice Conversion

Python 333 72 Updated Jul 22, 2022

tczhangzhi / pytorch-distributed

A quickstart and benchmark for pytorch distributed training.

Python 1,653 297 Updated Jul 25, 2024

Ryuk17 / SpeechAlgorithms

You can find the speech algorithms you want here

C 779 246 Updated Jan 1, 2025

moiseshorta / MelSpecVAE

Variational Autoencoder in the mel-spectrogram domain for one-shot audio synthesis

Jupyter Notebook 132 17 Updated Dec 12, 2021

Starred topics

wavenet