inconnu11
🥝
digging
  • Tsinghua University
  • Beijing
  • X @Amy31784799

Organizations

@thuhcsi

Starred repositories

MineStudio: A Streamlined Package for Minecraft AI Agent Development

Python 190 8 Updated Feb 25, 2025

Mamba SSM architecture

Python 14,073 1,227 Updated Jan 18, 2025

Fast implementation of the edit distance (Levenshtein distance)

C++ 677 63 Updated Feb 16, 2024

Playground for Transformers

Python 48 16 Updated Dec 16, 2023

A fast, local neural text to speech system

C++ 7,980 592 Updated Oct 21, 2024

Computes the Mel-Cepstral Distance of two WAV files based on the paper "Mel-Cepstral Distance Measure for Objective Speech Quality Assessment" by Robert F. Kubichek.

Python 51 10 Updated Dec 11, 2024

😝 TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for TensorFlow 2 (supports English, French, Korean, Chinese, and German, and is easy to adapt to other languages)

Python 3,898 815 Updated Jul 5, 2024

A multi-voice TTS system trained with an emphasis on quality

Jupyter Notebook 13,739 1,908 Updated Nov 19, 2024

💻 🤖 A summary on our attempts at using Deep Learning approaches for Emotional Text to Speech 🔈

Jupyter Notebook 442 45 Updated Jun 26, 2024

Python implementation of performance metrics in Loizou's Speech Enhancement book

Python 405 89 Updated Feb 15, 2025

This is an open-source implementation of the ITU P.808 standard for "Subjective evaluation of speech quality with a crowdsourcing approach" (see https://www.itu.int/rec/T-REC-P.808/en). It uses Ama…

HTML 212 58 Updated May 23, 2024

Praat in Python, the Pythonic way

C++ 1,096 119 Updated Feb 12, 2025

Code for DeCoAR (ICASSP 2020) and BERTphone (Odyssey 2020)

Python 103 14 Updated Nov 26, 2022

Uniform Manifold Approximation and Projection

Python 7,646 821 Updated Nov 29, 2024

VQ-VAE for Acoustic Unit Discovery and Voice Conversion

Python 333 45 Updated Jul 6, 2023

Vector-Quantized Contrastive Predictive Coding for Acoustic Unit Discovery and Voice Conversion

Python 142 23 Updated Sep 1, 2020

Official implementation of VQMIVC: One-shot (any-to-any) Voice Conversion @ Interspeech 2021 + Online playing demo!

Jupyter Notebook 344 56 Updated Apr 27, 2022

Implementation of "Duration Informed Attention Network for Multimodal Synthesis" paper in PyTorch.

Python 183 48 Updated Aug 12, 2020

Python interface to the WebRTC Voice Activity Detector

C 2,149 410 Updated Jul 4, 2024

A Python wrapper for the high-quality vocoder "World"

Cython 741 121 Updated Jan 21, 2025

C++ 397 93 Updated Nov 30, 2021

Voice Conversion pipeline consisting of GE2E speaker encoder, AutoVC conversion model and MelGAN vocoder.

Python 23 3 Updated Jan 24, 2021

Speech Toolkit for Malaysian language, https://malaya-speech.readthedocs.io/

Jupyter Notebook 246 43 Updated Feb 3, 2025

Code and slides of my YouTube series called "Audio Signal Processing for Machine Learning"

Jupyter Notebook 1,166 402 Updated Oct 31, 2020

TextRank implementation for Python 3.

Python 1,253 259 Updated Mar 28, 2023

PPG-Based Voice Conversion

Python 333 72 Updated Jul 22, 2022

A quickstart and benchmark for PyTorch distributed training.

Python 1,653 297 Updated Jul 25, 2024

You can find the speech algorithms you want here

C 779 246 Updated Jan 1, 2025

Variational Autoencoder in the mel-spectrogram domain for one-shot audio synthesis

Jupyter Notebook 132 17 Updated Dec 12, 2021