Skip to content
View jiali-ms's full-sized avatar

Block or report jiali-ms

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A simple, performant re-implementation of AutoVC

Jupyter Notebook 21 4 Updated Jul 6, 2023

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

Python 5,205 511 Updated Feb 17, 2025

A PyTorch-based Speech Toolkit

Python 9,475 1,448 Updated Mar 10, 2025

PyTorch Implementation of Non-autoregressive Expressive (emotional, conversational) TTS based on FastSpeech2, supporting English, Korean, and your own languages.

Python 297 47 Updated Aug 25, 2021

PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supports both single-, multi-speaker TTS and several techniques …

Python 48 15 Updated Jul 31, 2023

Tacotron 2 - PyTorch implementation with faster-than-realtime inference

Jupyter Notebook 51 6 Updated Nov 1, 2019

AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss

Python 1,039 211 Updated Oct 23, 2024

Flowtron is an auto-regressive flow-based generative network for text to speech synthesis with control over speech variation and style transfer

Jupyter Notebook 897 176 Updated Jul 6, 2023

Implementation of Generating Diverse High-Fidelity Images with VQ-VAE-2 in PyTorch

Python 1,694 277 Updated Feb 15, 2023

Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.

Python 29,978 3,747 Updated Aug 6, 2024

CMU multilingual speech repository

Python 31 2 Updated Apr 15, 2022
Roff 14 Updated Jun 10, 2021

Mellotron: a multispeaker voice synthesis model based on Tacotron 2 GST that can make a voice emote and sing without emotive or singing training data

Jupyter Notebook 859 183 Updated Jul 22, 2023

Mellotron: a multispeaker voice synthesis model based on Tacotron 2 GST that can make a voice emote and sing without emotive or singing training data

Jupyter Notebook 6 4 Updated Nov 15, 2020

An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"

Python 1,967 558 Updated Oct 27, 2023

Graph4nlp is the library for the easy use of Graph Neural Networks for NLP. Welcome to visit our DLG4NLP website (https://dlg4nlp.github.io/index.html) for various learning resources!

Python 1,679 203 Updated Jun 24, 2024

Allosaurus is a pretrained universal phone recognizer for more than 2000 languages

Python 608 89 Updated Apr 26, 2024

中文语言理解测评基准 Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard

Python 4,080 545 Updated May 23, 2024

g2pC: A Context-aware Grapheme-to-Phoneme Conversion module for Chinese

Python 239 31 Updated Jul 10, 2019

A set of tools to use in Microsoft Azure Form Recognizer and OCR services.

TypeScript 526 173 Updated Sep 4, 2024

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Python 31,122 6,489 Updated Jan 9, 2025

LaNMT: Latent-variable Non-autoregressive Neural Machine Translation with Deterministic Inference

Python 79 4 Updated Aug 27, 2021

End-to-End Speech Processing Toolkit

Python 8,853 2,226 Updated Mar 10, 2025

JP puncuator

Python 6 3 Updated Jun 20, 2019

VS Code in the browser

TypeScript 70,183 5,803 Updated Mar 10, 2025

Microsoft.Recognizers.Text provides recognition and resolution of numbers, units, date/time, etc. in multiple languages (ZH, EN, FR, ES, PT, DE, IT, TR, HI, NL. Partial support for JA, KO, AR, SV).…

C# 1,701 435 Updated Feb 19, 2025

A Python wrapper for Kaldi

Python 1,009 246 Updated Jan 23, 2025

A header-only C++ library for deep neural networks

C++ 406 94 Updated Apr 16, 2021

Speech Recognition using DeepSpeech2.

Python 2,116 620 Updated Dec 13, 2022

汉字转拼音(pypinyin)

Python 4,994 622 Updated Jan 3, 2025
Next