jiali-ms

Jerry Yao jiali-ms

AI, LM, SR

17 followers · 11 following

Tokyo
nlpfun.com

Achievements

Stars

RF5 / simple-autovc

A simple, performant re-implementation of AutoVC

Jupyter Notebook 21 4 Updated Jul 6, 2023

snakers4 / silero-vad

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

Python 5,205 511 Updated Feb 17, 2025

speechbrain / speechbrain

A PyTorch-based Speech Toolkit

Python 9,475 1,448 Updated Mar 10, 2025

keonlee9420 / Expressive-FastSpeech2

PyTorch Implementation of Non-autoregressive Expressive (emotional, conversational) TTS based on FastSpeech2, supporting English, Korean, and your own languages.

Python 297 47 Updated Aug 25, 2021

keonlee9420 / Comprehensive-Tacotron2

PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supports both single-, multi-speaker TTS and several techniques …

Python 48 15 Updated Jul 31, 2023

bfs18 / tacotron2

Forked from NVIDIA/tacotron2

Tacotron 2 - PyTorch implementation with faster-than-realtime inference

Jupyter Notebook 51 6 Updated Nov 1, 2019

auspicious3000 / autovc

AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss

Python 1,039 211 Updated Oct 23, 2024

NVIDIA / flowtron

Flowtron is an auto-regressive flow-based generative network for text to speech synthesis with control over speech variation and style transfer

Jupyter Notebook 897 176 Updated Jul 6, 2023

rosinality / vq-vae-2-pytorch

Implementation of Generating Diverse High-Fidelity Images with VQ-VAE-2 in PyTorch

Python 1,694 277 Updated Feb 15, 2023

xinntao / Real-ESRGAN

Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.

Python 29,978 3,747 Updated Aug 6, 2024

wavlab-speech / cmu_multilingual_speech

CMU multilingual speech repository

Python 31 2 Updated Apr 15, 2022

creotiv / RussianTTS-Tacotron2

Roff 14 Updated Jun 10, 2021

NVIDIA / mellotron

Mellotron: a multispeaker voice synthesis model based on Tacotron 2 GST that can make a voice emote and sing without emotive or singing training data

Jupyter Notebook 859 183 Updated Jul 22, 2023

taneliang / gst-tacotron2

Forked from NVIDIA/mellotron

Mellotron: a multispeaker voice synthesis model based on Tacotron 2 GST that can make a voice emote and sing without emotive or singing training data

Jupyter Notebook 6 4 Updated Nov 15, 2020

ming024 / FastSpeech2

An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"

Python 1,967 558 Updated Oct 27, 2023

graph4ai / graph4nlp

Graph4nlp is the library for the easy use of Graph Neural Networks for NLP. Welcome to visit our DLG4NLP website (https://dlg4nlp.github.io/index.html) for various learning resources!

Python 1,679 203 Updated Jun 24, 2024

xinjli / allosaurus

Allosaurus is a pretrained universal phone recognizer for more than 2000 languages

Python 608 89 Updated Apr 26, 2024

CLUEbenchmark / CLUE

中文语言理解测评基准 Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard

Python 4,080 545 Updated May 23, 2024

Kyubyong / g2pC

g2pC: A Context-aware Grapheme-to-Phoneme Conversion module for Chinese

Python 239 31 Updated Jul 10, 2019

microsoft / OCR-Form-Tools

A set of tools to use in Microsoft Azure Form Recognizer and OCR services.

TypeScript 526 173 Updated Sep 4, 2024

facebookresearch / fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Python 31,122 6,489 Updated Jan 9, 2025

zomux / lanmt

LaNMT: Latent-variable Non-autoregressive Neural Machine Translation with Deterministic Inference

Python 79 4 Updated Aug 27, 2021

espnet / espnet

End-to-End Speech Processing Toolkit

Python 8,853 2,226 Updated Mar 10, 2025

jiali-ms / punctuator

JP puncuator

Python 6 3 Updated Jun 20, 2019

coder / code-server

VS Code in the browser

TypeScript 70,183 5,803 Updated Mar 10, 2025

microsoft / Recognizers-Text

Microsoft.Recognizers.Text provides recognition and resolution of numbers, units, date/time, etc. in multiple languages (ZH, EN, FR, ES, PT, DE, IT, TR, HI, NL. Partial support for JA, KO, AR, SV).…

C# 1,701 435 Updated Feb 19, 2025

pykaldi / pykaldi

A Python wrapper for Kaldi

Python 1,009 246 Updated Jan 23, 2025

yixuan / MiniDNN

A header-only C++ library for deep neural networks

C++ 406 94 Updated Apr 16, 2021

SeanNaren / deepspeech.pytorch

Speech Recognition using DeepSpeech2.

Python 2,116 620 Updated Dec 13, 2022

mozillazg / python-pinyin

汉字转拼音(pypinyin)

Python 4,994 622 Updated Jan 3, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Jerry Yao jiali-ms

Achievements

Achievements

Block or report jiali-ms

Stars

RF5 / simple-autovc

snakers4 / silero-vad

speechbrain / speechbrain

keonlee9420 / Expressive-FastSpeech2

keonlee9420 / Comprehensive-Tacotron2

bfs18 / tacotron2

auspicious3000 / autovc

NVIDIA / flowtron

rosinality / vq-vae-2-pytorch

xinntao / Real-ESRGAN

wavlab-speech / cmu_multilingual_speech

creotiv / RussianTTS-Tacotron2

NVIDIA / mellotron

taneliang / gst-tacotron2

ming024 / FastSpeech2

graph4ai / graph4nlp

xinjli / allosaurus

CLUEbenchmark / CLUE

Kyubyong / g2pC

microsoft / OCR-Form-Tools

facebookresearch / fairseq

zomux / lanmt

espnet / espnet

jiali-ms / punctuator

coder / code-server

microsoft / Recognizers-Text

pykaldi / pykaldi

yixuan / MiniDNN

SeanNaren / deepspeech.pytorch

mozillazg / python-pinyin