Stars
This is the code for controllable EVC framework for seen and unseen emotion generation.
You can find the speech algorithms you want here
Official implementation of VQMIVC: One-shot (any-to-any) Voice Conversion @ Interspeech 2021 + Online playing demo!
2021年最新整理,5000道秋招/提前批/春招/常用面试题(含答案),包括leetcode,校招笔试题,面试题,算法题,语法题。
Free course for Resume, 整理和搜集网络免费的项目实战课程,包括 Java 项目实战,Python 项目实战,C++ 项目实战等
Implementation of "Perceptual Losses for Real-Time Style Transfer and Super-Resolution" in PyTorch
A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统
A pytorch based end2end speech recognition system.
End-to-End Automatic Speech Recognition on PyTorch
An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"
Implementation code of non-parallel sequence-to-sequence VC
This is the implementation of our Interspeech 2021 paper: Limited data emotional voice conversion leveraging text-to-speech: two-stage sequence-to-sequence training.
PyTorch implementation of a self-attentive speaker embedding
A collection of datasets for the purpose of emotion recognition/detection in speech.
Voice conversion training with 109 speakers with limited training samples
This is the official implementation of the paper AGAIN-VC: A One-shot Voice Conversion using Activation Guidance and Adaptive Instance Normalization.
A speech synthesis system with prosody embeddings
ERISHA is a mulitilingual multispeaker expressive speech synthesis framework. It can transfer the expressivity to the speaker's voice for which no expressive speech corpus is available.
A PyTorch implementation of Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis
Emotional Speech Conversion using Style Transfer and MUNIT
numpy、pandas数据分析基础知识!(灰常重要!!!)