Skip to content
View markyouyuren's full-sized avatar

Block or report markyouyuren

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

InspireMusic: A Unified Framework for Music, Song, Audio Generation.

Python 947 85 Updated Mar 7, 2025

Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"

Python 10,170 1,388 Updated Feb 24, 2025

Training code for FAcodec presented in NaturalSpeech3

Python 195 22 Updated Aug 26, 2024

[ICLR 2025] SOTA discrete acoustic codec models with 40/75 tokens per second for audio language modeling

Python 1,044 75 Updated Mar 2, 2025

Text-to-Music Generation with Rectified Flow Transformers

Python 1,672 133 Updated Dec 10, 2024

AudioLDM training, finetuning, evaluation and inference.

Python 237 48 Updated Dec 13, 2024

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 41,976 4,682 Updated Mar 5, 2025

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…

Python 8,645 672 Updated Mar 3, 2025

Official code for NeurIPS2023 paper: CoDA: Collaborative Novel Box Discovery and Cross-modal Alignment for Open-vocabulary 3D Object Detection

Jupyter Notebook 196 17 Updated Jan 24, 2025

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 38,267 4,791 Updated Aug 16, 2024

💬 SpeechGPT is a web application that enables you to converse with ChatGPT.

TypeScript 2,761 393 Updated Oct 16, 2023

Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in Pytorch

Python 1,314 104 Updated Sep 24, 2023

PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Reproduced Demo https://lifeiteng.github.io/valle/index.html

Python 2,100 321 Updated Nov 14, 2023

An unofficial PyTorch implementation of the audio LM VALL-E

Python 2,987 417 Updated May 10, 2023

Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper.

Jupyter Notebook 135 12 Updated Aug 14, 2023

Singing Voice Synthesis based on VITS, different from VISinger

Python 188 31 Updated Nov 13, 2023

Official Implement of Multi-Stage Multi-Codebook (MSMC) TTS

Python 162 16 Updated Apr 10, 2024

Conditional Variational Auto-Encoder with Jointly Training FastSpeech2 and HiFi-GAN for End to End Text to Speech

Jupyter Notebook 22 6 Updated Aug 11, 2022

Singing Voice Speech modeling test

Python 35 10 Updated Aug 16, 2022

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

Python 7,203 1,315 Updated Dec 6, 2023

Avocodo: Generative Adversarial Network for Artifact-free Vocoder

Python 117 15 Updated Jul 14, 2022

A repository for benchmarking neural vocoders by their quality and speed.

Python 208 28 Updated Feb 26, 2025

Production First and Production Ready End-to-End Text-to-Speech Toolkit

Python 381 59 Updated May 30, 2024

Deep Performer: Score-to-audio music performance synthesis

SCSS 43 4 Updated Jun 26, 2023

DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code

Python 4,410 729 Updated May 2, 2023

Production First and Production Ready End-to-End Speech Recognition Toolkit

Python 4,362 1,113 Updated Feb 25, 2025

尝试使用神经网络生成音乐游戏Malody的谱面。

Jupyter Notebook 47 12 Updated Feb 19, 2020

C++ library for audio and music analysis, description and synthesis, including Python bindings

C++ 2,993 550 Updated Jan 29, 2025

Code for "Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks"

Python 2,610 614 Updated Jan 19, 2020
Next