Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…

Python 7,965 600 Updated Dec 27, 2024

huggingface / transformers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Python 136,836 27,395 Updated Dec 27, 2024

meta-llama / llama

Inference code for Llama models

Python 56,977 9,632 Updated Aug 18, 2024

Diaoxiaozhang / Ximalaya-XM-Decrypt

喜马拉雅xm文件解密工具

Python 361 98 Updated May 20, 2024

tencent-ailab / bddm

BDDM: Bilateral Denoising Diffusion Models for Fast and High-Quality Speech Synthesis

Python 224 30 Updated Jul 13, 2022

wesbz / SoundStream

This repository is an implementation of this article: https://arxiv.org/pdf/2107.03312.pdf

Python 361 52 Updated Apr 21, 2022

lifeiteng / vall-e

PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Reproduced Demo https://lifeiteng.github.io/valle/index.html

Python 2,071 323 Updated Nov 14, 2023

lucidrains / naturalspeech2-pytorch

Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in Pytorch

Python 1,295 103 Updated Sep 24, 2023

microsoft / DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 36,021 4,170 Updated Dec 26, 2024

modelscope / KAN-TTS

KAN-TTS is a speech-synthesis training framework, please try the demos we have posted at https://modelscope.cn/models?page=1&tasks=text-to-speech

Python 498 84 Updated Dec 28, 2023

archinetai / audio-ai-timeline

A timeline of the latest AI models for audio generation, starting in 2023!

1,895 70 Updated Jan 4, 2024

CompVis / latent-diffusion

High-Resolution Image Synthesis with Latent Diffusion Models

Jupyter Notebook 12,121 1,551 Updated Feb 29, 2024

xmu-xiaoma666 / External-Attention-pytorch

🍀 Pytorch implementation of various Attention Mechanisms, MLP, Re-parameter, Convolution, which is helpful to further understand papers.⭐⭐⭐

Python 11,623 1,943 Updated Dec 6, 2024

enhuiz / vall-e

An unofficial PyTorch implementation of the audio LM VALL-E

Python 2,976 419 Updated May 10, 2023

microsoft / unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python 20,480 2,575 Updated Dec 15, 2024

hhguo / MSMC-TTS

Official Implement of Multi-Stage Multi-Codebook (MSMC) TTS

Python 162 15 Updated Apr 10, 2024

AntixK / PyTorch-VAE

A Collection of Variational Autoencoders (VAE) in PyTorch.

Python 6,813 1,080 Updated Jun 13, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

chenyi0818

Block or report chenyi0818

Stars

jishengpeng / WavChat

zhaoolee / ChineseBQB

FunAudioLLM / CosyVoice

ymcui / Chinese-LLaMA-Alpaca

KdaiP / StableTTS

jasonppy / VoiceCraft

996icu / 996.ICU

RVC-Boss / GPT-SoVITS

mindspore-courses / step_into_llm

LlamaFamily / Llama-Chinese

AmadeusChan / Awesome-LLM-System-Papers

mlabonne / llm-course

AIGC-Audio / AudioGPT

open-mmlab / Amphion