Skip to content
View chenyi0818's full-sized avatar

Block or report chenyi0818

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A Survey of Spoken Dialogue Models (60 pages)

225 13 Updated Nov 28, 2024

🇨🇳 Chinese sticker pack,More joy / 表情包的博物馆, Github最有毒的仓库, 中国表情包大集合, 聚欢乐~

JavaScript 12,303 1,253 Updated Sep 29, 2024

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 8,871 856 Updated Dec 18, 2024

中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)

Python 18,576 1,874 Updated Apr 30, 2024

Next-generation TTS model using flow-matching and DiT, inspired by Stable Diffusion 3

Python 371 42 Updated Sep 13, 2024

Zero-Shot Speech Editing and Text-to-Speech in the Wild

Jupyter Notebook 7,779 759 Updated Jun 24, 2024

Repo for counting stars and contributing. Press F to pay respect to glorious developers.

270,052 21,117 Updated Oct 3, 2024

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 37,398 4,246 Updated Dec 19, 2024

MindSpore online courses: Step into LLM

Jupyter Notebook 438 102 Updated Nov 21, 2024

Llama中文社区,Llama3在线体验和微调模型已开放,实时汇总最新Llama3学习资料,已将所有代码更新适配Llama3,构建最好的中文Llama大模型,完全开源可商用

Python 14,257 1,272 Updated Sep 5, 2024

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

Jupyter Notebook 40,374 4,284 Updated Jul 28, 2024

AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head

Python 10,074 868 Updated Jul 6, 2024

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…

Python 7,965 600 Updated Dec 27, 2024

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Python 136,836 27,395 Updated Dec 27, 2024

Inference code for Llama models

Python 56,977 9,632 Updated Aug 18, 2024

喜马拉雅xm文件解密工具

Python 361 98 Updated May 20, 2024

BDDM: Bilateral Denoising Diffusion Models for Fast and High-Quality Speech Synthesis

Python 224 30 Updated Jul 13, 2022

This repository is an implementation of this article: https://arxiv.org/pdf/2107.03312.pdf

Python 361 52 Updated Apr 21, 2022

PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Reproduced Demo https://lifeiteng.github.io/valle/index.html

Python 2,071 323 Updated Nov 14, 2023

Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in Pytorch

Python 1,295 103 Updated Sep 24, 2023

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 36,021 4,170 Updated Dec 26, 2024

KAN-TTS is a speech-synthesis training framework, please try the demos we have posted at https://modelscope.cn/models?page=1&tasks=text-to-speech

Python 498 84 Updated Dec 28, 2023

A timeline of the latest AI models for audio generation, starting in 2023!

1,895 70 Updated Jan 4, 2024

High-Resolution Image Synthesis with Latent Diffusion Models

Jupyter Notebook 12,121 1,551 Updated Feb 29, 2024

🍀 Pytorch implementation of various Attention Mechanisms, MLP, Re-parameter, Convolution, which is helpful to further understand papers.⭐⭐⭐

Python 11,623 1,943 Updated Dec 6, 2024

An unofficial PyTorch implementation of the audio LM VALL-E

Python 2,976 419 Updated May 10, 2023

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python 20,480 2,575 Updated Dec 15, 2024

Official Implement of Multi-Stage Multi-Codebook (MSMC) TTS

Python 162 15 Updated Apr 10, 2024

A Collection of Variational Autoencoders (VAE) in PyTorch.

Python 6,813 1,080 Updated Jun 13, 2024
Next