Skip to content
View aixingxy's full-sized avatar

Block or report aixingxy

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

SSR-Speech: Towards Stable, Safe and Robust Zero-shot Speech Editing and Synthesis

Python 75 9 Updated Sep 24, 2024

StyleTTS-ZS: Efficient High-Quality Zero-Shot Text-to-Speech Synthesis with Distilled Time-Varying Style Diffusion

141 7 Updated Sep 27, 2024

An Open-Sourced LLM-empowered Foundation TTS System

Python 233 13 Updated Sep 25, 2024
Python 248 23 Updated Mar 15, 2024

vits2 backbone with multilingual-bert

Python 7,869 1,117 Updated Oct 7, 2024

A ggml (C++) re-implementation of tortoise-tts

C++ 153 14 Updated Aug 20, 2024

ChatTTS is a generative speech model for daily dialogue.

Python 11 1 Updated Sep 2, 2024

Controllable and fast Text-to-Speech for over 7000 languages!

Python 1,410 158 Updated Oct 7, 2024

Implementation of E2-TTS, "Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS", in Pytorch

Python 253 23 Updated Oct 6, 2024

[ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching

Jupyter Notebook 653 81 Updated Oct 8, 2024

Huawei Grad-TTS for Chinese

Python 43 3 Updated Sep 26, 2023

Text-to-Speech Latency Benchmark

Python 3 Updated Aug 22, 2024

Awesome speech/audio LLMs, representation learning, and codec models

626 28 Updated Sep 24, 2024

High fidelity, lightweight, end-to-end, streaming, convolution-based neural audio codec

Jupyter Notebook 66 6 Updated May 23, 2024

A generative speech model for daily dialogue.

Python 31,285 3,388 Updated Sep 21, 2024

Inference and training library for high-quality TTS models.

Python 4,323 437 Updated Sep 23, 2024

Cross-platform automation framework for all kinds of apps, built on top of the W3C WebDriver protocol

JavaScript 18,752 6,067 Updated Oct 8, 2024

Streaming Text to Speech Web UI

HTML 13 2 Updated May 6, 2024

STFT based real-time pitch and timbre shifting in C++ and Python

C 120 14 Updated Apr 1, 2024

AISystem 主要是指AI系统,包括AI芯片、AI编译器、AI推理和训练框架等AI全栈底层技术

Jupyter Notebook 10,691 1,540 Updated Sep 29, 2024

a lightweight voice conversion

Python 78 11 Updated Sep 2, 2024

Codebase for the paper 'EncodecMAE: Leveraging neural codecs for universal audio representation learning'

Python 83 4 Updated Jul 24, 2024
2 Updated Sep 22, 2023

Bert-vits2-V2.3 训练和推理

Python 43 17 Updated Mar 13, 2024

Converts text to speech in realtime

Python 1,818 168 Updated Oct 7, 2024

TTS FrontEnd DataSet: Polyphone / Prosody / TextNormalization

Python 80 15 Updated Feb 5, 2024

C++ version of pyannote audio overlapped speech detection pipeline

Python 7 1 Updated Feb 14, 2024

ApacheCN 深度学习译文集

JavaScript 786 197 Updated Mar 28, 2023

Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement

Python 293 44 Updated Sep 13, 2024

深度定制属于自己的EPG节目预告、高清台标

3,880 568 Updated Sep 13, 2024
Next