aixingxy

xingxy aixingxy

TTS

9 followers · 128 following

Starred repositories

WangHelin1997 / SSR-Speech

SSR-Speech: Towards Stable, Safe and Robust Zero-shot Speech Editing and Synthesis

Python 75 9 Updated Sep 24, 2024

yl4579 / StyleTTS-ZS

StyleTTS-ZS: Efficient High-Quality Zero-Shot Text-to-Speech Synthesis with Distilled Time-Varying Style Diffusion

141 7 Updated Sep 27, 2024

FireRedTeam / FireRedTTS

An Open-Sourced LLM-empowered Foundation TTS System

Python 233 13 Updated Sep 25, 2024

PolyAI-LDN / pheme

Python 248 23 Updated Mar 15, 2024

fishaudio / Bert-VITS2

vits2 backbone with multilingual-bert

Python 7,869 1,117 Updated Oct 7, 2024

balisujohn / tortoise.cpp

A ggml (C++) re-implementation of tortoise-tts

C++ 153 14 Updated Aug 20, 2024

ZillaRU / ChatTTS-ONNX

Forked from 2noise/ChatTTS

ChatTTS is a generative speech model for daily dialogue.

Python 11 1 Updated Sep 2, 2024

DigitalPhonetics / IMS-Toucan

Controllable and fast Text-to-Speech for over 7000 languages!

Python 1,410 158 Updated Oct 7, 2024

lucidrains / e2-tts-pytorch

Implementation of E2-TTS, "Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS", in Pytorch

Python 253 23 Updated Oct 6, 2024

shivammehta25 / Matcha-TTS

[ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching

Jupyter Notebook 653 81 Updated Oct 8, 2024

MaxMax2016 / Grad-TTS-Chinese

Huawei Grad-TTS for Chinese

Python 43 3 Updated Sep 26, 2023

Picovoice / tts-latency-benchmark

Text-to-Speech Latency Benchmark

Python 3 Updated Aug 22, 2024

ga642381 / speech-trident

Awesome speech/audio LLMs, representation learning, and codec models

626 28 Updated Sep 24, 2024

aask1357 / hilcodec

High fidelity, lightweight, end-to-end, streaming, convolution-based neural audio codec

Jupyter Notebook 66 6 Updated May 23, 2024

2noise / ChatTTS

A generative speech model for daily dialogue.

Python 31,285 3,388 Updated Sep 21, 2024

huggingface / parler-tts

Inference and training library for high-quality TTS models.

Python 4,323 437 Updated Sep 23, 2024

appium / appium

Cross-platform automation framework for all kinds of apps, built on top of the W3C WebDriver protocol

JavaScript 18,752 6,067 Updated Oct 8, 2024

pengzhendong / streaming-tts-webui

Streaming Text to Speech Web UI

HTML 13 2 Updated May 6, 2024

jurihock / stftPitchShift

STFT based real-time pitch and timbre shifting in C++ and Python

C 120 14 Updated Apr 1, 2024

chenzomi12 / AISystem

AISystem 主要是指AI系统，包括AI芯片、AI编译器、AI推理和训练框架等AI全栈底层技术

Jupyter Notebook 10,691 1,540 Updated Sep 29, 2024

uthree / tinyvc

a lightweight voice conversion

Python 78 11 Updated Sep 2, 2024

habla-liaa / encodecmae

Codebase for the paper 'EncodecMAE: Leveraging neural codecs for universal audio representation learning'

Python 83 4 Updated Jul 24, 2024

yuanlehome / Hackathon

2 Updated Sep 22, 2023

v3ucn / Bert-vits2-V2.3

Bert-vits2-V2.3 训练和推理

Python 43 17 Updated Mar 13, 2024

KoljaB / RealtimeTTS

Converts text to speech in realtime

Python 1,818 168 Updated Oct 7, 2024

Jackiexiao / tts-frontend-dataset

TTS FrontEnd DataSet: Polyphone / Prosody / TextNormalization

Python 80 15 Updated Feb 5, 2024

leohuang2013 / pyannote-audio_overlapped-speech-detection_cpp

C++ version of pyannote audio overlapped speech detection pipeline

Python 7 1 Updated Feb 14, 2024

apachecn / apachecn-dl-zh

ApacheCN 深度学习译文集

JavaScript 786 197 Updated Mar 28, 2023

yxlu-0102 / MP-SENet

Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement

Python 293 44 Updated Sep 13, 2024

Meroser / IPTV

深度定制属于自己的EPG节目预告、高清台标

3,880 568 Updated Sep 13, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly