mrexxie

🎯

Focusing

Rex Xie mrexxie

🎯

Focusing

3 followers · 34 following

Stars

tts

21 repositories

thewh1teagle / kokoro-onnx

TTS with kokoro and onnx runtime

Python 1,813 168 Updated Mar 1, 2025

farshed / sage

Self-hosted voice chat with LLMs

Rust 422 29 Updated Feb 28, 2025

Zyphra / Zonos

Zonos-v0.1 is a leading open-weight text-to-speech model trained on more than 200k hours of varied multilingual speech, delivering expressiveness and quality on par with—or even surpassing—top TTS …

Python 6,236 668 Updated Mar 5, 2025

jianchang512 / ChatTTS-ui

一个简单的本地网页界面，使用ChatTTS将文字合成为语音，同时支持对外提供API接口。A simple native web interface that uses ChatTTS to synthesize text into speech, along with support for external API interfaces.

Python 6,883 834 Updated Dec 9, 2024

index-tts / index-tts

An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System

Python 387 21 Updated Mar 28, 2025

chrischoy / WhisperChain

Speech to Text but with all the bells and whistles and most importantly AI! AI will clean up your filler words, edit and will refine what you said!

Python 283 14 Updated Feb 9, 2025

SesameAILabs / whisperX

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Python 45 7 Updated Oct 25, 2024

jianchang512 / fireredasr-ui

一个中文语音转文字项目，封装自FireRedASR

Python 38 9 Updated Feb 24, 2025

FireRedTeam / FireRedASR

Open-source industrial-grade ASR models supporting Mandarin, Chinese dialects and English, achieving a new SOTA on public Mandarin ASR benchmarks, while also offering outstanding singing lyrics rec…

Python 826 61 Updated Mar 27, 2025

FireRedTeam / FireRedTTS

An Open-Sourced LLM-empowered Foundation TTS System

Python 648 52 Updated Oct 17, 2024

SparkAudio / Spark-TTS

Spark-TTS Inference Code

Python 6,866 710 Updated Mar 21, 2025

zhaoyun0071 / FireRedTTS-windows-GUI

FireRedTTS解压即用，不用配置环境

5 1 Updated Oct 13, 2024

lenML / Speech-AI-Forge

🍦 Speech-AI-Forge is a project developed around TTS generation model, implementing an API Server and a Gradio-based WebUI.

Python 1,142 149 Updated Mar 26, 2025

open-mmlab / Amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…

Python 8,866 692 Updated Mar 3, 2025

SesameAILabs / csm

A Conversational Speech Generation Model

Python 11,862 992 Updated Mar 27, 2025

RVC-Boss / GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 43,264 4,816 Updated Mar 26, 2025

asiff00 / On-Device-Speech-to-Speech-Conversational-AI

This is an on-CPU real-time conversational system for two-way speech communication with AI models, utilizing a continuous streaming architecture for fluid conversations with immediate responses and…

Python 89 10 Updated Feb 16, 2025

canopyai / Orpheus-TTS

TTS Towards Human-Sounding Speech

Python 3,105 222 Updated Mar 27, 2025

FreedomIntelligence / Soundwave

The official Soundwave repository

Python 181 19 Updated Mar 16, 2025

metame-ai / awesome-audio-plaza

Daily tracking of awesome audio papers, including music generation, zero-shot tts, asr, audio generation

377 17 Updated Mar 26, 2025

kyutai-labs / moshivis

Kyutai with an "eye"

Python 157 21 Updated Mar 26, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly