Skip to content
View mrexxie's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report mrexxie

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Stars

tts

21 repositories

TTS with kokoro and onnx runtime

Python 1,813 168 Updated Mar 1, 2025

Self-hosted voice chat with LLMs

Rust 422 29 Updated Feb 28, 2025

Zonos-v0.1 is a leading open-weight text-to-speech model trained on more than 200k hours of varied multilingual speech, delivering expressiveness and quality on par with—or even surpassing—top TTS …

Python 6,236 668 Updated Mar 5, 2025

一个简单的本地网页界面,使用ChatTTS将文字合成为语音,同时支持对外提供API接口。A simple native web interface that uses ChatTTS to synthesize text into speech, along with support for external API interfaces.

Python 6,883 834 Updated Dec 9, 2024

An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System

Python 387 21 Updated Mar 28, 2025

Speech to Text but with all the bells and whistles and most importantly AI! AI will clean up your filler words, edit and will refine what you said!

Python 283 14 Updated Feb 9, 2025

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Python 45 7 Updated Oct 25, 2024

一个中文语音转文字项目,封装自FireRedASR

Python 38 9 Updated Feb 24, 2025

Open-source industrial-grade ASR models supporting Mandarin, Chinese dialects and English, achieving a new SOTA on public Mandarin ASR benchmarks, while also offering outstanding singing lyrics rec…

Python 826 61 Updated Mar 27, 2025

An Open-Sourced LLM-empowered Foundation TTS System

Python 648 52 Updated Oct 17, 2024

Spark-TTS Inference Code

Python 6,866 710 Updated Mar 21, 2025

FireRedTTS解压即用,不用配置环境

5 1 Updated Oct 13, 2024

🍦 Speech-AI-Forge is a project developed around TTS generation model, implementing an API Server and a Gradio-based WebUI.

Python 1,142 149 Updated Mar 26, 2025

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…

Python 8,866 692 Updated Mar 3, 2025

A Conversational Speech Generation Model

Python 11,862 992 Updated Mar 27, 2025

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 43,264 4,816 Updated Mar 26, 2025

This is an on-CPU real-time conversational system for two-way speech communication with AI models, utilizing a continuous streaming architecture for fluid conversations with immediate responses and…

Python 89 10 Updated Feb 16, 2025

TTS Towards Human-Sounding Speech

Python 3,105 222 Updated Mar 27, 2025

The official Soundwave repository

Python 181 19 Updated Mar 16, 2025

Daily tracking of awesome audio papers, including music generation, zero-shot tts, asr, audio generation

377 17 Updated Mar 26, 2025

Kyutai with an "eye"

Python 157 21 Updated Mar 26, 2025