Lists (1)
Sort Name ascending (A-Z)
Stars
Implementation of E2-TTS, "Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS", in Pytorch
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io/vallex/
A generative speech model for daily dialogue.
Foundational model for human-like, expressive TTS
Text Normalization & Inverse Text Normalization
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
Using joint training speaker encoder with consistency loss to achieve cross-lingual voice conversion and expressive voice conversion
So-VITS-SVC 本地部署/训练/推理/使用帮助文档 So-VITS-SVC Local Deployment/Training/Inference/Usage Help Document
Soft speech units for voice conversion
An unofficial implementation of the combination of Soft-VC and VITS
SoftVC VITS Singing Voice Conversion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…
Fine-tune the Whisper speech recognition model to support training without timestamp data, training with timestamp data, and training without speech data. Accelerate inference and support Web deplo…
Llama中文社区,Llama3在线体验和微调模型已开放,实时汇总最新Llama3学习资料,已将所有代码更新适配Llama3,构建最好的中文Llama大模型,完全开源可商用
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
100+ Chinese Word Vectors 上百种预训练中文词向量
An optimized deep prompt tuning strategy comparable to fine-tuning across scales and tasks
Langchain-Chatchat(原Langchain-ChatGLM)基于 Langchain 与 ChatGLM, Qwen 与 Llama 等语言模型的 RAG 与 Agent 应用 | Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge based LLM (like ChatGLM, Qwen and…
ChatGPT 中文指南🔥,ChatGPT 中文调教指南,指令指南,应用开发指南,精选资源清单,更好的使用 chatGPT 让你的生产力 up up up! 🚀
🚀AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time
A unified framework for privacy-preserving data analysis and machine learning