Stars
A lightweight version of the MCP SDK, inspired by the official Model Context Protocol SDK; currently focuses on SSE only, with no stdio support.
[RecurrentNN × Regression × Regularized]-based Mouth Opening Estimation via SSL (Semi-supervised Learning).
Multispeaker Community Vocoder Model for DiffSinger
A Non-Autoregressive Text-to-Speech (NAR-TTS) framework, including official PyTorch implementation of PortaSpeech (NeurIPS 2021) and DiffSpeech (AAAI 2022)
so-vits-svc fork with realtime support, improved interface and more features.
Realtime Voice Changer
Easily train a good VC model with voice data <= 10 mins!
Singing Voice Conversion via diffusion model
A simple C++ library for reading and writing audio files.
DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code
A simple program that can activate Cortana (English and Chinese) as an optional voice in the Control Panel.
Video import and export in Mathematica using Media Foundation