-
moshi Public
Forked from kyutai-labs/moshi类似GPT4O,语音端到端交互
Python Apache License 2.0 UpdatedSep 19, 2024 -
seed-vc Public
Forked from Plachtaa/seed-vcseed-tts: zero-shot voice conversion with in context learning
Python MIT License UpdatedSep 14, 2024 -
stable-speech Public
Forked from huggingface/parler-ttsReproduction of Stability AI's Text-to-Speech model.
Python Apache License 2.0 UpdatedSep 14, 2024 -
speech-trident Public
Forked from ga642381/speech-tridentAwesome speech/audio LLMs, representation learning, and codec models
2 UpdatedSep 14, 2024 -
e2-tts-pytorch Public
Forked from lucidrains/e2-tts-pytorchFlow-matching Transformer,Implementation of E2-TTS, "Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS", in Pytorch
Python MIT License UpdatedSep 10, 2024 -
-
HierSpeechpp Public
Forked from sh-lee-prml/HierSpeechppThe official implementation of HierSpeech++
Python MIT License UpdatedSep 5, 2024 -
TTS-arxiv-daily Public
Forked from liutaocode/TTS-arxiv-dailyAutomatically Update Text-to-speech (TTS) Papers Daily using Github Actions (Update Every 12th hours)
Python Apache License 2.0 UpdatedAug 22, 2024 -
REAL_TIME_NKF_AEC Public
Forked from William1617/REAL_TIME_NKF_AEC神经网络回声消除,C实现
-
FasterLivePortrait Public
Forked from warmshao/FasterLivePortraitBring portraits to life in Real Time!onnx/tensorrt support!
Python UpdatedJul 25, 2024 -
gryannote Public
Forked from clement-pages/gryannote说话人识别,Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.
Svelte MIT License UpdatedJul 19, 2024 -
SysMocap Public
Forked from xianfei/SysMocap数字人动捕和驱动完整方案 A real-time motion capture system for 3D virtual character animating.
JavaScript Mozilla Public License 2.0 UpdatedJul 18, 2024 -
SenseVoice-onnx Public
Forked from lovemefan/SenseVoice-pythonsensevoice with onnx runtime
Python UpdatedJul 18, 2024 -
Qwen2-Audio Public
Forked from QwenLM/Qwen2-AudioThe official repo of Qwen2-Audio chat & pretrained large audio language model proposed by Alibaba Cloud.
UpdatedJul 16, 2024 -
tinyspeech Public
Forked from AkshathRaghav/tinyspeechCode release for "TinySpeech: Attention Condensers for Deep Speech Recognition Neural Networks on Edge Devices"
Python MIT License UpdatedJul 15, 2024 -
Diff-MST Public
Forked from sai-soum/Diff-MST音乐生成,Multitrack music mixing style transfer given a reference song using differentiable mixing console.
Jupyter Notebook Other UpdatedJul 11, 2024 -
optispeech Public
Forked from mush42/optispeechTTS, A lightweight end-to-end text-to-speech model
Python MIT License UpdatedJul 10, 2024 -
BigVGAN-Official Public
Forked from NVIDIA/BigVGAN终于开源了 Official implementation of BigVGAN in PyTorch
Python MIT License UpdatedJul 10, 2024 -
silero-vad Public
Forked from snakers4/silero-vad大数据训练的VAD Silero VAD: pre-trained enterprise-grade Voice Activity Detector, Language Classifier and Spoken Number Detector
Python MIT License UpdatedJul 10, 2024 -
MaxKB Public
Forked from 1Panel-dev/MaxKBRAG 🚀 基于 LLM 大语言模型的知识库问答系统。开箱即用、模型中立、灵活编排,支持快速嵌入到第三方业务系统,1Panel 官方出品。
Python GNU General Public License v3.0 UpdatedJul 9, 2024 -
MARS5-TTS Public
Forked from Camb-ai/MARS5-TTSMARS5 speech model (TTS) from CAMB.AI
Python GNU Affero General Public License v3.0 UpdatedJul 8, 2024 -
LivePortrait Public
Forked from KwaiVGI/LivePortrait头像动作迁移,Make one portrait alive!
Python MIT License UpdatedJul 8, 2024 -
CosyVoice Public
Forked from FunAudioLLM/CosyVoiceLLM based TTS model, providing inference/training/deployment full-stack ability.
-
promonet Public
Forked from maxrmorrison/promonet语音编辑,Prosody and Pronunciation Modification Network
Python MIT License UpdatedJul 7, 2024 -
SenseVoice Public
Forked from FunAudioLLM/SenseVoice语音克隆数据清洗必备,Multilingual Voice Understanding Model
-
StreamingHiFiGAN Public
Forked from facebookresearch/AudioDecAn Open-source Streaming High-fidelity Neural Audio Codec
-
DEX-TTS Public
Forked from winddori2002/DEX-TTSDEX-TTS: Diffusion-based EXpressive TTS with Style Modeling on Time Variability
-
noise-reduction Public
Forked from dengcunqin/noise-reductionnoise reduction
Python UpdatedJul 3, 2024 -
faster-whisper Public
Forked from SYSTRAN/faster-whisperFaster Whisper transcription with CTranslate2
Python MIT License UpdatedJul 2, 2024 -
BetterFastSpeech2 Public
Forked from shivammehta25/BetterFastSpeech2代码美化重构版,FastSpeech2
Jupyter Notebook MIT License UpdatedJul 1, 2024