-
MuseTalk Public
Forked from TMElyralab/MuseTalk
MuseTalk: Real-Time High Quality Lip Synchronization with Latent Space Inpainting
Python MIT License Updated Apr 3, 2024 -
stable-diffusion-webui Public
Forked from AUTOMATIC1111/stable-diffusion-webui
Stable Diffusion web UI
Python GNU Affero General Public License v3.0 Updated Mar 30, 2024 -
audio2photoreal Public
Forked from facebookresearch/audio2photoreal
Code and dataset for photorealistic Codec Avatars driven from audio
Python Other Updated Mar 29, 2024 -
AniPortrait Public
Forked from Zejun-Yang/AniPortrait
AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation
Python Apache License 2.0 Updated Mar 29, 2024 -
digital_human_video_player Public
Forked from Ikaros-521/digital_human_video_player
Digital human video player with an HTTP API; uses the Gradio API to connect to Easy-Wav2Lip, Sadtalker, and GeneFacePlusPlus
Python GNU General Public License v3.0 Updated Mar 18, 2024 -
grok-1 Public
Forked from xai-org/grok-1
Grok open release
Python Apache License 2.0 Updated Mar 17, 2024 -
Real-Time-Voice-Cloning Public
Forked from CorentinJ/Real-Time-Voice-Cloning
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Python Other Updated Mar 14, 2024 -
MeloTTS Public
Forked from myshell-ai/MeloTTS
High-quality multi-lingual text-to-speech library by MyShell.ai. Supports English, Spanish, French, Chinese, Japanese and Korean.
Python MIT License Updated Mar 7, 2024 -
Open-Sora-Plan Public
Forked from PKU-YuanGroup/Open-Sora-Plan
This project aims to reproduce Sora (OpenAI's T2V model), but we have only limited resources. We deeply hope the whole open-source community can contribute to this project.
Python MIT License Updated Mar 7, 2024 -
RWKV-Runner Public
Forked from josStorer/RWKV-Runner
An RWKV management and startup tool, fully automated, only 8MB. It also provides an interface compatible with the OpenAI API. RWKV is a large language model that is fully open source and available for c…
TypeScript MIT License Updated Mar 5, 2024 -
SyncTalk Public
Forked from ZiqiaoPeng/SyncTalk
[CVPR 2024] This is the official source for our paper "SyncTalk: The Devil is in the Synchronization for Talking Head Synthesis"
Python Other Updated Mar 4, 2024 -
emotion2vec Public
Forked from ddlBoJack/emotion2vec
Official PyTorch code for extracting features and training downstream models with emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation
Python Updated Mar 2, 2024 -
RWKV-LM Public
Forked from BlinkDL/RWKV-LM
RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference,…
Python Apache License 2.0 Updated Feb 29, 2024 -
lip_mask Public
Forked from foxyear-kyumin/lip_mask
With this code you can customize a digital human avatar on a lightweight server without training a model
Python Apache License 2.0 Updated Feb 22, 2024 -
GeneFacePlusPlus Public
Forked from yerfor/GeneFacePlusPlus
GeneFace++: Generalized and Stable Real-Time 3D Talking Face Generation; Official Code
Python Updated Feb 14, 2024 -
SadTalker Public
Forked from OpenTalker/SadTalker
[CVPR 2023] SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
Python Other Updated Feb 11, 2024 -
VALL-E-X Public
Forked from Plachtaa/VALL-E-X
An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available at https://plachtaa.github.io
Python MIT License Updated Feb 11, 2024 -
magic-animate Public
Forked from magic-research/magic-animate
[CVPR 2024] MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model
Python BSD 3-Clause "New" or "Revised" License Updated Feb 6, 2024 -
Retrieval-based-Voice-Conversion-WebUI Public
Forked from RVC-Project/Retrieval-based-Voice-Conversion-WebUI
Voice data <= 10 mins can also be used to train a good VC model!
Python MIT License Updated Jan 23, 2024 -
seamless_communication Public
Forked from facebookresearch/seamless_communication
Foundational Models for State-of-the-Art Speech and Text Translation
Jupyter Notebook Other Updated Jan 22, 2024 -
sd-wav2lip-uhq Public
Forked from numz/sd-wav2lip-uhq
Wav2Lip UHQ extension for Automatic1111
Python Apache License 2.0 Updated Jan 22, 2024 -
GPT-SoVITS Public
Forked from RVC-Boss/GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
Python MIT License Updated Jan 19, 2024 -
Bark-Voice-Cloning Public
Forked from KevinWang676/Bark-Voice-Cloning
Bark Voice Cloning and Voice Cloning for Chinese Speech
Jupyter Notebook MIT License Updated Jan 18, 2024 -
TTS Public
Forked from coqui-ai/TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Python Mozilla Public License 2.0 Updated Jan 17, 2024 -
langchain-ChatGLM Public
Forked from chatchat-space/Langchain-Chatchat
langchain-ChatGLM, local knowledge based ChatGLM with langchain | ChatGLM question answering based on a local knowledge base
Python Apache License 2.0 Updated Jan 17, 2024 -
bark Public
Forked from suno-ai/bark
🔊 Text-Prompted Generative Audio Model
Jupyter Notebook MIT License Updated Dec 14, 2023 -
so-vits-svc Public
Forked from svc-develop-team/so-vits-svc
SoftVC VITS Singing Voice Conversion
Python GNU Affero General Public License v3.0 Updated Nov 11, 2023 -
ER-NeRF Public
Forked from Fictionarry/ER-NeRF
[ICCV'23] Efficient Region-Aware Neural Radiance Fields for High-Fidelity Talking Portrait Synthesis
Python MIT License Updated Nov 7, 2023