XxSuper

XxSuper

2 followers · 2 following

MuseTalk Public
Forked from TMElyralab/MuseTalk

MuseTalk: Real-Time High Quality Lip Synchorization with Latent Space Inpainting

Python MIT License Updated Apr 3, 2024
stable-diffusion-webui Public
Forked from AUTOMATIC1111/stable-diffusion-webui

Stable Diffusion web UI

Python GNU Affero General Public License v3.0 Updated Mar 30, 2024
audio2photoreal Public
Forked from facebookresearch/audio2photoreal

Code and dataset for photorealistic Codec Avatars driven from audio

Python Other Updated Mar 29, 2024
AniPortrait Public
Forked from Zejun-Yang/AniPortrait

AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation

Python Apache License 2.0 Updated Mar 29, 2024
digital_human_video_player Public
Forked from Ikaros-521/digital_human_video_player

带HTTP API的数字人视频播放器，使用gradio api对接Easy-Wav2Lip、Sadtalker、GeneFacePlusPlus

Python GNU General Public License v3.0 Updated Mar 18, 2024
grok-1 Public
Forked from xai-org/grok-1

Grok open release

Python Apache License 2.0 Updated Mar 17, 2024
Real-Time-Voice-Cloning Public
Forked from CorentinJ/Real-Time-Voice-Cloning

Clone a voice in 5 seconds to generate arbitrary speech in real-time

Python Other Updated Mar 14, 2024
MeloTTS Public
Forked from myshell-ai/MeloTTS

High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.

Python MIT License Updated Mar 7, 2024
Open-Sora-Plan Public
Forked from PKU-YuanGroup/Open-Sora-Plan

This project aim to reproducing Sora (Open AI T2V model), but we only have limited resource. We deeply wish the all open source community can contribute to this project.

Python MIT License Updated Mar 7, 2024
RWKV-Runner Public
Forked from josStorer/RWKV-Runner

A RWKV management and startup tool, full automation, only 8MB. And provides an interface compatible with the OpenAI API. RWKV is a large language model that is fully open source and available for c…

TypeScript MIT License Updated Mar 5, 2024
SyncTalk Public
Forked from ZiqiaoPeng/SyncTalk

[CVPR 2024] This is the official source for our paper "SyncTalk: The Devil is in the Synchronization for Talking Head Synthesis"

Python Other Updated Mar 4, 2024
emotion2vec Public
Forked from ddlBoJack/emotion2vec

Official PyTorch code for extracting features and training downstream models with emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation

Python Updated Mar 2, 2024
RWKV-LM Public
Forked from BlinkDL/RWKV-LM

RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference,…

Python Apache License 2.0 Updated Feb 29, 2024
lip_mask Public
Forked from foxyear-kyumin/lip_mask

通过此代码可以免训练模型并通过轻量级服务器定制数字人形象

Python Apache License 2.0 Updated Feb 22, 2024
GeneFacePlusPlus Public
Forked from yerfor/GeneFacePlusPlus

GeneFace++: Generalized and Stable Real-Time 3D Talking Face Generation; Official Code

Python Updated Feb 14, 2024
SadTalker Public
Forked from OpenTalker/SadTalker

[CVPR 2023] SadTalker：Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation

Python Other Updated Feb 11, 2024
VALL-E-X Public
Forked from Plachtaa/VALL-E-X

An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io

Python MIT License Updated Feb 11, 2024
magic-animate Public
Forked from magic-research/magic-animate

[CVPR 2024] MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model

Python BSD 3-Clause "New" or "Revised" License Updated Feb 6, 2024
Retrieval-based-Voice-Conversion-WebUI Public
Forked from RVC-Project/Retrieval-based-Voice-Conversion-WebUI

Voice data <= 10 mins can also be used to train a good VC model!

Python MIT License Updated Jan 23, 2024
seamless_communication Public
Forked from facebookresearch/seamless_communication

Foundational Models for State-of-the-Art Speech and Text Translation

Jupyter Notebook Other Updated Jan 22, 2024
sd-wav2lip-uhq Public
Forked from numz/sd-wav2lip-uhq

Wav2Lip UHQ extension for Automatic1111

Python Apache License 2.0 Updated Jan 22, 2024
GPT-SoVITS Public
Forked from RVC-Boss/GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python MIT License Updated Jan 19, 2024
Bark-Voice-Cloning Public
Forked from KevinWang676/Bark-Voice-Cloning

Bark Voice Cloning and Voice Cloning for Chinese Speech

Jupyter Notebook MIT License Updated Jan 18, 2024
TTS Public
Forked from coqui-ai/TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python Mozilla Public License 2.0 Updated Jan 17, 2024
langchain-ChatGLM Public
Forked from chatchat-space/Langchain-Chatchat

langchain-ChatGLM, local knowledge based ChatGLM with langchain ｜基于本地知识库的 ChatGLM 问答

Python Apache License 2.0 Updated Jan 17, 2024
wav2lip384 Public
Forked from nghiakvnvsd/wav2lip384

Python Updated Dec 27, 2023
wav2lip_288x288 Public
Forked from primepake/wav2lip_288x288

Python Updated Dec 21, 2023
bark Public
Forked from suno-ai/bark

🔊 Text-Prompted Generative Audio Model

Jupyter Notebook MIT License Updated Dec 14, 2023
so-vits-svc Public
Forked from svc-develop-team/so-vits-svc

SoftVC VITS Singing Voice Conversion

Python GNU Affero General Public License v3.0 Updated Nov 11, 2023
ER-NeRF Public
Forked from Fictionarry/ER-NeRF

[ICCV'23] Efficient Region-Aware Neural Radiance Fields for High-Fidelity Talking Portrait Synthesis

Python MIT License Updated Nov 7, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

XxSuper

Block or report XxSuper

MuseTalk Public

stable-diffusion-webui Public

audio2photoreal Public

AniPortrait Public

digital_human_video_player Public

grok-1 Public

Real-Time-Voice-Cloning Public

MeloTTS Public

Open-Sora-Plan Public

RWKV-Runner Public

SyncTalk Public

emotion2vec Public

RWKV-LM Public

lip_mask Public

GeneFacePlusPlus Public

SadTalker Public

VALL-E-X Public

magic-animate Public

Retrieval-based-Voice-Conversion-WebUI Public

seamless_communication Public

sd-wav2lip-uhq Public

GPT-SoVITS Public

Bark-Voice-Cloning Public

TTS Public

langchain-ChatGLM Public

wav2lip384 Public

wav2lip_288x288 Public

bark Public

so-vits-svc Public

ER-NeRF Public