Starred repositories
Official implementation of HumanVid, NeurIPS D&B Track 2024
Papers, datasets, and resources related to 2D cartoon video research. Contributions welcome.
[Siggraph Asia 2024] Follow-Your-Emoji: This repo is the official implementation of "Follow-Your-Emoji: Fine-Controllable and Expressive Freestyle Portrait Animation"
Code for Paper "UniAnimate: Taming Unified Video Diffusion Models for Consistent Human Image Animation".
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
[ECCV 2024 Oral] EDTalk - Official PyTorch Implementation
V-Express aims to generate a talking head video under the control of a reference image, an audio, and a sequence of V-Kps images.
[CVPR 2024] This is the official source for our paper "SyncTalk: The Devil is in the Synchronization for Talking Head Synthesis"
[arXiv 2024] MultiBooth: This repo is the official implementation of "MultiBooth: Towards Generating All Your Concepts in an Image from Text"
MuseTalk: Real-Time High Quality Lip Synchorization with Latent Space Inpainting
A large-scale text-to-image prompt gallery dataset based on Stable Diffusion
Using Claude Opus to reverse engineer code from VASA white paper - WIP - (this is for La Raza 🎷)
Lumina-T2X is a unified framework for Text to Any Modality Generation
CVPR2023 talking face implementation for Identity-Preserving Talking Face Generation With Landmark and Appearance Priors
[ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors
Pythonic AI generation of images and videos
Code and data for "AnyV2V: A Tuning-Free Framework For Any Video-to-Video Editing Tasks"
CosmicMan: A Text-to-Image Foundation Model for Humans (CVPR 2024)
[CVPR2024] Official implementation of High-fidelity Person-centric Subject-to-Image Synthesis.
[CVPR 2024] Make-Your-Anchor: A Diffusion-based 2D Avatar Generation Framework.
Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.
Source code for "Taming Visually Guided Sound Generation" (Oral at the BMVC 2021)
Diff-Foley: Synchronized Video-to-Audio Synthesis with Latent Diffusion Models