Stars
The official code for the paper: GoHD: Gaze-oriented and Highly Disentangled Portrait Animation with Rhythmic Poses and Realistic Expressions (Accepted by AAAI 2025)
A feature-rich command-line audio/video downloader
MuseTalk: Real-Time High Quality Lip Synchronization with Latent Space Inpainting
A quality zero-shot lipsync pipeline built with MuseTalk, LivePortrait, and CodeFormer.
Official implementation of Dynamic Frame Avatar with Non-autoregressive Diffusion Framework for Talking Head Video Generation
Image composition toolbox: everything you want to know about image composition or object insertion
Official Code for ECCV 2024 paper — One-Shot Diffusion Mimicker for Handwritten Text Generation
[ECCV'24] TalkingGaussian: Structure-Persistent 3D Talking Head Synthesis via Gaussian Splatting
[ICCV 2023] ProPainter: Improving Propagation and Transformer for Video Inpainting
Inpaint anything using Segment Anything and inpainting models.
[ECCV'2020] STTN: Learning Joint Spatial-Temporal Transformations for Video Inpainting
[CVPR2023] Blind Video Deflickering by Neural Filtering with a Flawed Atlas
[ECCV 2022] CelebV-HQ: A Large-Scale Video Facial Attributes Dataset
papers about Face Detection; Face Alignment; Face Recognition && Face Identification && Face Verification && Face Representation; Face Reconstruction; Face Tracking; Face Super-Resolution && Face D…
EchoMimic: Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning
Source code for the CVPR'20 paper "Blindly Assess Image Quality in the Wild Guided by A Self-Adaptive Hyper Network"
State-of-the-art 2D and 3D Face Analysis Project
V-Express aims to generate a talking head video under the control of a reference image, an audio, and a sequence of V-Kps images.
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
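downkyi (哔哩下载姬): a video download tool for the Bilibili website; supports batch downloads, 8K, HDR, and Dolby Vision, and provides a toolbox (audio/video extraction, watermark removal, etc.).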
This is the official source for our ICCV 2023 paper "EmoTalk: Speech-Driven Emotional Disentanglement for 3D Face Animation"
Official repo for VGen: a holistic video generation ecosystem built on diffusion models
The most powerful and modular diffusion model GUI, API, and backend with a graph/nodes interface.
[CSUR] A Survey on Video Diffusion Models