Stars
[CVPR 2024 Highlight] DistriFusion: Distributed Parallel Inference for High-Resolution Diffusion Models
Efficient vision foundation models for high-resolution generation and perception.
PeRFlow: Piecewise Rectified Flow as Universal Plug-and-Play Accelerator (NeurIPS 2024)
Hallo2: Long-Duration and High-Resolution Audio-driven Portrait Image Animation
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation
《Hello 算法》:动画图解、一键运行的数据结构与算法教程。支持 Python, Java, C++, C, C#, JS, Go, Swift, Rust, Ruby, Kotlin, TS, Dart 代码。简体版和繁体版同步更新,English version ongoing
EchoMimic: Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning
💬 An extensive collection of exceptional resources dedicated to the captivating world of talking face synthesis! ⭐ If you find this repo useful, please give it a star! 🤩
codes for paper "Pose-aware Attention Network for Flexible Motion Retargeting by Body Part" (TVCG2023)
Official implementation of "MoMask: Generative Masked Modeling of 3D Human Motions (CVPR2024)"
🔥🔥🔥 Set the world of 3D faces on fire with INFERNO 🔥🔥🔥
Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.
The official code of our ICCV2023 work: Implicit Identity Representation Conditioned Memory Compensation Network for Talking Head video Generation
An optimized pipeline for DINet reducing inference latency for up to 60% 🚀. Kudos for the authors of the original repo for this amazing work.
Official implementations for paper: DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models
[CVPR 2023] SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
Auto detecting, masking and inpainting with detection model.
Easy Docker setup for Stable Diffusion with user-friendly UI
Latent Consistency Model for AUTOMATIC1111 Stable Diffusion WebUI
This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For HD commercial model, please try out Sync Labs
[AAAI 2023] Exploring CLIP for Assessing the Look and Feel of Images