Lists (1)
Sort Name ascending (A-Z)
Stars
📖 A curated list of resources dedicated to talking face.
Video Generation Foundation Models: https://saiyan-world.github.io/goku/
Code release for "LLMs can see and hear without any training"
A Comprehensive Survey of Forgetting in Deep Learning Beyond Continual Learning. TPAMI, 2024.
[ACM MM Award] AV-Deepfake1M: A Large-Scale LLM-Driven Audio-Visual Deepfake Dataset
Official implementations for paper: DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models
Famous Vision Language Models and Their Architectures
[ECCV 2024] All You Need is Your Voice: Emotional Face Representation with Audio Perspective for Emotional Talking Face Generation
[ECCV 2024 Oral] EDTalk - Official PyTorch Implementation
[CVPR2020] "Detecting Attended Visual Targets in Video"
Pytorch implementation of our method for high-resolution (e.g. 2048x1024) photorealistic video-to-video translation.
[ECCV 2022] CelebV-HQ: A Large-Scale Video Facial Attributes Dataset
AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation
Face super resolution based on ESRGAN
Arbitrary-Scale Video Super-Resolution with Structural and Textural Priors (ECCV2024)
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
Open source implementation of CVPR 2020 "Video to Events: Recycling Video Dataset for Event Cameras"
[ICML 2024] EvTexture: Event-driven Texture Enhancement for Video Super-Resolution
Official PyTorch Implementation of MambaVision: A Hybrid Mamba-Transformer Vision Backbone
[Interspeech 2024] Whisper-Flamingo: Integrating Visual Features into Whisper for Audio-Visual Speech Recognition and Translation
RAVE: Randomized Noise Shuffling for Fast and Consistent Video Editing with Diffusion Models [CVPR 2024]