-
SZU & TME
Stars
Towards Open-source GPT-4o with Vision, Speech and Duplex Capabilities。
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
A generative speech model for daily dialogue.
MusePose: a Pose-Driven Image-to-Video Framework for Virtual Human Generation
MuseTalk: Real-Time High Quality Lip Synchorization with Latent Space Inpainting
MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising
Emote Portrait Alive: Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions
Official implementation of OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-on
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
An In-depth Analysis of Diffusion Probability Model
Unofficial Implementation of Animate Anyone
[SIGGRAPH Asia 2023] Rerender A Video: Zero-Shot Text-Guided Video-to-Video Translation
animatediff prompt travel
Curated list of papers and resources focused on 3D Gaussian Splatting, intended to keep pace with the anticipated surge of research in the coming months.
Original reference implementation of "3D Gaussian Splatting for Real-Time Radiance Field Rendering"
FreeU: Free Lunch in Diffusion U-Net (CVPR2024 Oral)
[ICCV 2023] Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation
A General NeRF Acceleration Toolbox in PyTorch.
Lightning fast C++/CUDA neural network framework
Prompt-Free Diffusion: Taking "Text" out of Text-to-Image Diffusion Models, arxiv 2023 / CVPR 2024
Real-time face swap for PC streaming or video calls
Stable Diffusion web UI
Official Code for DragGAN (SIGGRAPH 2023)
ICT's Vision and Graphics Lab's morphable face model and toolkit
[ICCV2023] Delicate Textured Mesh Recovery from NeRF via Adaptive Surface Refinement
NeRD: Neural Reflectance Decomposition from Image Collections - ICCV 2021