Highlights
- Pro
Stars
Video Generation Foundation Models: https://saiyan-world.github.io/goku/
[arXiv 2025] Diffusion as Shader: 3D-aware Video Diffusion for Versatile Video Generation Control
[NeurIPS 2024] Neural Localizer Fields for Continuous 3D Human Pose and Shape Estimation
A PyTorch library for implementing flow matching algorithms, featuring continuous and discrete flow matching implementations. It includes practical examples for both text and image modalities.
[CVPR 2025] Official repository for "Dora: Sampling and Benchmarking for 3D Shape Variational Auto-Encoders"
Official implementation of "HumanRig: Learning Automatic Rigging for Humanoid Character in a Large Scale Dataset"
Code for Paper "UniAnimate: Taming Unified Video Diffusion Models for Consistent Human Image Animation".
[CVPR 2024] Arbitrary Motion Style Transfer with Multi-condition Motion Latent Diffusion Model
[LCLR 2025 Oral] TANGO: Co-Speech Gesture Video Reenactment with Hierarchical Audio-Motion Embedding and Diffusion Interpolation
Retargeting of the Motorica Dance dataset onto a common skeleton
Modern protocol-side framework implemented based on NTQQ
Official implementation of dual quaternion transformations as described in the paper "Pose Representations for Deep Skeletal Animation".
PyTorch implementation of MAR+DiffLoss https://arxiv.org/abs/2406.11838
StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation
Unity framework for motion alignment across different morphologies with no supervision [SIGGRAPH 2024]
A vector-quantized periodic autoencoder (VQ-PAE) for motion alignment across different morphologies with no supervision [SIGGRAPH 2024]
(SIGGRAPH 2024) Official repository for "Taming Diffusion Probabilistic Models for Character Control"
Repository for our paper: FLD: Fourier Latent Dynamics for Structured Motion Representation and Learning, Proceedings of the 12th International Conference on Learning Representations (ICLR)
Outfit Anyone: Ultra-high quality virtual try-on for Any Clothing and Any Person
PantoMatrix: Generating Face and Body Animation from Speech
[Open-source Project] UniMoCap: community implementation to unify the text-motion datasets (HumanML3D, KIT-ML, and BABEL) and whole-body motion dataset (Motion-X).