Stars
A generative world for general-purpose robotics & embodied AI learning.
We present StableAnimator, the first end-to-end ID-preserving video diffusion framework, which synthesizes high-quality videos without any post-processing, conditioned on a reference image and a se…
Memory-Guided Diffusion for Expressive Talking Video Generation
Unofficial implementation of MIMO (MImicking anyone anywhere with complex Motions and Object interactions)
Bring portraits to life in real time! ONNX/TensorRT support! Real-time portrait animation!
HunyuanVideo: A Systematic Framework For Large Video Generation Model
EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation
MikuDance: Animating Character Art with Mixed Motion Dynamics
[ECCV 2022] AutoTransition: Learning to Recommend Video Transition Effects
Tencent Hunyuan3D-1.0: A Unified Framework for Text-to-3D and Image-to-3D Generation
Official implementation of the paper "TANGO: Co-Speech Gesture Video Reenactment with Hierarchical Audio-Motion Embedding and Diffusion Interpolation"
Allegro is a powerful text-to-video model that generates high-quality videos up to 6 seconds at 15 FPS and 720p resolution from simple text input.
Code for "GVHMR: World-Grounded Human Motion Recovery via Gravity-View Coordinates", SIGGRAPH Asia 2024
Python packaging and dependency management made easy
The Dawn of Video Generation: Preliminary Explorations with SORA-like Models
Code of Pyramidal Flow Matching for Efficient Video Generative Modeling