Stars
Give us minutes, we give back a faster Mamba. The official implementation of "Faster Vision Mamba is Rebuilt in Minutes via Merged Token Re-training".
[ARXIV'24] SynCamMaster: Synchronizing Multi-Camera Video Generation from Diverse Viewpoints
Code for the project "MegaSaM: Accurate, Fast and Robust Structure and Motion from Casual Dynamic Videos"
DimensionX: Create Any 3D and 4D Scenes from a Single Image with Controllable Video Diffusion
A suite of image and video neural tokenizers
SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer
HART: Efficient Visual Generation with Hybrid Autoregressive Transformer
Official Implementation of paper "MonST3R: A Simple Approach for Estimating Geometry in the Presence of Motion"
DepthCrafter: Generating Consistent Long Depth Sequences for Open-world Videos
Official repository for Splatt3R: Zero-shot Gaussian Splatting from Uncalibrated Image Pairs
A modular differentiable Gaussian rasterization library.
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
ICCV 2023 "Neural Video Depth Stabilizer" (NVDS) & TPAMI 2024 "NVDS+: Towards Efficient and Versatile Neural Stabilizer for Video Depth Estimation" (NVDS+)
[NeurIPS 2024] Motion Consistency Model: Accelerating Video Diffusion with Disentangled Motion-Appearance Distillation
"Diffusion4D: Fast Spatial-temporal Consistent 4D Generation via Video Diffusion Models", Hanwen Liang*, Yuyang Yin*, Dejia Xu, Hanxue Liang, Zhangyang Wang, Konstantinos N. Plataniotis, Yao Zhao, …
(ECCV 2024) Empowering Multimodal Large Language Model as a Powerful Data Generator
A lightweight blobfuse-like Python tool that transfers data through azcopy
Original reference implementation of "3D Gaussian Splatting for Real-Time Radiance Field Rendering"
Official PyTorch implementation of "A Unified Approach for Text- and Image-guided 4D Scene Generation", [CVPR 2024]
The official source code for "X-Ray: A Sequential 3D Representation for Generation".
[NeurIPS 2024 Oral][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-sim…
Stable Video Diffusion Training Code and Extensions.
Official code of "Segment any 3D Object with Language"
Everything about note management. All in Zotero.
TC4D: Trajectory-Conditioned Text-to-4D Generation