
Wan: Open and Advanced Large-Scale Video Generative Models

Python 603 34 Updated Feb 25, 2025

Code for the project "MegaSaM: Accurate, Fast and Robust Structure and Motion from Casual Dynamic Videos"

Python 563 15 Updated Feb 20, 2025

Video Generation Foundation Models: https://saiyan-world.github.io/goku/

Python 2,462 256 Updated Feb 19, 2025

Improving Video Generation with Human Feedback

Python 104 Updated Feb 12, 2025

3D Reconstruction for all

Rust 1,426 53 Updated Feb 25, 2025

Official implementation of Continuous 3D Perception Model with Persistent State

Python 603 23 Updated Feb 24, 2025

Janus-Series: Unified Multimodal Understanding and Generation Models

Python 16,301 2,146 Updated Feb 1, 2025

Video Depth Anything: Consistent Depth Estimation for Super-Long Videos

Python 648 39 Updated Feb 13, 2025

High-Resolution 3D Assets Generation with Large Scale Hunyuan3D Diffusion Models.

Python 6,484 488 Updated Feb 24, 2025

💪 [ARXIV 2025] PyTorch implementation of 'HAC++: Towards 100X Compression of 3D Gaussian Splatting'

Python 87 8 Updated Feb 10, 2025

Code for "Real3D: Scaling Up Large Reconstruction Models with Real-World Images"

Python 163 3 Updated Jun 13, 2024

🍳 [arXiv'24] PanSplat: 4K Panorama Synthesis with Feed-Forward Gaussian Splatting

Python 89 6 Updated Feb 22, 2025

Self-reimplemented version of Long-LRM.

Jupyter Notebook 127 3 Updated Feb 22, 2025

Official implementation of the paper "Koala-36M: A Large-scale Video Dataset Improving Consistency between Fine-grained Conditions and Video Content".

Python 152 4 Updated Nov 8, 2024

Code for "MatchAnything: Universal Cross-Modality Image Matching with Large-Scale Pre-Training", arXiv 2025.

782 23 Updated Jan 14, 2025

Official code for paper: F3D-Gaus: Feed-forward 3D-aware Generation on ImageNet with Cycle-Consistent Gaussian Splatting

Python 34 1 Updated Jan 14, 2025

[arXiv 2025] Diffusion as Shader: 3D-aware Video Diffusion for Versatile Video Generation Control

Python 478 16 Updated Feb 22, 2025

Cosmos is a world model development platform that consists of world foundation models, tokenizers and video processing pipeline to accelerate the development of Physical AI at Robotics & AV labs. C…

Python 7,578 481 Updated Feb 12, 2025

A PyTorch library for implementing flow matching algorithms, featuring continuous and discrete flow matching implementations. It includes practical examples for both text and image modalities.

Python 2,028 97 Updated Jan 2, 2025

Official repository for "Dora: Sampling and Benchmarking for 3D Shape Variational Auto-Encoders"

Python 277 10 Updated Feb 24, 2025

Code release for https://kovenyu.com/WonderWorld/

Python 442 20 Updated Dec 22, 2024

Official implementation for paper - LeviTor: 3D Trajectory Oriented Image-to-Video Synthesis

Python 121 5 Updated Dec 20, 2024

Code for MegaSynth: Scaling Up 3D Scene Reconstruction with Synthesized Data

Python 137 3 Updated Dec 19, 2024

Prompt Depth Anything

Python 558 28 Updated Feb 17, 2025

A generative world for general-purpose robotics & embodied AI learning.

Python 24,057 2,073 Updated Feb 25, 2025
Python 51 Updated Feb 7, 2025

DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding

Python 4,039 1,579 Updated Feb 20, 2025

[ICLR'25] 3DTrajMaster: Mastering 3D Trajectory for Multi-Entity Motion in Video Generation

Jupyter Notebook 302 15 Updated Feb 25, 2025

[ICLR'25] SynCamMaster: Synchronizing Multi-Camera Video Generation from Diverse Viewpoints

Python 494 14 Updated Dec 11, 2024

Pointcept: a codebase for point cloud perception research. Latest works: PTv3 (CVPR'24 Oral), PPT (CVPR'24), OA-CNNs (CVPR'24), MSC (CVPR'23)

Python 1,864 205 Updated Jan 16, 2025