Official implementation of the paper "Koala-36M: A Large-scale Video Dataset Improving Consistency between Fine-grained Conditions and Video Content".

Python 152 4 Updated Nov 8, 2024

zju3dv / MatchAnything

Code for "MatchAnything: Universal Cross-Modality Image Matching with Large-Scale Pre-Training", arXiv 2025.

782 23 Updated Jan 14, 2025

W-Ted / F3D-Gaus

Official code for paper: F3D-Gaus: Feed-forward 3D-aware Generation on ImageNet with Cycle-Consistent Gaussian Splatting

Python 34 1 Updated Jan 14, 2025

IGL-HKUST / DiffusionAsShader

[arXiv 2025] Diffusion as Shader: 3D-aware Video Diffusion for Versatile Video Generation Control

Python 478 16 Updated Feb 22, 2025

NVIDIA / Cosmos

Cosmos is a world model development platform that consists of world foundation models, tokenizers and video processing pipeline to accelerate the development of Physical AI at Robotics & AV labs. C…

Python 7,578 481 Updated Feb 12, 2025

facebookresearch / flow_matching

A PyTorch library for implementing flow matching algorithms, featuring continuous and discrete flow matching implementations. It includes practical examples for both text and image modalities.

Python 2,028 97 Updated Jan 2, 2025

Seed3D / Dora

Official repository for "Dora: Sampling and Benchmarking for 3D Shape Variational Auto-Encoders"

Python 277 10 Updated Feb 24, 2025

KovenYu / WonderWorld

Code release for https://kovenyu.com/WonderWorld/

Python 442 20 Updated Dec 22, 2024

ant-research / LeviTor

Official implementation for paper - LeviTor: 3D Trajectory Oriented Image-to-Video Synthesis

Python 121 5 Updated Dec 20, 2024

hwjiang1510 / MegaSynth

Code for MegaSynth: Scaling Up 3D Scene Reconstruction with Synthesized Data

Python 137 3 Updated Dec 19, 2024

DepthAnything / PromptDA

Prompt Depth Anything

Python 558 28 Updated Feb 17, 2025

Genesis-Embodied-AI / Genesis

A generative world for general-purpose robotics & embodied AI learning.

Python 24,057 2,073 Updated Feb 25, 2025

MattWallingford / 360-1M

Python 51 Updated Feb 7, 2025

deepseek-ai / DeepSeek-VL2

DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding

Python 4,039 1,579 Updated Feb 20, 2025

KwaiVGI / 3DTrajMaster

[ICLR'25] 3DTrajMaster: Mastering 3D Trajectory for Multi-Entity Motion in Video Generation

Jupyter Notebook 302 15 Updated Feb 25, 2025

KwaiVGI / SynCamMaster

[ICLR'25] SynCamMaster: Synchronizing Multi-Camera Video Generation from Diverse Viewpoints

Python 494 14 Updated Dec 11, 2024

Pointcept / Pointcept

Pointcept: a codebase for point cloud perception research. Latest works: PTv3 (CVPR'24 Oral), PPT (CVPR'24), OA-CNNs (CVPR'24), MSC (CVPR'23)

Python 1,864 205 Updated Jan 16, 2025

Wooqy QianyiWu

Stars

Wan-Video / Wan2.1

mega-sam / mega-sam

Saiyan-World / goku

KwaiVGI / VideoAlign

ArthurBrussee / brush

CUT3R / CUT3R

deepseek-ai / Janus

DepthAnything / Video-Depth-Anything

Tencent / Hunyuan3D-2

YihangChen-ee / HAC-plus

hwjiang1510 / Real3D

chengzhag / PanSplat

arthurhero / Long-LRM

KwaiVGI / Koala-36M