Stars
[arXiv 2023] DreamGaussian4D: Generative 4D Gaussian Splatting
This is the official implementation of Tensor4D: Efficient Neural 4D Decomposition for High-fidelity Dynamic Reconstruction and Rendering.
A collaboration friendly studio for NeRFs
Code for "Point-based Multi-view Stereo Network" (ICCV 2019 Oral) & "Visibility-aware Point-based Multi-view Stereo Network" (TPAMI)
[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation
BlendedMVS: A Large-scale Dataset for Generalized Multi-view Stereo Networks
A Unified Framework for Quantifying Privacy Risk in Synthetic Data according to the GDPR
[CVPR 2024 Highlight] PhysGaussian: Physics-Integrated 3D Gaussians for Generative Dynamics
Modeling and Scaling Generative AI Systems for BeNefri master program
4D-fy: Text-to-4D Generation Using Hybrid Score Distillation Sampling
GaussianDreamer: Fast Generation from Text to 3D Gaussians by Bridging 2D and 3D Diffusion Models (CVPR 2024)
Text-to-3D & Image-to-3D & Mesh Exportation with NeRF + Diffusion.
RDPWrap.ini for RDP Wrapper Library by Stas'M
COLMAP - Structure-from-Motion and Multi-View Stereo
Original reference implementation of "3D Gaussian Splatting for Real-Time Radiance Field Rendering"
Code for "CSDI: Conditional Score-based Diffusion Models for Probabilistic Time Series Imputation"
[CVPR'24 - Rebuttal Score 554] GenN2N: Generative NeRF2NeRF Translation
OpenPose: Real-time multi-person keypoint detection library for body, face, hands, and foot estimation
The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
Official implementation for "CLIP-ReID: Exploiting Vision-Language Model for Image Re-identification without Concrete Text Labels" (AAAI 2023)
[CVPR 2024] SiTH: Single-view Textured Human Reconstruction with Image-Conditioned Diffusion
Official implementation of "LucidDreamer: Towards High-Fidelity Text-to-3D Generation via Interval Score Matching"
[NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training
Document Layout Segmentation with an Inception-based U-Net architecture
vit-pytorch-with-pretrained-weights
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch