Stars
Python Data Science Handbook: full text in Jupyter Notebooks
[AAAI 2025] Offical implementation of "DrivingForward: Feed-forward 3D Gaussian Splatting for Driving Scene Reconstruction from Flexible Surround-view Input"
(AAAI 2025) TranSplat: Generalizable 3D Gaussian Splatting from Sparse Multi-View Images with Transformers
Implementation of the paper "Surface Maps via Adaptive Triangulations" (Eurographics 2023)
3D Gaussian Splatting Papers Relating to Large-Scale Scene.
[ECCV 2024 - Oral] ACE0 is a learning-based structure-from-motion approach that estimates camera parameters of sets of images by learning a multi-view consistent, implicit scene representation.
LLaMA 2 implemented from scratch in PyTorch
These scripts are used to download RealEstate10K dataset.
VILA is a family of state-of-the-art vision language models (VLMs) for diverse multimodal AI tasks across the edge, data center, and cloud.
[TPAMI'23] Unifying Flow, Stereo and Depth Estimation
A lightweight tool for camera pose visualization
D-FINE: Redefine Regression Task of DETRs as Fine-grained Distribution Refinement 💥💥💥
DepthSplat: Connecting Gaussian Splatting and Depth
(ෆ`꒳´ෆ) A Survey on Text-to-Image Generation/Synthesis.
This repo implements a Stable Diffusion model in PyTorch with all the essential components.
Notes, programming assignments and quizzes from all courses within the Coursera Deep Learning specialization offered by deeplearning.ai: (i) Neural Networks and Deep Learning; (ii) Improving Deep N…
Learning Resources And Links Of Machine Learning(updating)
[NeurIPS'24] This repository is the implementation of "SpatialRGPT: Grounded Spatial Reasoning in Vision Language Models"
Famous Vision Language Models and Their Architectures
Collection of AWESOME vision-language models for vision tasks
[ECCV 2024] Implementation of latentSplat: Autoencoding Variational Gaussians for Fast Generalizable 3D Reconstruction
[ECCV 2024] MVSGaussian: Fast Generalizable Gaussian Splatting Reconstruction from Multi-View Stereo
CVPR 2023: Learning to Render Novel Views from Wide-Baseline Stereo Pairs
Mastering Python for Finance – Second Edition, published by Packt
[CVPR2024 Oral] EscherNet: A Generative Model for Scalable View Synthesis
[ICCV 2023] PoseDiffusion: Solving Pose Estimation via Diffusion-aided Bundle Adjustment