Stars
Tencent Hunyuan3D-1.0: A Unified Framework for Text-to-3D and Image-to-3D Generation
Official implementation of "Exploring Temporally-Aware Features for Point Tracking"
Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.
Simple creation of data classes from dictionaries.
PyTorch Lightning + Hydra. A very user-friendly template for ML experimentation. ⚡🔥⚡
Persistent remote applications for X11; screen sharing for X11, macOS and MS Windows.
Official implementation of Lotus: Diffusion-based Visual Foundation Model for High-quality Dense Prediction
Cosmos is a world model development platform that consists of world foundation models, tokenizers and a video processing pipeline to accelerate the development of Physical AI at Robotics & AV labs. C…
Official code for "FeatUp: A Model-Agnostic Framework for Features at Any Resolution" ICLR 2024
MEt3R: Measuring Multi-View Consistency in Generated Images
An open source Multi-View Latent Diffusion Model
Official repo for paper "Structured 3D Latents for Scalable and Versatile 3D Generation".
Flexible Python configuration system. The last one you will ever need.
Hydra is a framework for elegantly configuring complex applications
The official PyTorch implementation of Towards Fast, Accurate and Stable 3D Dense Face Alignment, ECCV 2020.
Python library for loading and using triangular meshes.
Easy-to-use glTF 2.0-compliant OpenGL renderer for visualization of 3D scenes.
COLMAP - Structure-from-Motion and Multi-View Stereo
[CVPR 2024] RoMa: Robust Dense Feature Matching; RoMa is the robust dense feature matcher capable of estimating pixel-dense warps and reliable certainties for almost any image pair.
[CVPR'24 Highlight & Best Demo Award] Gaussian Splatting SLAM
DepthSplat: Connecting Gaussian Splatting and Depth
Code for RealmDreamer: Text-Driven 3D Scene Generation with Inpainting and Depth Diffusion [arXiv 2024]
[arXiv'24] GAGS: Granularity-Aware 3D Feature Distillation for Gaussian Splatting
[arXiv'24] [Image-to-Scene on a 4090(24G)] VistaDream: Sampling multiview consistent images for single-view scene reconstruction
[AAAI 2025] DepthFM: Fast Monocular Depth Estimation with Flow Matching