-
Penn State University
- Santa Clara, CA
Stars
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
DeepSeek-VL: Towards Real-World Vision-Language Understanding
ECCV2022 - Real-Time Intermediate Flow Estimation for Video Frame Interpolation
A generative speech model for daily dialogue.
Lightning fast C++/CUDA neural network framework
Samples for CUDA Developers which demonstrates features in CUDA Toolkit
[ECCV 2022] RC-MVSNet: Unsupervised Multi-View Stereo with Neural Rendering
3D reconstruction project with MVSNets for depth inferring.
(CVPR 2022) TransMVSNet: Global Context-aware Multi-view Stereo Network with Transformers.
OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.
[IEEE T-PAMI 2023] Awesome BEV perception research and cookbook for all level audience in autonomous diriving
The repo for "Metric3D: Towards Zero-shot Metric 3D Prediction from A Single Image" and "Metric3Dv2: A Versatile Monocular Geometric Foundation Model..."
KITTI Object Visualization (Birdview, Volumetric LiDar point cloud )
Monocular Depth Estimation Toolbox based on MMSegmentation.
Code for GeoNet: Unsupervised Learning of Dense Depth, Optical Flow and Camera Pose (CVPR 2018)
COLMAP - Structure-from-Motion and Multi-View Stereo
Segment Anything in High Quality [NeurIPS 2023]
Mask Transfiner for High-Quality Instance Segmentation, CVPR 2022
[CVPR 2024 - Oral, Best Paper Award Candidate] Marigold: Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation
Original reference implementation of "3D Gaussian Splatting for Real-Time Radiance Field Rendering"
🎨 Semantic segmentation models, datasets and losses implemented in PyTorch.