A curated list of 3D Vision papers relating to Robotics domain in the era of large models i.e. LLMs/VLMs, inspired by awesome-computer-vision, including papers, codes, and related websites

609 32 Updated Nov 4, 2024

UMass-Foundation-Model / 3D-VLA

[ICML 2024] 3D-VLA: A 3D Vision-Language-Action Generative World Model

Python 391 14 Updated Oct 29, 2024

HaoyiZhu / PointCloudMatters

[NeurIPS 2024 D&B] Point Cloud Matters: Rethinking the Impact of Different Observation Spaces on Robot Learning

Python 53 2 Updated Oct 14, 2024

PKU-HMI-Lab / LIFT3D

Lift3D Foundation Policy: Lifting 2D Large-Scale Pretrained Models for Robust 3D Robotic Manipulation

Python 66 5 Updated Dec 24, 2024

changh95 / visual-slam-roadmap

Roadmap to become a Visual-SLAM developer in 2023

1,419 144 Updated Feb 1, 2024

Taeyoung96 / SLAM-Resources-for-Beginner

Highly recommended resources for SLAM newbies (Lecture, Reviewed paper, Books, Tutorial, etc)

98 8 Updated Jul 29, 2023

urbste / ORB_SLAM3

Forked from UZ-SLAMLab/ORB_SLAM3

ORB-SLAM3: An Accurate Open-Source Library for Visual, Visual-Inertial and Multi-Map SLAM

C++ 40 13 Updated Apr 29, 2022

urbste / nanosam2

Tools to distill the Hiera transformer backbone to CNNs that are easier to deploy on the edge.

Python 3 1 Updated Dec 4, 2024

urbste / OpenImuCameraCalibrator

Camera calibration tool

C++ 240 49 Updated Nov 15, 2024

real-stanford / universal_manipulation_interface

Universal Manipulation Interface: In-The-Wild Robot Teaching Without In-The-Wild Robots

Python 745 143 Updated Dec 18, 2024

kanster / awesome-slam

A curated list of awesome SLAM tutorials, projects and communities.

1,575 378 Updated Jul 13, 2020

OpenDriveLab / Vista

[NeurIPS 2024] A Generalizable World Model for Autonomous Driving

Python 624 48 Updated Dec 12, 2024

huangwl18 / ReKep

ReKep: Spatio-Temporal Reasoning of Relational Keypoint Constraints for Robotic Manipulation

Python 596 63 Updated Aug 30, 2024

NVlabs / edm2

EDM2 and Autoguidance -- Official PyTorch implementation

Python 594 25 Updated Dec 9, 2024

facebookresearch / sapiens

High-resolution models for human tasks.

Python 4,713 269 Updated Nov 18, 2024

Computer-Vision-in-the-Wild / CVinW_Readings

A collection of papers on the topic of ``Computer Vision in the Wild (CVinW)''

1,231 58 Updated Mar 14, 2024

facebookresearch / mae

PyTorch implementation of MAE https//arxiv.org/abs/2111.06377

Python 7,475 1,229 Updated Jul 23, 2024

google-deepmind / open_x_embodiment

Jupyter Notebook 962 68 Updated Nov 27, 2024

octo-models / octo

Octo is a transformer-based robot policy trained on a diverse mix of 800k robot trajectories.

Python 988 181 Updated Jul 31, 2024

kpertsch / rlds_dataset_builder

An example RLDS dataset builder for X-embodiment dataset conversion.

Python 108 128 Updated Jul 11, 2024

DepthAnything / Depth-Anything-V2

[NeurIPS 2024] Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation

Python 4,267 370 Updated Dec 22, 2024

NVIDIA / DCGM

NVIDIA Data Center GPU Manager (DCGM) is a project for gathering telemetry and measuring the health of NVIDIA GPUs

C++ 437 59 Updated Jan 2, 2025

facebookresearch / sam2

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 13,446 1,299 Updated Dec 25, 2024

imkaywu / open3DCV

open source 3D computer vision library

C++ 72 16 Updated May 9, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Travis Davies travisddavies

Achievements