Skip to content
View travisddavies's full-sized avatar
🎯
修行
🎯
修行
  • 智澄英达
  • Hangzhou, China

Block or report travisddavies

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

OpenEMMA, a permissively licensed open source reproduction of Waymo’s EMMA model.

Python 347 38 Updated Jan 3, 2025

Official repo for "VisionZip: Longer is Better but Not Necessary in Vision Language Models"

Python 206 8 Updated Dec 28, 2024

RDT-1B: a Diffusion Foundation Model for Bimanual Manipulation

Python 749 67 Updated Dec 24, 2024

Inpaint images with ControlNet

Jupyter Notebook 348 30 Updated Jun 18, 2024

Pytorch implementation of Transfusion, "Predict the Next Token and Diffuse Images with One Multi-Modal Model", from MetaAI

Python 870 34 Updated Jan 4, 2025

A curated list of 3D Vision papers relating to Robotics domain in the era of large models i.e. LLMs/VLMs, inspired by awesome-computer-vision, including papers, codes, and related websites

609 32 Updated Nov 4, 2024

[ICML 2024] 3D-VLA: A 3D Vision-Language-Action Generative World Model

Python 391 14 Updated Oct 29, 2024

[NeurIPS 2024 D&B] Point Cloud Matters: Rethinking the Impact of Different Observation Spaces on Robot Learning

Python 53 2 Updated Oct 14, 2024

Lift3D Foundation Policy: Lifting 2D Large-Scale Pretrained Models for Robust 3D Robotic Manipulation

Python 66 5 Updated Dec 24, 2024

Roadmap to become a Visual-SLAM developer in 2023

1,419 144 Updated Feb 1, 2024

Highly recommended resources for SLAM newbies (Lecture, Reviewed paper, Books, Tutorial, etc)

98 8 Updated Jul 29, 2023

ORB-SLAM3: An Accurate Open-Source Library for Visual, Visual-Inertial and Multi-Map SLAM

C++ 40 13 Updated Apr 29, 2022

Tools to distill the Hiera transformer backbone to CNNs that are easier to deploy on the edge.

Python 3 1 Updated Dec 4, 2024

Camera calibration tool

C++ 240 49 Updated Nov 15, 2024

Universal Manipulation Interface: In-The-Wild Robot Teaching Without In-The-Wild Robots

Python 745 143 Updated Dec 18, 2024

A curated list of awesome SLAM tutorials, projects and communities.

1,575 378 Updated Jul 13, 2020

[NeurIPS 2024] A Generalizable World Model for Autonomous Driving

Python 624 48 Updated Dec 12, 2024

ReKep: Spatio-Temporal Reasoning of Relational Keypoint Constraints for Robotic Manipulation

Python 596 63 Updated Aug 30, 2024

EDM2 and Autoguidance -- Official PyTorch implementation

Python 594 25 Updated Dec 9, 2024

High-resolution models for human tasks.

Python 4,713 269 Updated Nov 18, 2024

A collection of papers on the topic of ``Computer Vision in the Wild (CVinW)''

1,231 58 Updated Mar 14, 2024

PyTorch implementation of MAE https//arxiv.org/abs/2111.06377

Python 7,475 1,229 Updated Jul 23, 2024
Jupyter Notebook 962 68 Updated Nov 27, 2024

Octo is a transformer-based robot policy trained on a diverse mix of 800k robot trajectories.

Python 988 181 Updated Jul 31, 2024

An example RLDS dataset builder for X-embodiment dataset conversion.

Python 108 128 Updated Jul 11, 2024

[NeurIPS 2024] Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation

Python 4,267 370 Updated Dec 22, 2024

NVIDIA Data Center GPU Manager (DCGM) is a project for gathering telemetry and measuring the health of NVIDIA GPUs

C++ 437 59 Updated Jan 2, 2025

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 13,446 1,299 Updated Dec 25, 2024

open source 3D computer vision library

C++ 72 16 Updated May 9, 2020
Next