Jay-Ye
🎯 Focusing



NLM for LogiCity

Python 2 Updated May 15, 2024
HTML 2 Updated Jan 26, 2025

CoTracker is a model for tracking any point (pixel) on a video.

Jupyter Notebook 4,113 277 Updated Jan 21, 2025

Cosmos is a world model development platform that consists of world foundation models, tokenizers and video processing pipeline to accelerate the development of Physical AI at Robotics & AV labs. C…

Python 7,423 463 Updated Feb 11, 2025

A generative world for general-purpose robotics & embodied AI learning.

Python 23,725 2,023 Updated Feb 10, 2025

An updated version of Nishanth's pbrspot library, with the body frame and hand frame aligned to the real Spot services.

C++ 1 Updated Dec 11, 2024

Grounded SAM 2: Ground and Track Anything in Videos with Grounding DINO, Florence-2 and SAM 2

Jupyter Notebook 1,624 162 Updated Dec 21, 2024
Python 877 64 Updated Aug 13, 2024

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 13,977 1,415 Updated Dec 25, 2024

Official repository of "SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware Memory"

Python 6,461 403 Updated Jan 29, 2025

LogiCity@NeurIPS'24, D&B track. A multi-agent inductive learning environment for "abstractions".

Python 20 2 Updated Nov 12, 2024

No Pose, No Problem: Surprisingly Simple 3D Gaussian Splats from Sparse Unposed Images

Python 632 25 Updated Dec 7, 2024

MoGe: Unlocking Accurate Monocular Geometry Estimation for Open-Domain Images with Optimal Training Supervision

Python 725 42 Updated Dec 8, 2024

Summary of key papers and blogs about diffusion models to learn about the topic. Detailed list of all published diffusion robotics papers.

764 39 Updated Sep 20, 2024

Code for the paper "Predicting Point Tracks from Internet Videos enables Diverse Zero-Shot Manipulation"

Python 76 6 Updated Jul 31, 2024

Official repository of Learning to Act from Actionless Videos through Dense Correspondences.

Python 194 20 Updated Apr 25, 2024

This code corresponds to simulation environments used as part of the MimicGen project.

Python 372 63 Updated Jan 17, 2025

Code for Pyramidal Flow Matching for Efficient Video Generative Modeling

Python 2,759 271 Updated Dec 21, 2024

Official implementation of the video generation part of This&That: Language-Gesture Controlled Video Generation for Robot Planning (ICRA 2025)

Python 27 1 Updated Feb 8, 2025

Text- and image-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)

Python 10,578 990 Updated Jan 22, 2025

Stable Video Diffusion Training Code and Extensions.

Python 667 65 Updated Jul 25, 2024

Heterogeneous Pre-trained Transformer (HPT) as Scalable Policy Learner.

Python 453 27 Updated Dec 6, 2024

A list of awesome and popular robot learning environments

96 1 Updated Aug 17, 2024

GLOMAP - Global Structure-from-Motion Revisited

C++ 1,618 114 Updated Jan 9, 2025

Official implementation of RAM: Retrieval-Based Affordance Transfer for Generalizable Zero-Shot Robotic Manipulation

Python 74 4 Updated Dec 30, 2024

Pandora: Towards General World Model with Natural Language Actions and Video States

Python 497 35 Updated Sep 23, 2024
Python 121 22 Updated Mar 3, 2024

NeuroNCAP benchmark for end-to-end autonomous driving

Python 157 6 Updated Oct 14, 2024