Skip to content
View Chen-yanyi's full-sized avatar

Block or report Chen-yanyi

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

[ICRA2025]: MVCTrack: Boosting 3D Point Cloud Tracking via Multimodal-Guided Virtual Cues

Python 8 Updated Mar 18, 2025

PyTorch code and models for the DINOv2 self-supervised learning method.

Jupyter Notebook 10,054 910 Updated Aug 7, 2024

Generating Robotic Simulation Tasks via Large Language Models

Python 317 24 Updated Mar 23, 2024

Code and data for Vitruvion: A Generative Model of Parametric CAD Sketches (ICLR 2022)

Jupyter Notebook 46 17 Updated Sep 28, 2022

Fine-Tuning Vision-Language-Action Models: Optimizing Speed and Success

Python 208 7 Updated Mar 9, 2025

Augment robotics demonstration datasets with different robots and viewpoints

Python 20 1 Updated Feb 27, 2025
Jupyter Notebook 86 3 Updated Nov 10, 2024

The Large-scale Manipulation Platform for Scalable and Intelligent Embodied Systems

Python 1,816 116 Updated Mar 20, 2025

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 21,933 2,405 Updated Aug 12, 2024

The official repo for "SpatialBot: Precise Spatial Understanding with Vision Language Models.

Python 223 14 Updated Jan 29, 2025
Python 36 Updated May 13, 2024

Grounded SAM 2: Ground and Track Anything in Videos with Grounding DINO, Florence-2 and SAM 2

Jupyter Notebook 1,885 181 Updated Dec 21, 2024

[ICML 2023] Contrast with Reconstruct: Contrastive 3D Representation Learning Guided by Generative Pretraining

Python 143 13 Updated Jul 21, 2024

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

Jupyter Notebook 28,029 3,506 Updated Jul 23, 2024

SoFar: Language-Grounded Orientation Bridges Spatial Reasoning and Object Manipulation

Python 115 7 Updated Mar 23, 2025

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python 17,850 1,790 Updated Mar 21, 2025

DROID Policy Learning and Evaluation

Python 175 14 Updated Dec 21, 2024

Official Implementation of the CrossMAE paper: Rethinking Patch Dependence for Masked Autoencoders

Python 103 6 Updated Dec 14, 2024

Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"

Python 11,567 730 Updated Dec 17, 2024

This note presents in a technical though hopefully pedagogical way the three most common forms of neural network architectures: Feedforward, Convolutional and Recurrent.

TeX 1,388 107 Updated Oct 9, 2019

Tools to Design or Visualize Architecture of Neural Network

4,735 598 Updated Jan 28, 2024

Code examples in pyTorch and Tensorflow for CS230

Python 4,018 1,005 Updated Mar 24, 2023

we want to create a repo to illustrate usage of transformers in chinese

Shell 2,745 464 Updated Aug 18, 2024

The official repo for the paper "In-Context Imitation Learning via Next-Token Prediction"

Jupyter Notebook 69 5 Updated Mar 17, 2025

Official implementation of Think Global, Act Local: Dual-scale GraphTransformer for Vision-and-Language Navigation (CVPR'22 Oral).

Python 158 11 Updated Jun 27, 2023

Popular J2ME application for GPS navigation in mobile phone

Java 20 2 Updated Feb 6, 2024
Python 198 22 Updated Mar 17, 2025
Python 37 6 Updated Dec 27, 2022
Next