Skip to content
View YanyuanQiao's full-sized avatar

Block or report YanyuanQiao

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

OpenVLA: An open-source vision-language-action model for robotic manipulation.

Python 2,316 296 Updated Mar 4, 2025

[TMLR 2024] repository for VLN with foundation models

78 3 Updated Mar 21, 2025

[RSS 2024] NaVid: Video-based VLM Plans the Next Step for Vision-and-Language Navigation

Python 110 6 Updated Mar 14, 2025

Vision-Language Navigation Benchmark in Isaac Lab

Python 114 7 Updated Dec 20, 2024

This is the first released survey paper on hallucinations of large vision-language models (LVLMs). To keep track of this field and continuously update our survey, we maintain this repository of rel…

63 4 Updated Jul 26, 2024

The official code repository for PRMBench.

Jupyter Notebook 68 5 Updated Feb 15, 2025

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 40,256 6,622 Updated Dec 9, 2024

A generative world for general-purpose robotics & embodied AI learning.

Python 24,496 2,140 Updated Mar 23, 2025

[ACL 24] The official implementation of MapGPT: Map-Guided Prompting with Adaptive Path Planning for Vision-and-Language Navigation.

Python 69 8 Updated Oct 22, 2024

A library for generative social simulation

Python 818 177 Updated Mar 21, 2025

[ECCV 2024] Official PyTorch implementation code for realizing the technical part of Mixture of All Intelligence (MoAI) to improve performance of numerous zero-shot vision language tasks.

Python 319 34 Updated Mar 28, 2024
Python 34 2 Updated Sep 30, 2023

Official GitHub Repository for Paper "Bridging Zero-shot Object Navigation and Foundation Models through Pixel-Guided Navigation Skill", ICRA 2024

Python 89 6 Updated Oct 30, 2024

Official code and checkpoint release for mobile robot foundation models: GNM, ViNT, and NoMaD.

Python 768 99 Updated Sep 15, 2024

Official implementation of Sim-to-Real Transfer via 3D Feature Fields for Vision-and-Language Navigation (CoRL'24).

Python 57 2 Updated Mar 6, 2025

RoboCasa: Large-Scale Simulation of Everyday Tasks for Generalist Robots

Python 689 67 Updated Mar 22, 2025

[RSS 2024] 3D Diffusion Policy: Generalizable Visuomotor Policy Learning via Simple 3D Representations

Python 755 77 Updated Mar 22, 2025

[ICCV 2023 Oral]: Scaling Data Generation in Vision-and-Language Navigation

Python 170 5 Updated Oct 8, 2024