Skip to content
View yang-zhikai's full-sized avatar

Block or report yang-zhikai

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Code for Reinforcement Learning from Vision Language Foundation Model Feedback

C++ 85 12 Updated May 22, 2024

Solve Visual Understanding with Reinforced VLMs

Python 3,715 222 Updated Mar 4, 2025

Benchmarking Knowledge Transfer in Lifelong Robot Learning

Jupyter Notebook 359 60 Updated Jan 3, 2025

A very simple GRPO implement for reproducing r1-like LLM thinking.

Python 630 45 Updated Feb 28, 2025

Embodied Chain of Thought: A robotic policy that reason to solve the task.

Python 151 7 Updated Aug 29, 2024

Witness the aha moment of VLM with less than $3.

Python 2,996 239 Updated Mar 1, 2025

This is a replicate of DeepSeek-R1-Zero and DeepSeek-R1 training on small models with limited data

Python 3,030 225 Updated Feb 19, 2025

🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning

Python 9,466 1,047 Updated Mar 3, 2025

The reinforcement learning training code for AgiBot X1.

Python 1,312 425 Updated Oct 23, 2024

train llama on a single A100 80G node using 🤗 transformers and 🚀 Deepspeed Pipeline Parallelism

Python 215 20 Updated Nov 21, 2023

Emma-X: An Embodied Multimodal Action Model with Grounded Chain of Thought and Look-ahead Spatial Reasoning

Python 46 3 Updated Jan 27, 2025

本仓库是qwen2VL的微调及推理代码。

Python 8 Updated Sep 19, 2024

Official code for the paper: Embodied Multi-Modal Agent trained by an LLM from a Parallel TextWorld

Python 52 Updated Oct 4, 2024

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Jupyter Notebook 49,126 5,797 Updated Sep 18, 2024

ALFWorld: Aligning Text and Embodied Environments for Interactive Learning

Python 415 57 Updated Jan 6, 2025

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.

Python 27,792 5,722 Updated Mar 4, 2025

[ICML 2024] 3D-VLA: A 3D Vision-Language-Action Generative World Model

Python 435 16 Updated Oct 29, 2024

[IROS24 Oral]ManipVQA: Injecting Robotic Affordance and Physically Grounded Information into Multi-Modal Large Language Models

Python 84 3 Updated Aug 22, 2024

A simulation framework based on ROS2 and LLMs(like GPT) for robot interaction tasks in the era of large models

Python 116 12 Updated May 22, 2024

SAPIEN Manipulation Skill Framework, an open source GPU parallelized robotics simulator and benchmark, led by Hillbot, Inc.

Python 1,292 225 Updated Feb 28, 2025

Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 2, and other large language models.

Go 130,809 10,723 Updated Mar 4, 2025

TensorFlow for macOS 11.0+ accelerated using Apple's ML Compute framework.

Shell 3,675 310 Updated Oct 31, 2021

Bullet Physics SDK: real-time collision detection and multi-physics simulation for VR, games, visual effects, robotics, machine learning etc.

C++ 13,070 2,899 Updated Jan 29, 2025

Ravens partially ported code from Keras/Tensorflow to Pytorch.

Python 1 Updated Nov 25, 2022

Generating Robotic Simulation Tasks via Large Language Models

Python 312 24 Updated Mar 23, 2024

Train robotic agents to learn pick and place with deep learning for vision-based manipulation in PyBullet. Transporter Nets, CoRL 2020.

Python 591 99 Updated Jul 30, 2024

TCC-IRoNL is a novel framework that leverages large language models (LLMs) and multi-model vision-language models (VLMs) to enable ROS-based autonomous robots to interact with humans or other entit…

Jupyter Notebook 14 3 Updated Feb 5, 2025

Collections of robotics environments geared towards benchmarking multi-task and meta reinforcement learning

Python 1,355 286 Updated Nov 5, 2024

Evaluating and reproducing real-world robot manipulation policies (e.g., RT-1, RT-1-X, Octo) in simulation under common setups (e.g., Google Robot, WidowX+Bridge) (CoRL 2024)

Jupyter Notebook 512 71 Updated Feb 25, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 42,870 5,237 Updated Mar 3, 2025
Next