Stars
Generic Keyboard Teleop for ROS
SPOC: Imitating Shortest Paths in Simulation Enables Effective Navigation and Manipulation in the Real World
PoliFormer: Scaling On-Policy RL with Transformers Results in Masterful Navigators
Official code and checkpoint release for mobile robot foundation models: GNM, ViNT, and NoMaD.
openvla / openvla
Forked from TRI-ML/prismatic-vlms. OpenVLA: An open-source vision-language-action model for robotic manipulation.
This repository provides the code associated with the paper "VLFM: Vision-Language Frontier Maps for Zero-Shot Semantic Navigation" (ICRA 2024)
Democratization of RT-2: "RT-2: New model translates vision and language into action"
Awesome-LLM-3D: a curated list of Multi-modal Large Language Model in 3D world Resources
Dobb·E: An open-source, general framework for learning household robotic manipulation
Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
✨✨Latest Advances on Multimodal Large Language Models
AIGC-interview/CV-interview/LLMs-interview: a collection of interview questions and answers, plus new ideas, questions, resources, and projects arising from work and research
This repository compiles papers on the application of video technology in robotics! Star⭐ the repo and follow me if you like what you see🤩.
A large-scale benchmark and learning environment.
ALFRED - A Benchmark for Interpreting Grounded Instructions for Everyday Tasks
Simple and easily configurable grid world environments for reinforcement learning
[ICML 2024] Official code repository for 3D embodied generalist agent LEO
Code for the Paper M4U: Evaluating Multilingual Understanding and Reasoning for Large Multimodal Models.
AI2-THOR Data Collection Tool Based On Keyboard Interaction
[NeurIPS 2023] We use large language models as a commonsense world model and heuristic policy within Monte Carlo Tree Search, enabling better-reasoned decision-making for daily task planning problems.
A curated list for vision-and-language navigation. ACL 2022 paper "Vision-and-Language Navigation: A Survey of Tasks, Methods, and Future Directions"
A generative world for general-purpose robotics & embodied AI learning.
Code for MultiPLY: A Multisensory Object-Centric Embodied Large Language Model in 3D World
Official implementation for "A Simple LLM Framework for Long-Range Video Question-Answering"