The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Jupyter Notebook 49,126 5,797 Updated Sep 18, 2024

alfworld / alfworld

ALFWorld: Aligning Text and Embodied Environments for Interactive Learning

Python 415 57 Updated Jan 6, 2025

huggingface / diffusers

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.

Python 27,792 5,722 Updated Mar 4, 2025

UMass-Embodied-AGI / 3D-VLA

[ICML 2024] 3D-VLA: A 3D Vision-Language-Action Generative World Model

Python 435 16 Updated Oct 29, 2024

SiyuanHuang95 / ManipVQA

[IROS24 Oral]ManipVQA: Injecting Robotic Affordance and Physically Grounded Information into Multi-Modal Large Language Models

Python 84 3 Updated Aug 22, 2024

NoneJou072 / robochain

A simulation framework based on ROS2 and LLMs(like GPT) for robot interaction tasks in the era of large models

Python 116 12 Updated May 22, 2024

haosulab / ManiSkill

SAPIEN Manipulation Skill Framework, an open source GPU parallelized robotics simulator and benchmark, led by Hillbot, Inc.

Python 1,292 225 Updated Feb 28, 2025

ollama / ollama

Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 2, and other large language models.

Go 130,809 10,723 Updated Mar 4, 2025

apple / tensorflow_macos

TensorFlow for macOS 11.0+ accelerated using Apple's ML Compute framework.

Shell 3,675 310 Updated Oct 31, 2021

bulletphysics / bullet3

Bullet Physics SDK: real-time collision detection and multi-physics simulation for VR, games, visual effects, robotics, machine learning etc.

C++ 13,070 2,899 Updated Jan 29, 2025

Khoronus / ravens_pytorch

Ravens partially ported code from Keras/Tensorflow to Pytorch.

Python 1 Updated Nov 25, 2022

liruiw / GenSim

Generating Robotic Simulation Tasks via Large Language Models

Python 312 24 Updated Mar 23, 2024

google-research / ravens

Train robotic agents to learn pick and place with deep learning for vision-based manipulation in PyBullet. Transporter Nets, CoRL 2020.

Python 591 99 Updated Jul 30, 2024

LinusNEP / TCC-IRoNL

TCC-IRoNL is a novel framework that leverages large language models (LLMs) and multi-model vision-language models (VLMs) to enable ROS-based autonomous robots to interact with humans or other entit…

Jupyter Notebook 14 3 Updated Feb 5, 2025

Farama-Foundation / Metaworld

Collections of robotics environments geared towards benchmarking multi-task and meta reinforcement learning

Python 1,355 286 Updated Nov 5, 2024

simpler-env / SimplerEnv

Evaluating and reproducing real-world robot manipulation policies (e.g., RT-1, RT-1-X, Octo) in simulation under common setups (e.g., Google Robot, WidowX+Bridge) (CoRL 2024)

Jupyter Notebook 512 71 Updated Feb 25, 2025

hiyouga / LLaMA-Factory

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 42,870 5,237 Updated Mar 3, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Russal yang-zhikai

Block or report yang-zhikai

Starred repositories

yufeiwang63 / RL-VLM-F

om-ai-lab / VLM-R1

Lifelong-Robot-Learning / LIBERO

lsdefine / simple_GRPO

MichalZawalski / embodied-CoT

Deep-Agent / R1-V

hkust-nlp / simpleRL-reason

huggingface / lerobot

AgibotTech / agibot_x1_train

HuangLK / transpeeder

declare-lab / Emma-X

Xinji-Mai / qwen2VLQuickStart

stevenyangyj / Emma-Alfworld

facebookresearch / segment-anything