Stars
DelinQu / SimplerEnv-OpenVLA
Forked from simpler-env/SimplerEnv. Evaluating and reproducing real-world robot manipulation policies (e.g., RT-1, RT-1-X, Octo, and OpenVLA) in simulation under common setups (e.g., Google Robot, WidowX+Bridge)
A Telegram bot to recommend arXiv papers
SEED-Voken: A Series of Powerful Visual Tokenizers
Open Source Implementation of Dual Modality MAGVIT2 Tokenizer
A family of versatile and state-of-the-art video tokenizers.
This repo contains the code for the 1D tokenizer and generator
“FlowAR: Scale-wise Autoregressive Image Generation Meets Flow Matching.” FlowAR employs a simple scale design and is compatible with any VAE.
Code for FLIP: Flow-Centric Generative Planning for General-Purpose Manipulation Tasks
Stanford-ILIAD / openvla-mini
Forked from openvla/openvla. OpenVLA: An open-source vision-language-action model for robotic manipulation.
An official code repository for the paper "Predictive Inverse Dynamics Models are Scalable Learners for Robotic Manipulation"
Train transformer language models with reinforcement learning.
A Foundational Vision-Language-Action Model for Synergizing Cognition and Action in Robotic Manipulation
[NeurIPS 2024] CV-VAE: A Compatible Video VAE for Latent Generative Video Models
An elegant PyTorch deep reinforcement learning library.
A collection of papers on world models for autonomous driving.
Latent Motion Token as the Bridging Language for Robot Manipulation
A simple testbed for robotics manipulation policies
This is the official implementation of our ICML 2024 paper "MultiMax: Sparse and Multi-Modal Attention Learning"
[NeurIPS'24 Oral] HydraLoRA: An Asymmetric LoRA Architecture for Efficient Fine-Tuning
Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more. (A minimal Grad-CAM usage sketch follows this list.)
[NeurIPS 2024] Code for the paper "Diffusion of Thoughts: Chain-of-Thought Reasoning in Diffusion Language Models"
openvla / openvla
Forked from TRI-ML/prismatic-vlms. OpenVLA: An open-source vision-language-action model for robotic manipulation.
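
The explainability entry above refers to the pytorch-grad-cam project. Below is a minimal sketch of how a Grad-CAM heatmap is typically computed with that package; the ResNet-50 backbone, the chosen target layer, and ImageNet class index 281 are illustrative assumptions, not details taken from the list above.

# Minimal Grad-CAM sketch (assumes the PyPI package `grad-cam` is installed).
# Model, target layer, and class index are illustrative assumptions.
import torch
from torchvision.models import resnet50, ResNet50_Weights
from pytorch_grad_cam import GradCAM
from pytorch_grad_cam.utils.model_targets import ClassifierOutputTarget

model = resnet50(weights=ResNet50_Weights.DEFAULT).eval()
target_layers = [model.layer4[-1]]          # last residual block: a common CAM target
input_tensor = torch.randn(1, 3, 224, 224)  # placeholder input; use a normalized image in practice

cam = GradCAM(model=model, target_layers=target_layers)
# Heatmap for ImageNet class 281 ("tabby cat"); the result has shape (batch, H, W)
grayscale_cam = cam(input_tensor=input_tensor, targets=[ClassifierOutputTarget(281)])
print(grayscale_cam.shape)  # (1, 224, 224)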