This is the official implementation of the EMNLP Findings paper, VideoINSTA: Zero-shot Long-Form Video Understanding via Informative Spatial-Temporal Reasoning

Jupyter Notebook 18 Updated Nov 15, 2024

Jianghanxiao / RoboEXP

[CoRL 2024] RoboEXP: Action-Conditioned Scene Graph via Interactive Exploration for Robotic Manipulation

Python 95 5 Updated Oct 8, 2024

naokiyokoyama / ovon

Open Vocabulary Object Navigation

Python 70 7 Updated Feb 22, 2025

cshizhe / onav_rim

Python 37 2 Updated Sep 30, 2023

Ram81 / habitat-imitation-baselines

Code for training embodied agents using imitation learning at scale in Habitat-Lab

Python 39 7 Updated Apr 13, 2025

haosulab / ManiSkill

SAPIEN Manipulation Skill Framework, an open source GPU parallelized robotics simulator and benchmark, led by Hillbot, Inc.

Python 1,487 257 Updated Apr 12, 2025

Junggy / SCRREAM

Python 15 Updated Dec 30, 2024

javokhirajabov / quadwbg

QuadWBG: Generalizable Quadrupedal Whole-Body Grasping

21 Updated Nov 8, 2024

oliver-lemke / spot-compose

A Framework for Open-Vocabulary Object Retrieval and Drawer Manipulation in Point Clouds

Python 26 7 Updated Jan 19, 2025

video-to-action / video-to-action-release

Code for paper "Grounding Video Models to Actions through Goal Conditioned Exploration".

Python 44 3 Updated Dec 27, 2024

ymxlzgy / echoscene

[ECCV 2024] EchoScene: Indoor Scene Generation via Information Echo over Scene Graph Diffusion.

Python 92 5 Updated Jun 3, 2024

lorenmt / clarity-template

Clarity: A Minimalist Website Template for AI Research

CSS 110 14 Updated Jan 12, 2025

maitrix-org / Pandora

Pandora: Towards General World Model with Natural Language Actions and Video States

Python 502 35 Updated Sep 23, 2024

alshedivat / al-folio

A beautiful, simple, clean, and responsive Jekyll theme for academics

HTML 12,748 11,730 Updated Apr 14, 2025

UMass-Embodied-AGI / MultiPLY

Code for MultiPLY: A Multisensory Object-Centric Embodied Large Language Model in 3D World

Python 128 6 Updated Oct 24, 2024

yilundu / ired_code_release

Python 59 6 Updated Jun 14, 2024

OpenMeshLab / MeshXL

[NeurIPS 2024] MeshXL: Neural Coordinate Field for Generative 3D Foundation Models, a 3D fundamental model for mesh generation

Python 311 11 Updated Apr 4, 2025
