3DVL
Official implementation of ECCV24 paper "SceneVerse: Scaling 3D Vision-Language Learning for Grounded Scene Understanding"
Code for 3D-LLM: Injecting the 3D World into Large Language Models
Code for MultiPLY: A Multisensory Object-Centric Embodied Large Language Model in 3D World
Awesome-LLM-3D: a curated list of Multi-modal Large Language Model in 3D world Resources
HumanML3D: A large and diverse 3d human motion-language dataset.
Large Motion Model for Unified Multi-Modal Motion Generation
[NeurIPS 2023] FineMoGen: Fine-Grained Spatio-Temporal Motion Generation and Editing
[ECCV-20] 3D human scene interaction dataset: https://people.eecs.berkeley.edu/~zhecao/hmp/index.html
Official repo of our ECCV 2022 paper "GIMO: Gaze-Informed Human Motion Prediction in Context"
Official implementation of the NeurIPS22 paper "HUMANISE: Language-conditioned Human Motion Generation in 3D Scenes"
This is the official implement for Human-centric Scene Understanding in 3D Large-scale Scenarios.
[NeurIPS 2023] MotionGPT: Human Motion as a Foreign Language, a unified motion-language generation model using LLMs
Official implementation of "MoMask: Generative Masked Modeling of 3D Human Motions (CVPR2024)"
Mask3D predicts accurate 3D semantic instances achieving state-of-the-art on ScanNet, ScanNet200, S3DIS and STPLS3D.
Resolving 3D Human Pose Ambiguities with 3D Scene Constraints https://prox.is.tue.mpg.de
This is the official code for MIME: Human-Aware 3D Scene Generation (CVPR2023)
[CVPR 2024] "LL3DA: Visual Interactive Instruction Tuning for Omni-3D Understanding, Reasoning, and Planning"; an interactive Large Language 3D Assistant.
Pandora: Towards General World Model with Natural Language Actions and Video States
Code for "Chat-Scene: Bridging 3D Scene and Large Language Models with Object Identifiers" (NeurIPS 2024)