3D-VQA Public
Forked from AlexDelitzas/3D-VQA
Code of the paper "CLIP-Guided Vision-Language Pre-training for Question Answering in 3D Scenes" (CVPRW 2023)
Python Updated Mar 26, 2024
3DVL_Codebase Public
Forked from zlccccc/3DVL_Codebase
[CVPR2022 Oral] 3DJCG: A Unified Framework for Joint Dense Captioning and Visual Grounding on 3D Point Clouds
Python Other Updated Dec 19, 2023
SQA3D Public
Forked from SilongYong/SQA3D
[ICLR 2023] SQA3D for embodied scene understanding and reasoning
Python Apache License 2.0 Updated Dec 6, 2023
mcan-vqa Public
Forked from MILVLG/mcan-vqa
Deep Modular Co-Attention Networks for Visual Question Answering
Python Apache License 2.0 Updated Nov 16, 2023