ParaGon: Differentiable Parsing and Visual Grounding of Human Language Instructions for Object Placement
This repository contains the pytorch implementation of the paper: Differentiable Parsing and Visual Grounding of Human Language Instructions for Object Placement.
Build docker container
python scripts/docker_build.py
Run docker container
python scripts/docker_run.py
bash scripts/data.sh
bash scripts/train.sh
bash scripts/eval.sh
bash scripts/pretrain_model.sh
If you find this work useful in your research, please cite:
@InProceedings{zhao2023paragon,
author = {Zhao, Zirui and Lee, Wee Sun and Hsu, David},
title = {Differentiable Parsing and Visual Grounding of Human Language Instructions for Object Placement},
booktitle = {Proceedings of the IEEE International Conference on Robotics and Automation},
year = {2023}
}