One way is to set up the environment with Docker; see the Docker setup instructions. Another way is to use the following commands.
1. Set up the python environment:
```
conda create -n pvnet python=3.7
conda activate pvnet

# install torch 1.1 built from cuda 9.0
pip install torch==1.1.0 -f https://download.pytorch.org/whl/cu90/stable

pip install Cython==0.28.2
sudo apt-get install libglfw3-dev libglfw3
pip install -r requirements.txt
```
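Optionally, a quick sanity check (not part of the original instructions) that the CUDA-enabled torch build is visible from the environment:

```python
import torch

print(torch.__version__)          # expect 1.1.0
print(torch.cuda.is_available())  # expect True with the cu90 build and a working driver
```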
- Compile cuda extensions under `lib/csrc`:
```
ROOT=/path/to/clean-pvnet
cd $ROOT/lib/csrc
export CUDA_HOME="/usr/local/cuda-9.0"
cd ransac_voting
python setup.py build_ext --inplace
cd ../nn
python setup.py build_ext --inplace
cd ../fps
python setup.py build_ext --inplace

# If you want to run PVNet with a detector
cd ../dcn_v2
python setup.py build_ext --inplace

# If you want to use the uncertainty-driven PnP
cd ../uncertainty_pnp
sudo apt-get install libgoogle-glog-dev
sudo apt-get install libsuitesparse-dev
sudo apt-get install libatlas-base-dev
python setup.py build_ext --inplace
```
- Set up datasets:
```
ROOT=/path/to/clean-pvnet
cd $ROOT/data
ln -s /path/to/linemod linemod
ln -s /path/to/linemod_orig linemod_orig
ln -s /path/to/occlusion_linemod occlusion_linemod

# the following is used for tless
ln -s /path/to/tless tless
ln -s /path/to/cache cache
ln -s /path/to/SUN2012pascalformat sun
```
The training parameters can be found in `project_structure.md`.
- Create a synthetic dataset with mask and pose annotations using the BlenderProc2 project (another deliverable, named `syn_data_generation_pipeline`). For your convenience, three training datasets and three testing datasets, one for each category (mainshell, topshell, and insert_mold), are already included in the `data/FIT` folder. Each training dataset consists of 20,000 images and each testing dataset consists of 2,000 images.
- Organize the dataset as the following structure (a minimal writing sketch follows the tree):
```
├── /path/to/dataset
│   ├── model.ply        # copy the corresponding CAD model from the existing datasets to here
│   ├── rgb/
│   │   ├── 0.jpg
│   │   ├── ...
│   │   ├── 1234.jpg
│   │   ├── ...
│   ├── mask/
│   │   ├── 0.png
│   │   ├── ...
│   │   ├── 1234.png
│   │   ├── ...
│   ├── pose/
│   │   ├── pose0.npy
│   │   ├── ...
│   │   ├── pose1234.npy
│   │   ├── ...
│   │   └──
│   ├── k/
│   │   ├── k0.npy
│   │   ├── ...
│   │   ├── k1234.npy
│   │   ├── ...
│   │   └──
```
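For reference, below is a minimal sketch of writing one sample into this layout. The exact contents of the `.npy` files are assumptions based on the folder names: `pose{i}.npy` is assumed to hold the object-to-camera pose matrix and `k{i}.npy` the 3x3 camera intrinsics.

```python
import os
import numpy as np
import cv2  # assumes OpenCV is installed for image I/O


def write_sample(root, idx, rgb, mask, pose, K):
    """Write one synthetic sample into the dataset layout shown above.

    rgb:  HxWx3 uint8 color image
    mask: HxW uint8 binary object mask
    pose: object-to-camera pose matrix (assumed 3x4 or 4x4)
    K:    3x3 camera intrinsic matrix
    """
    for sub in ("rgb", "mask", "pose", "k"):
        os.makedirs(os.path.join(root, sub), exist_ok=True)

    cv2.imwrite(os.path.join(root, "rgb", "{}.jpg".format(idx)), rgb)
    cv2.imwrite(os.path.join(root, "mask", "{}.png".format(idx)), mask)
    np.save(os.path.join(root, "pose", "pose{}.npy".format(idx)), pose)
    np.save(os.path.join(root, "k", "k{}.npy".format(idx)), K)
```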
- Create a soft link pointing to the training and testing datasets:
```
ln -s /path/to/dataset data/category
```
If you use a different naming convention (not `**_train` or `**_test`), remember to modify the customized soft path for your customized dataset accordingly. For example, for `insert_mold`:
```
ln -s /pvnet/data/FIT/insert_mold_train data/insert_mold_train
ln -s /pvnet/data/FIT/insert_mold_test data/insert_mold_test
```
- Copy the corresponding CAD model, exported as `model.ply`, to the root folder of both the training and testing datasets. Then run the following two commands, which create `train.json` and `fps.txt` for the training and testing datasets respectively (a quick check of the generated files is sketched after the commands):
```
python run.py --type insert_mold
python run.py --type insert_mold_test
```
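The paths below are an example for the `insert_mold` training dataset; that `train.json` holds the annotations and `fps.txt` the sampled 3D keypoints is an assumption based on the file names. A quick check might look like:

```python
import json
import numpy as np

# Example paths (adjust to your dataset root).
with open("data/insert_mold_train/train.json") as f:
    anns = json.load(f)
print("train.json entries/keys:", list(anns)[:5])

# fps.txt is assumed to be a plain-text array of farthest-point-sampled keypoints.
kpts = np.loadtxt("data/insert_mold_train/fps.txt")
print("fps.txt shape:", kpts.shape)
```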
- Visualize the pose ground truth of the training dataset:
```
python run.py --type visualize_gt --cfg_file configs/insert_mold.yaml
```
- Modify the training settings (e.g. epochs, learning rate) in `insert_mold.yaml`, and run the command:
```
python train_net.py --cfg_file configs/insert_mold.yaml train.batch_size 32
```
- Watch the training curve:
```
tensorboard --logdir data/record/pvnet
```
- Visualize:
```
python run.py --type visualize --cfg_file configs/insert_mold.yaml
python run.py --type visualize_train --cfg_file configs/insert_mold.yaml
```
- Test:
```
python run.py --type evaluate --cfg_file configs/insert_mold.yaml test.un_pnp True
```
- Once the PVNet models have been trained, Mask R-CNN and PVNet are used to estimate the pose of each part. To perform picking with the robot, we need to calibrate the camera in the robot's base frame. To get `T_camera_in_base`, do the following:
  - In the `ur5e_collab_ws`, follow the instructions in the readme for extrinsic calibration.
  - Save `T_camera_in_base` as a 4x4 matrix in this directory, e.g. `${current directory}/T_camera_in_base.npy` (a minimal save/load sketch follows).
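For reference, a minimal sketch of saving and reloading the calibration result with NumPy; the identity matrix here is only a placeholder for the real extrinsic calibration:

```python
import numpy as np

# Placeholder: replace with the 4x4 homogeneous transform obtained from the
# extrinsic calibration (camera pose expressed in the robot base frame).
T_camera_in_base = np.eye(4)

np.save("T_camera_in_base.npy", T_camera_in_base)

T = np.load("T_camera_in_base.npy")
assert T.shape == (4, 4)
```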
- Our current pipeline assumes that the parts are lying on this AprilTag tagboard. Therefore, the pose of the tagboard in the camera frame is needed:
  - Take a picture of the tagboard in the workspace.
  - Open `Estimate_T_tagboard_in_camera.ipynb`, update the `IMG_PATH`, and run the notebook.
  - This will produce `T_tagboard_in_cam.npy` in the pvnet repo (see the transform-chaining sketch below).
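With both calibration files available, the tagboard pose in the robot base frame follows by chaining the two transforms. The sketch below only illustrates how the matrices relate; the pipeline scripts load the `.npy` files themselves, and the tagboard file name may differ (`T_tagboard_in_cam.npy` vs. `T_tagboard_in_camera.npy` in the provided dataset).

```python
import numpy as np

# Both files are assumed to store 4x4 homogeneous transforms.
T_camera_in_base = np.load("T_camera_in_base.npy")
T_tagboard_in_cam = np.load("T_tagboard_in_cam.npy")

# Chain the transforms: base <- camera <- tagboard.
T_tagboard_in_base = T_camera_in_base @ T_tagboard_in_cam
print(T_tagboard_in_base)
```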
The command below uses a provided dataset in `data/11_23_image_dataset`, which contains a `T_camera_in_base.npy` and a `T_tagboard_in_camera.npy`:
```
python cobot_pvnet.py --type inference
```
The command below can be used in conjunction with the vision-based picking instructions in the `ur5e_collab_ws` repo's readme:
```
source devel/setup.bash  # Run this EVERY time you start the container
python cobot_pipeline.py --type inference
```
If you find this code useful for your research, please use the following BibTeX entry.
```
@inproceedings{peng2019pvnet,
  title={PVNet: Pixel-wise Voting Network for 6DoF Pose Estimation},
  author={Peng, Sida and Liu, Yuan and Huang, Qixing and Zhou, Xiaowei and Bao, Hujun},
  booktitle={CVPR},
  year={2019}
}
```
This work is affiliated with ZJU-SenseTime Joint Lab of 3D Vision, and its intellectual property belongs to SenseTime Group Ltd.
Copyright (c) ZJU-SenseTime Joint Lab of 3D Vision. All Rights Reserved.
Licensed under the Apache License, Version 2.0 (the "License");
you may not use this file except in compliance with the License.
You may obtain a copy of the License at
http://www.apache.org/licenses/LICENSE-2.0
Unless required by applicable law or agreed to in writing, software
distributed under the License is distributed on an "AS IS" BASIS,
WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
See the License for the specific language governing permissions and
limitations under the License.