KGWANG2049/apes

Reproduction: AdaPtive Experience Sampling (APES)

Original paper: https://ieeexplore.ieee.org/abstract/document/9832486?casa_token=gfMcjybRnTQAAAAA:rXl2teqGPw082-Gvvm-dPBdQwPHDywXZkqGmaXp47neKSJKcbL-PXTCzux-iGJ0foba35-YC

Based on the method of this article, we perform motion planning in two-dimensional space with the aim of generating a better sampling distribution that improves the performance of an RRTConnect planner (based on OMPL).


Background

When a planner based on RRTConnect runs, random sampling is a central component: higher-quality sampling improves planner performance, because it lets the planner find a solution path in fewer iterations. When we know the instance of the robotic arm (start point, goal point, and occupancy grid), we can transform the instance into joint space (Figure 1).

Figure1 — Instance in joint space


The green area is clearly a better region to sample, so the problem is how to find that suitable region and give it a higher sampling probability than the other regions. Given the start point, the end point, and the obstacles, APES generates such a distribution in the form of a Gaussian mixture model (GMM).
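As a minimal sketch of what such a distribution looks like, the snippet below builds a small 2-D joint-space GMM and samples from it. The means, covariance, and weights are made-up illustrative values, not ones produced by APES:

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical 2-D joint-space GMM: component means, a shared covariance,
# and mixture weights (in APES the weights come from the generator).
means = np.array([[0.5, 1.2], [1.8, 0.4], [2.4, 2.0]])
cov = 0.05 * np.eye(2)
weights = np.array([0.6, 0.1, 0.3])  # must sum to 1

def sample_gmm(n):
    """Draw n samples: pick a component by its weight, then sample from it."""
    comps = rng.choice(len(weights), size=n, p=weights)
    return np.array([rng.multivariate_normal(means[c], cov) for c in comps])

samples = sample_gmm(1000)
print(samples.shape)  # (1000, 2)
```

A planner would draw its next candidate configuration from `sample_gmm` instead of from a uniform distribution over the joint limits, so regions with larger weights are visited more often.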

Installation

Environment:

Install Ubuntu 20.04

Install miniconda

Install Nvidia-Cuda

  1. Clean up the environment:
sudo apt remove --autoremove nvidia-cuda-toolkit
sudo apt remove --autoremove nvidia-*
sudo rm -rf /usr/local/cuda*
sudo apt-get purge nvidia*
sudo apt-get update
sudo apt-get autoremove
sudo apt-get autoclean
sudo apt-key del 7fa2af80
  2. Install CUDA (the example below is for CUDA 11.7; check https://developer.nvidia.com/cuda-downloads for the current version):
wget https://developer.download.nvidia.com/compute/cuda/repos/ubuntu2004/x86_64/cuda-ubuntu2004.pin
sudo mv cuda-ubuntu2004.pin /etc/apt/preferences.d/cuda-repository-pin-600
wget https://developer.download.nvidia.com/compute/cuda/11.7.0/local_installers/cuda-repo-ubuntu2004-11-7-local_11.7.0-515.43.04-1_amd64.deb
sudo dpkg -i cuda-repo-ubuntu2004-11-7-local_11.7.0-515.43.04-1_amd64.deb
sudo cp /var/cuda-repo-ubuntu2004-11-7-local/cuda-*-keyring.gpg /usr/share/keyrings/
sudo apt-get update
sudo apt-get -y install cuda
  3. Reboot

Setup PyTorch environment

  1. Create a virtual environment for PyTorch using conda (remove the old environment first if needed):
conda remove --name pytorch_env --all
conda create -n pytorch_env -c pytorch pytorch=1.11 torchvision python=3.9
  2. Activate the virtual environment:
conda activate pytorch_env
  3. Install torch_geometric:
conda install pyg -c pyg # -c for --channel
  4. Test the installation in Python:
python
from torch_geometric.data import Data
  5. Install stable_baselines3 1.6.1:
https://spinningup.openai.com/en/latest/user/installation.html
  6. Install OMPL:
https://ompl.kavrakilab.org/installation.html

Structure

Figure 2 shows the overall structure of APES, which has five important working parts.

Figure2


  1. The generator (Fig. 3) accepts an instance and outputs fifty weights: the coefficients Wi.
Figure3


  2. The coefficients Wi are assigned as weights to the 50 pre-prepared paths to form the GMM (Fig. 4).
Figure4


  3. The GMM is passed to the planner, which samples based on it. When the planner successfully plans a path, the total number of planner iterations is defined as the planner's performance (called the value in the paper), and the planner outputs this value Vi to the critic.
  4. Through learning, the critic estimates the value Vi and outputs the value estimate V^ (Fig. 5).
Figure5


  5. The critic passes the gradient of the value estimate V^ to the generator; through this gradient information, the generator can be optimized to output better coefficients and thereby a better GMM (Fig. 6).
Figure6

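The five steps above can be sketched as a training loop. The network shapes and the fake_planner_value placeholder below are illustrative stand-ins, not the repo's actual API in model.py or train.py:

```python
import torch
import torch.nn as nn

N_PATHS = 50   # number of prepared paths / GMM components
INST_DIM = 8   # illustrative size of the encoded instance

# Illustrative stand-ins for the real networks defined in model.py.
generator = nn.Sequential(nn.Linear(INST_DIM, 64), nn.ReLU(), nn.Linear(64, N_PATHS))
critic = nn.Sequential(nn.Linear(INST_DIM + N_PATHS, 64), nn.ReLU(), nn.Linear(64, 1))
g_opt = torch.optim.Adam(generator.parameters(), lr=3e-4)
c_opt = torch.optim.Adam(critic.parameters(), lr=3e-4)

def fake_planner_value(weights):
    # Placeholder for running RRTConnect with the GMM; in reality this is
    # the iteration count the planner needed (lower is better).
    with torch.no_grad():
        return weights.var(dim=-1, keepdim=True) * 100 + 50

for step in range(5):
    instance = torch.randn(16, INST_DIM)                  # encoded (start, goal, grid)
    weights = torch.softmax(generator(instance), dim=-1)  # GMM coefficients W_i

    # Steps 3-4: the critic regresses the planner's value V_i.
    value = fake_planner_value(weights)
    v_hat = critic(torch.cat([instance, weights.detach()], dim=-1))
    c_loss = nn.functional.mse_loss(v_hat, value)
    c_opt.zero_grad()
    c_loss.backward()
    c_opt.step()

    # Step 5: the generator descends the critic's value estimate V^.
    g_loss = critic(torch.cat([instance, weights], dim=-1)).mean()
    g_opt.zero_grad()
    g_loss.backward()
    g_opt.step()

print(weights.shape)  # torch.Size([16, 50])
```

The key design point is that the planner itself is not differentiable, so the generator never sees the planner's value directly; it only receives gradients through the critic's estimate V^.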

Usage

We need planning.py and utilities.py. Thanks to my supervisor Xi-Huang for the guidance and for the functions available in planning and utilities.

Use train.py to train the generator and critic neural networks, which are instantiated in model.py.
The following hyperparameter experience may help:

  1. After initialization, the entropy of the GMM is about -144.6, so the target entropy should be smaller than that (recommended -180 to -230).
  2. A larger batch size leads to more stable convergence of the generator (the GMM entropy changes more slowly); the number of epochs can be increased accordingly.
  3. If point 2 leads to excessive computation time, the entropy regularization factor log_alpha can be reduced (e.g., use -12 instead of -8).
  4. A learning rate of 1e-4 to 5e-4 is appropriate.
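To make the entropy numbers above concrete, the sketch below estimates the entropy of a toy GMM by Monte Carlo and forms a SAC-style regularization term from log_alpha and the target entropy. The exact loss form in train.py may differ, and the toy model's entropy is far from -144.6 because it has only 3 components in 2-D:

```python
import numpy as np

rng = np.random.default_rng(1)

# Toy isotropic GMM (3 components instead of APES's 50, 2-D joint space).
means = rng.uniform(0.0, 3.0, size=(3, 2))
var = 0.05                               # per-component isotropic variance
weights = np.array([0.5, 0.3, 0.2])

def log_prob(x):
    """Log-density of the GMM at points x with shape (n, 2)."""
    d2 = ((x[:, None, :] - means[None]) ** 2).sum(-1)     # squared distances (n, k)
    log_comp = -d2 / (2 * var) - np.log(2 * np.pi * var)  # per-component log N_k
    return np.log((weights * np.exp(log_comp)).sum(-1))

# Monte Carlo estimate of the entropy H = -E[log p(x)] under p itself.
comps = rng.choice(len(weights), size=5000, p=weights)
x = means[comps] + rng.normal(scale=np.sqrt(var), size=(5000, 2))
entropy = -log_prob(x).mean()

# Regularization term built from the notes' hyperparameters: it nudges the
# entropy toward target_entropy with strength exp(log_alpha).
log_alpha, target_entropy = -8.0, -180.0
reg = np.exp(log_alpha) * (target_entropy - entropy)
print(round(float(entropy), 2))
```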

gmm_rrtc_sampler.py and rand_rrtc_planner.py provide planners that sample from the Gaussian mixture model and from random.uniform, respectively; both use the RRTConnect method. Running compartor.py shows the performance comparison of the two planners on a test set.
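The comparison that compartor.py performs can be sketched as follows; the iteration counts here are hypothetical placeholders for what the two OMPL-based planners would actually return on a test set:

```python
import statistics

# Hypothetical per-instance iteration counts for the same test set; in
# practice these numbers come from the two OMPL-based planners.
gmm_iters = [120, 95, 210, 88, 150]        # GMM-guided RRTConnect
uniform_iters = [340, 280, 505, 190, 420]  # uniform-sampling RRTConnect

def summarize(name, iters):
    return f"{name}: mean={statistics.mean(iters):.1f}, median={statistics.median(iters)}"

print(summarize("gmm_rrtc_sampler", gmm_iters))
print(summarize("rand_rrtc_planner", uniform_iters))
```

Fewer iterations on the same instances indicates a better sampling distribution.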

Conclusion

To visualize which regions have a higher sampling probability, I randomly sampled 2000 points from the GMM generated by APES. The density of blue points is higher in the green region, which shows that the sampling probability there is higher than in the other regions, i.e., APES produced the GMM we wanted. In APES, the critic transfers the gradient information of the planner's value to the generator, so the generator can learn effectively and generate an effective GMM distribution.

Result
