Paper (Accepted by IEEE RAL 2024)
Fig.1. An illustration of online time-informed kinodynamic motion planning of nonlinear control systems. (a) A Deep Invertible Koopman operator with control U model, DIKU, is trained offline for the nonlinear systems to obtain equivalent linear systems that enable forward and backward dynamics prediction in the lifted space. (b) Our algorithm randomly samples states in the start and goal sets. It then uses the ASKU to bidirectionally propagate the learned linear dynamics and then recover, generating the forward reachable set
Fig. 2. An overview of deep invertible Koopman operator with control (DIKU) neural network model for long-horizon forward and backward dynamics prediction
Fig. 3. Comparison of forward and backward dynamics prediction by our DIKU and the DKU with consistency loss
Computation times of the Forward/Backward reachable tubes for the example 2D system in [1]
Level Set | Ellipsoidal Toolbox | RHJB [1] | Ours | Ours (AS) | |
---|---|---|---|---|---|
Forward Reachable Tube | 47.46 | 51.13 | 22.17 | 0.03 | 0.16 |
Backward Reachable Tube | 5.71 | 6.07 | 23.31 | 0.03 | 0.16 |
Fig. 4. Comparison results of the example 2D system [1] (left) forward reachable sets and tube and (right) the backward counterparts of TIS computed by the level set toolbox (Ground truth, black), ellipsoidal toolbox (ET, cyan), relaxed HJB equation method [1] (RHJB, purple), our basic sampling-based convex approximation (Ours, red), and our adversarial sampling for over-approximation, ASKU, (Ours(AS), green). Our method provides inner-approximated and tight over-approximated TISs online.
Fig. 5 Planning solutions (black lines) of the (a) 2D-L, (b) 3D-PNL, (c) CartPole, (d) DampingPendulum, (e) Two-link acrobot, and (f) planar quadrotor systems facing obstacles (blue boxes) given a start (green point) and goal (red pentagram) states. For (c)-(e), both the workspace and state-space trajectories are shown for each method. The orange points are samples. It can be seen that our online time-informed SKMP has a restricted search domain, indicating our ASKU is generalizable and scalable to nonlinear systems and benefits heuristic sampling. The planning efficiency is then improved.
pytorch, gym, jupyter, scipy, matplotlib, numpy, cvxpy, dill, tensorboard, pybullet, sklearn
cd Deepkoopman/train/
bash train.sh
to generate training trajectory datasets and train the networks (feel free to ask for the dataset used in this paper if you need).
After obtaining the DIKU neural network models, you can use the Comparison_DIKU_prediction.ipynb in folder Deepkoopman/ to evaluate the bidirectional prediction performance as shown in Fig. 3.
To evaluate our ASKU method, please
cd ASKU
run Online_TIS.ipynb
to reproduce Fig. 4.
Sampling-based Reachability Analysis: A Random Set Theory Approach with Adversarial Sampling
@article{meng2024online,
title={Online Time-Informed Kinodynamic Motion Planning of Nonlinear Systems},
author={Meng, Fei and Liu, Jianbang and Shi, Haojie and Ma, Han and Ren, Hongliang and Meng, Max Q-H},
journal={IEEE Robotics and Automation Letters},
year={2024},
publisher={IEEE}
}
[1] Tang, Yongxing, Zhanxia Zhu, and Hongwen Zhang. "A Reachability-Based Spatio-Temporal Sampling Strategy for Kinodynamic Motion Planning." IEEE Robotics and Automation Letters 8.1 (2022): 448-455.