TY - JOUR
T1 - Hybrid bidirectional rapidly exploring random tree path planning algorithm with reinforcement learning
AU - Wang, Junkui
AU - Hirota, Kaoru
AU - Wu, Xiangdong
AU - Dai, Yaping
AU - Jia, Zhiyang
N1 - Publisher Copyright:
© 2021 Fuji Technology Press. All rights reserved.
PY - 2021/1/20
Y1 - 2021/1/20
N2 - The randomness of path generation and slow convergence to the optimal path are two major problems in the current rapidly exploring random tree (RRT) path planning algorithm. Herein, a novel reinforcement-learning-based hybrid bidirectional rapidly exploring random tree (H-BRRT) is presented to solve these problems. To model the random exploration process, a target gravitational strategy is introduced. Reinforcement learning is applied to the improved target gravitational strategy using two operations: random exploration and target gravitational exploration. The algorithm is controlled to switch operations adaptively according to the accumulated performance. It not only improves the search efficiency, but also shortens the generated path after the proposed strategy is applied to a bidirectional rapidly exploring random tree (BRRT). In addition, to solve the problem of the traditional RRT continuously falling into the local optimum, an improved exploration strategy with collision weight is applied to the BRRT. Experimental results implemented in a robot operating system indicate that the proposed H-BRRT significantly outperforms alternative approaches such as the RRT and BRRT. The proposed algorithm enhances the capability of identifying unknown spaces and avoiding local optima.
AB - The randomness of path generation and slow convergence to the optimal path are two major problems in the current rapidly exploring random tree (RRT) path planning algorithm. Herein, a novel reinforcement-learning-based hybrid bidirectional rapidly exploring random tree (H-BRRT) is presented to solve these problems. To model the random exploration process, a target gravitational strategy is introduced. Reinforcement learning is applied to the improved target gravitational strategy using two operations: random exploration and target gravitational exploration. The algorithm is controlled to switch operations adaptively according to the accumulated performance. It not only improves the search efficiency, but also shortens the generated path after the proposed strategy is applied to a bidirectional rapidly exploring random tree (BRRT). In addition, to solve the problem of the traditional RRT continuously falling into the local optimum, an improved exploration strategy with collision weight is applied to the BRRT. Experimental results implemented in a robot operating system indicate that the proposed H-BRRT significantly outperforms alternative approaches such as the RRT and BRRT. The proposed algorithm enhances the capability of identifying unknown spaces and avoiding local optima.
KW - Path planning
KW - Q-learning
KW - Rapidly exploring random tree
KW - Reinforcement learning
KW - Target gravitational strategy
UR - http://www.scopus.com/inward/record.url?scp=85100461845&partnerID=8YFLogxK
U2 - 10.20965/JACIII.2021.P0121
DO - 10.20965/JACIII.2021.P0121
M3 - Article
AN - SCOPUS:85100461845
SN - 1343-0130
VL - 25
SP - 121
EP - 129
JO - Journal of Advanced Computational Intelligence and Intelligent Informatics
JF - Journal of Advanced Computational Intelligence and Intelligent Informatics
IS - 1
ER -