Reinforcement-learning-based path planning for UAVs in intensive obstacle environment

科研成果: 书/报告/会议事项章节会议稿件同行评审

2 引用 (Scopus)
Plum Print visual indicator of research metrics
  • Citations
    • Citation Indexes: 3
  • Captures
    • Readers: 2
see details

摘要

In intensive obstacle environment, the available flying space is narrow, which makes it difficult to generate feasible path for UAVs within limited runtime. In this paper, a Q-learning-based planning algorithm is presented to improve the efficiency of single UAV path planning in intensive obstacle environment. By constructing the space-action state offline learning planning architecture, the proposed method realizes the rapid path planning of UAV, and solves the high time-consuming problem of reinforcement learning online path planning. Considering the time-consuming problem of Q-table re-training, a probabilistic local update mechanism is proposed by updating the Q-value of the states to reduce the high time-consuming of Q-table re-raining and realize the rapid update of Q-table. The probability of Q-value updating is up to the distance to the new obstacle. The closer the state is to the new obstacle, the higher its probability of re-training. Therefore, the flight trajectory can be quickly re-planned when the environment changes. Simulation results show that the proposed Q-learning-based planning algorithm can generate path for UAV from random start position and avoid the obstacles. Compared with the classical A* algorithm, the path planning time based on the trained Q table can be reduced from second to millisecond, which significantly improves the efficiency of path planning.

源语言英语
主期刊名Proceeding - 2021 China Automation Congress, CAC 2021
出版商Institute of Electrical and Electronics Engineers Inc.
6451-6455
页数5
ISBN(电子版)9781665426473
DOI
出版状态已出版 - 2021
活动2021 China Automation Congress, CAC 2021 - Beijing, 中国
期限: 22 10月 202124 10月 2021

出版系列

姓名Proceeding - 2021 China Automation Congress, CAC 2021

会议

会议2021 China Automation Congress, CAC 2021
国家/地区中国
Beijing
时期22/10/2124/10/21

指纹

探究 'Reinforcement-learning-based path planning for UAVs in intensive obstacle environment' 的科研主题。它们共同构成独一无二的指纹。

引用此

Guo, M., Long, T., Li, H., & Sun, J. (2021). Reinforcement-learning-based path planning for UAVs in intensive obstacle environment. 在 Proceeding - 2021 China Automation Congress, CAC 2021 (页码 6451-6455). (Proceeding - 2021 China Automation Congress, CAC 2021). Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/CAC53003.2021.9727746