TY - JOUR
T1 - 基于强化学习冲突消解的大规模无人机集群航迹规划方法
AU - Zhou, Zhenlin
AU - Long, Teng
AU - Liu, Dawei
AU - Sun, Jingliang
AU - Zhong, Jianxin
AU - Li, Junzhi
N1 - Publisher Copyright:
© 2025 China Ordnance Industry Corporation. All rights reserved.
PY - 2025/5/31
Y1 - 2025/5/31
N2 - In the context of large-scale unmanned aerial vehicle (UAV) swarm cooperative flight scenarios, the high computational time consumption in swarm path planning is caused by frequent path conflicts. Aiming at the problem above, a large-scale UAV swarm path planning method based on reinforcement learning conflict resolution is developed. A dual-layer planning architecture, comprising a high-level layer of conflict resolution and a low-level layer of path planning, is constructed to reduce the spatial and temporal dimensions of path conflicts. At the high-level layer of conflict resolution, a conflict resolution strategy network based on the Rainbow deep Q-networks (DQN) algorithm training framework is designed. This network transforms the resolution process of each path conflict into the action selection process of left and right tree nodes of a binary tree. This approach maps different conflict resolution sequences to their outcomes, thereby reducing the traversal of tree nodes and improving the efficiency of conflict resolution. At the low-level layer of path planning, the time dimension is incorporated into the spatial collision avoidance strategy. A re-planning jump point search (ReJPS) method based on a node re-expansion mechanism is proposed, which increases the feasible planning domain and enhances the ability to resolve the path conflicts. Simulated results indicate that, compared to the path planning methods based on the conflict-based search (CBS) + A* and CBS + ReJPS, the proposed method reduces the average planning time by 86. 64% and 19. 65%, respectively, while maintaining comparable optimality.
AB - In the context of large-scale unmanned aerial vehicle (UAV) swarm cooperative flight scenarios, the high computational time consumption in swarm path planning is caused by frequent path conflicts. Aiming at the problem above, a large-scale UAV swarm path planning method based on reinforcement learning conflict resolution is developed. A dual-layer planning architecture, comprising a high-level layer of conflict resolution and a low-level layer of path planning, is constructed to reduce the spatial and temporal dimensions of path conflicts. At the high-level layer of conflict resolution, a conflict resolution strategy network based on the Rainbow deep Q-networks (DQN) algorithm training framework is designed. This network transforms the resolution process of each path conflict into the action selection process of left and right tree nodes of a binary tree. This approach maps different conflict resolution sequences to their outcomes, thereby reducing the traversal of tree nodes and improving the efficiency of conflict resolution. At the low-level layer of path planning, the time dimension is incorporated into the spatial collision avoidance strategy. A re-planning jump point search (ReJPS) method based on a node re-expansion mechanism is proposed, which increases the feasible planning domain and enhances the ability to resolve the path conflicts. Simulated results indicate that, compared to the path planning methods based on the conflict-based search (CBS) + A* and CBS + ReJPS, the proposed method reduces the average planning time by 86. 64% and 19. 65%, respectively, while maintaining comparable optimality.
KW - conflict resolution
KW - conflict-based search
KW - deep reinforcement learning
KW - path planning
KW - unmanned aerial vehicle swarm
UR - http://www.scopus.com/inward/record.url?scp=105007035580&partnerID=8YFLogxK
U2 - 10.12382/bgxb.2024.1146
DO - 10.12382/bgxb.2024.1146
M3 - 文章
AN - SCOPUS:105007035580
SN - 1000-1093
VL - 46
JO - Binggong Xuebao/Acta Armamentarii
JF - Binggong Xuebao/Acta Armamentarii
IS - 5
M1 - 241146
ER -