A Game-Learning-Based Smooth Path Planning Strategy for Intelligent Air-Ground Vehicle Considering Mode Switching

Jing Zhao, Chao Yang*, Weida Wang, Bin Xu, Ying Li, Liuquan Yang, Hua Zhu, Changle Xiang

*Corresponding author for this work

Research output: Contribution to journal › Article › peer-review

22 Citations (Scopus)

Abstract

Numerous missions in both civil and military fields involve the pursuit-evasion problem of vehicles. With vertical take-off and landing capability, the intelligent air-ground vehicle extends its feasible paths into 3-D space, which is a great advantage in pursuit. The vehicle requires adequate path planning to obtain an optimal 3-D path and further improve pursuit efficiency. Planning for the air-ground vehicle currently faces the technical difficulty of determining the proper takeoff timing and position while optimizing the planned trajectory, especially in a complex environment with dense obstacles. To solve these issues, a game-learning-based smooth path planning strategy for the intelligent air-ground vehicle considering mode switching is proposed in this article. First, a new reward function for the Q-learning algorithm, which accounts for the influence of flight obstacle-crossing parameters, is presented to search for a short forward track distance. Second, in the update rule, a pursuit-evasion game is applied to the mode-switching decisions. During interactive learning between the vehicle and the environment, this game constantly updates the Nash equilibrium solutions for mode switching and yields a series of switching decisions for the pursuer vehicle (ego vehicle). Third, a double-yaw correction for path smoothing is proposed to reduce turning points and avoid local path deviations. This modification provides heuristic information for exploring the environment, which significantly speeds up the convergence of the algorithm. Finally, the proposed strategy is verified on a 1000 m × 1000 m map with obstacle heights of 0-200 m. Results show that the strategy shortens the path distance by 253 m compared with the traditional reinforcement learning algorithm and converges faster. The number of trajectory direction changes is 36% lower than that of the game-learning algorithm that considers only mode switching, and unreasonable large-angle turns are eliminated.
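
For readers unfamiliar with the Q-learning algorithm the abstract builds on, the following is a minimal sketch of the standard tabular update rule only. It is not the authors' modified rule: the abstract does not specify the reward function, the game-based mode-switching term, or the state encoding. The function name `q_update`, the action labels `"drive"` and `"fly"`, the grid-cell states, and the values of `alpha` and `gamma` are all assumptions made for illustration.

```python
from collections import defaultdict


def q_update(q, state, action, reward, next_state, actions, alpha=0.1, gamma=0.9):
    """Apply one standard tabular Q-learning update in place.

    q      : dict mapping (state, action) -> estimated value
    alpha  : learning rate (assumed value)
    gamma  : discount factor (assumed value)
    Returns the updated value of q[(state, action)].
    """
    # Greedy value of the successor state over all available actions.
    best_next = max(q[(next_state, a)] for a in actions)
    # Temporal-difference target and in-place update.
    td_target = reward + gamma * best_next
    q[(state, action)] += alpha * (td_target - q[(state, action)])
    return q[(state, action)]


# Hypothetical action set: stay on the ground or switch to flight.
ACTIONS = ("drive", "fly")

# Unseen (state, action) pairs default to a value of 0.0.
q_table = defaultdict(float)

# One illustrative step: moving from cell (0, 0) to (0, 1) with a
# hypothetical step cost of -1.0.
new_value = q_update(q_table, (0, 0), "drive", -1.0, (0, 1), ACTIONS)
```

In a scheme like the one the abstract describes, the reward passed to such an update would additionally reflect flight obstacle-crossing costs, and the choice between ground and flight actions would come from the pursuit-evasion game rather than a plain greedy maximum.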

Original language: English
Pages (from-to): 3349-3366
Number of pages: 18
Journal: IEEE Transactions on Transportation Electrification
Volume: 8
Issue number: 3
Publication status: Published - 1 Sep 2022
