A Game-Learning-Based Smooth Path Planning Strategy for Intelligent Air-Ground Vehicle Considering Mode Switching

Jing Zhao; Chao Yang; Weida Wang; Bin Xu; Ying Li; Liuquan Yang; Hua Zhu; Changle Xiang

doi:10.1109/TTE.2022.3142150

A Game-Learning-Based Smooth Path Planning Strategy for Intelligent Air-Ground Vehicle Considering Mode Switching

Jing Zhao, Chao Yang^*, Weida Wang, Bin Xu, Ying Li, Liuquan Yang, Hua Zhu, Changle Xiang

^*此作品的通讯作者

机械与车辆学院

Beijing Institute of Technology

科研成果: 期刊稿件 › 文章 › 同行评审

22 引用（Scopus）

摘要

Numerous missions in both civil and military fields involve the pursuit-evasion problem of vehicles. With vertical take-off and landing capability, the intelligent air-ground vehicle expands its feasible path to 3-D space, which has great advantages in the pursuit. This vehicle requires adequate path planning to obtain an optimal 3-D path and further improve the pursuit efficiency. The planning process of the air-ground vehicle currently faces the technical difficulties of acquiring the proper takeoff timing and position while optimizing the planning trajectory, especially in a complex environment with dense obstacles. To solve the above issues, a game-learning-based smooth path planning strategy for the intelligent air-ground vehicle considering mode switching is proposed in this article. First, a new reward function of the Q-learning algorithm, considering the influence of flight obstacle crossing parameters, is presented to explore the short forward track distance. Second, in the update rule, the pursuit-evasion game acts in the mode switching decisions. During interactive learning between the vehicle and environment, this game constantly updates the Nash equilibrium solutions for mode switching and gets a series of switching decisions of the pursuer vehicle (ego vehicle). Third, a double-yaw correction for path smoothing modification is proposed to reduce turning points and avoid local path deviations. This modification provides heuristic information for the exploration of the environment, which significantly speeds up the convergence speed of the algorithm. Finally, the proposed strategy is verified on a 1000 m∗1000 m map with 0-200 m obstacle height. Results show that this strategy is effective to decrease the 253-m distance compared with the traditional reinforcement learning algorithm and has a faster convergence speed. The number of trajectory direction changes is 36% less than that of the game-learning algorithm only considering mode switching. The unreasonable large angle turns are eliminated.

源语言	英语
页（从-至）	3349-3366
页数	18
期刊	IEEE Transactions on Transportation Electrification
卷	8
期	3
DOI	https://doi.org/10.1109/TTE.2022.3142150
出版状态	已出版 - 1 9月 2022

访问文件

10.1109/TTE.2022.3142150

其它文件与链接

链接到 Scopus 的出版物

引用此

@article{4490cebde78d4dba94b2771afe6f1f12,

title = "A Game-Learning-Based Smooth Path Planning Strategy for Intelligent Air-Ground Vehicle Considering Mode Switching",

abstract = "Numerous missions in both civil and military fields involve the pursuit-evasion problem of vehicles. With vertical take-off and landing capability, the intelligent air-ground vehicle expands its feasible path to 3-D space, which has great advantages in the pursuit. This vehicle requires adequate path planning to obtain an optimal 3-D path and further improve the pursuit efficiency. The planning process of the air-ground vehicle currently faces the technical difficulties of acquiring the proper takeoff timing and position while optimizing the planning trajectory, especially in a complex environment with dense obstacles. To solve the above issues, a game-learning-based smooth path planning strategy for the intelligent air-ground vehicle considering mode switching is proposed in this article. First, a new reward function of the Q-learning algorithm, considering the influence of flight obstacle crossing parameters, is presented to explore the short forward track distance. Second, in the update rule, the pursuit-evasion game acts in the mode switching decisions. During interactive learning between the vehicle and environment, this game constantly updates the Nash equilibrium solutions for mode switching and gets a series of switching decisions of the pursuer vehicle (ego vehicle). Third, a double-yaw correction for path smoothing modification is proposed to reduce turning points and avoid local path deviations. This modification provides heuristic information for the exploration of the environment, which significantly speeds up the convergence speed of the algorithm. Finally, the proposed strategy is verified on a 1000 m∗1000 m map with 0-200 m obstacle height. Results show that this strategy is effective to decrease the 253-m distance compared with the traditional reinforcement learning algorithm and has a faster convergence speed. The number of trajectory direction changes is 36% less than that of the game-learning algorithm only considering mode switching. The unreasonable large angle turns are eliminated.",

keywords = "Air-ground vehicle, Q reinforcement learning (RL), mode switching, path planning, path smoothness, pursuit-evasion game",

author = "Jing Zhao and Chao Yang and Weida Wang and Bin Xu and Ying Li and Liuquan Yang and Hua Zhu and Changle Xiang",

note = "Publisher Copyright: {\textcopyright} 2015 IEEE.",

year = "2022",

month = sep,

day = "1",

doi = "10.1109/TTE.2022.3142150",

language = "English",

volume = "8",

pages = "3349--3366",

journal = "IEEE Transactions on Transportation Electrification",

issn = "2332-7782",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

number = "3",

}

TY - JOUR

T1 - A Game-Learning-Based Smooth Path Planning Strategy for Intelligent Air-Ground Vehicle Considering Mode Switching

AU - Zhao, Jing

AU - Yang, Chao

AU - Wang, Weida

AU - Xu, Bin

AU - Li, Ying

AU - Yang, Liuquan

AU - Zhu, Hua

AU - Xiang, Changle

PY - 2022/9/1

Y1 - 2022/9/1

N2 - Numerous missions in both civil and military fields involve the pursuit-evasion problem of vehicles. With vertical take-off and landing capability, the intelligent air-ground vehicle expands its feasible path to 3-D space, which has great advantages in the pursuit. This vehicle requires adequate path planning to obtain an optimal 3-D path and further improve the pursuit efficiency. The planning process of the air-ground vehicle currently faces the technical difficulties of acquiring the proper takeoff timing and position while optimizing the planning trajectory, especially in a complex environment with dense obstacles. To solve the above issues, a game-learning-based smooth path planning strategy for the intelligent air-ground vehicle considering mode switching is proposed in this article. First, a new reward function of the Q-learning algorithm, considering the influence of flight obstacle crossing parameters, is presented to explore the short forward track distance. Second, in the update rule, the pursuit-evasion game acts in the mode switching decisions. During interactive learning between the vehicle and environment, this game constantly updates the Nash equilibrium solutions for mode switching and gets a series of switching decisions of the pursuer vehicle (ego vehicle). Third, a double-yaw correction for path smoothing modification is proposed to reduce turning points and avoid local path deviations. This modification provides heuristic information for the exploration of the environment, which significantly speeds up the convergence speed of the algorithm. Finally, the proposed strategy is verified on a 1000 m∗1000 m map with 0-200 m obstacle height. Results show that this strategy is effective to decrease the 253-m distance compared with the traditional reinforcement learning algorithm and has a faster convergence speed. The number of trajectory direction changes is 36% less than that of the game-learning algorithm only considering mode switching. The unreasonable large angle turns are eliminated.

AB - Numerous missions in both civil and military fields involve the pursuit-evasion problem of vehicles. With vertical take-off and landing capability, the intelligent air-ground vehicle expands its feasible path to 3-D space, which has great advantages in the pursuit. This vehicle requires adequate path planning to obtain an optimal 3-D path and further improve the pursuit efficiency. The planning process of the air-ground vehicle currently faces the technical difficulties of acquiring the proper takeoff timing and position while optimizing the planning trajectory, especially in a complex environment with dense obstacles. To solve the above issues, a game-learning-based smooth path planning strategy for the intelligent air-ground vehicle considering mode switching is proposed in this article. First, a new reward function of the Q-learning algorithm, considering the influence of flight obstacle crossing parameters, is presented to explore the short forward track distance. Second, in the update rule, the pursuit-evasion game acts in the mode switching decisions. During interactive learning between the vehicle and environment, this game constantly updates the Nash equilibrium solutions for mode switching and gets a series of switching decisions of the pursuer vehicle (ego vehicle). Third, a double-yaw correction for path smoothing modification is proposed to reduce turning points and avoid local path deviations. This modification provides heuristic information for the exploration of the environment, which significantly speeds up the convergence speed of the algorithm. Finally, the proposed strategy is verified on a 1000 m∗1000 m map with 0-200 m obstacle height. Results show that this strategy is effective to decrease the 253-m distance compared with the traditional reinforcement learning algorithm and has a faster convergence speed. The number of trajectory direction changes is 36% less than that of the game-learning algorithm only considering mode switching. The unreasonable large angle turns are eliminated.

KW - Air-ground vehicle

KW - Q reinforcement learning (RL)

KW - mode switching

KW - path planning

KW - path smoothness

KW - pursuit-evasion game

UR - http://www.scopus.com/inward/record.url?scp=85123279380&partnerID=8YFLogxK

U2 - 10.1109/TTE.2022.3142150

DO - 10.1109/TTE.2022.3142150

M3 - Article

AN - SCOPUS:85123279380

SN - 2332-7782

VL - 8

SP - 3349

EP - 3366

JO - IEEE Transactions on Transportation Electrification

JF - IEEE Transactions on Transportation Electrification

IS - 3

ER -

A Game-Learning-Based Smooth Path Planning Strategy for Intelligent Air-Ground Vehicle Considering Mode Switching

摘要

访问文件

其它文件与链接

指纹

引用此