A Game-Learning-Based Smooth Path Planning Strategy for Intelligent Air-Ground Vehicle Considering Mode Switching

Jing Zhao, Chao Yang*, Weida Wang, Bin Xu, Ying Li, Liuquan Yang, Hua Zhu, Changle Xiang

*Corresponding author for this work

Research output: Contribution to journal › Article › peer-review

20 Citations (Scopus)

Abstract

Numerous missions in both civil and military fields involve the vehicle pursuit-evasion problem. With its vertical take-off and landing capability, the intelligent air-ground vehicle expands the feasible path into 3-D space, which offers great advantages in pursuit. Such a vehicle requires adequate path planning to obtain an optimal 3-D path and further improve pursuit efficiency. The planning process of the air-ground vehicle currently faces the technical difficulty of acquiring the proper takeoff timing and position while optimizing the planned trajectory, especially in a complex environment with dense obstacles. To solve these issues, a game-learning-based smooth path planning strategy for the intelligent air-ground vehicle considering mode switching is proposed in this article. First, a new reward function for the Q-learning algorithm, considering the influence of flight obstacle-crossing parameters, is presented to explore a short forward track distance. Second, in the update rule, the pursuit-evasion game acts on the mode-switching decisions. During interactive learning between the vehicle and the environment, this game constantly updates the Nash equilibrium solutions for mode switching and yields a series of switching decisions for the pursuer (ego) vehicle. Third, a double-yaw correction for path-smoothing modification is proposed to reduce turning points and avoid local path deviations. This modification provides heuristic information for environment exploration, which significantly speeds up the convergence of the algorithm. Finally, the proposed strategy is verified on a 1000 m × 1000 m map with obstacle heights of 0-200 m. Results show that the strategy shortens the path by 253 m compared with the traditional reinforcement learning algorithm and converges faster. The number of trajectory direction changes is 36% less than that of the game-learning algorithm considering only mode switching, and unreasonable large-angle turns are eliminated.
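For illustration, the tabular Q-learning update that underlies such a strategy can be sketched as follows. This is a generic sketch, not the paper's formulation: the paper's reward shaping, state encoding, and game-based mode-switching rule are more elaborate, and the penalty values and function names below are hypothetical.

```python
def q_update(Q, state, action, reward, next_state, actions,
             alpha=0.1, gamma=0.9):
    """Standard tabular Q-learning update:
    Q(s,a) <- Q(s,a) + alpha * (r + gamma * max_a' Q(s',a') - Q(s,a))."""
    best_next = max(Q.get((next_state, a), 0.0) for a in actions)
    old = Q.get((state, action), 0.0)
    Q[(state, action)] = old + alpha * (reward + gamma * best_next - old)
    return Q[(state, action)]


def shaped_reward(step_cost, crossed_obstacle, flight_penalty=2.0):
    """Hypothetical reward shaping: penalize travelled distance, plus an
    extra penalty when the vehicle crosses an obstacle in flight mode,
    discouraging unnecessary takeoffs."""
    r = -step_cost
    if crossed_obstacle:
        r -= flight_penalty
    return r


# Minimal usage: one learning step on an empty Q-table.
Q = {}
actions = ["N", "E", "S", "W", "fly"]
r = shaped_reward(step_cost=1.0, crossed_obstacle=True)  # -3.0
q_update(Q, (0, 0), "fly", r, (0, 1), actions)
```

In the paper, the flight-related terms of the reward and the choice between ground and flight actions are additionally governed by the Nash equilibrium of the pursuit-evasion game rather than a fixed penalty.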

Original language: English
Pages (from-to): 3349-3366
Number of pages: 18
Journal: IEEE Transactions on Transportation Electrification
Volume: 8
Issue number: 3
DOIs
Publication status: Published - 1 Sept 2022

Keywords

  • Air-ground vehicle
  • Q reinforcement learning (RL)
  • mode switching
  • path planning
  • path smoothness
  • pursuit-evasion game
