TY - JOUR
T1 - Impact time control guidance law with time-varying velocity based on deep reinforcement learning
AU - Yang, Zhuoqiao
AU - Liu, Xiangdong
AU - Liu, Haikuo
N1 - Publisher Copyright:
© 2023 Elsevier Masson SAS
PY - 2023/11
Y1 - 2023/11
N2 - This paper investigates the problem of impact-time-control guidance law with the time-varying velocity caused by gravity and aerodynamic drag. Using the deep reinforcement learning (DRL) algorithm, we propose a novel impact time control guidance (ITCG) law in which a DRL agent is trained from scratch without using any prior knowledge. Different from the traditional ITCG law, the proposed method doesn't rely on the time-to-go estimation, which is difficult to derive and inaccurate with the time-varying velocity. Further, a prioritized experience replay method and a novel action exploration method are introduced in the DRL algorithm to improve learning efficiency. Additionally, the agent action is shaped to provide smooth guidance command, which avoids the problem that the guidance command generated by the intelligent algorithm may not be continuous. Numerical simulations are conducted to support the validity of the proposed algorithm.
AB - This paper investigates the problem of impact-time-control guidance law with the time-varying velocity caused by gravity and aerodynamic drag. Using the deep reinforcement learning (DRL) algorithm, we propose a novel impact time control guidance (ITCG) law in which a DRL agent is trained from scratch without using any prior knowledge. Different from the traditional ITCG law, the proposed method doesn't rely on the time-to-go estimation, which is difficult to derive and inaccurate with the time-varying velocity. Further, a prioritized experience replay method and a novel action exploration method are introduced in the DRL algorithm to improve learning efficiency. Additionally, the agent action is shaped to provide smooth guidance command, which avoids the problem that the guidance command generated by the intelligent algorithm may not be continuous. Numerical simulations are conducted to support the validity of the proposed algorithm.
KW - Deep reinforcement learning
KW - Impact time control guidance
KW - Missile guidance
KW - Time-varying velocity
UR - http://www.scopus.com/inward/record.url?scp=85171350448&partnerID=8YFLogxK
U2 - 10.1016/j.ast.2023.108603
DO - 10.1016/j.ast.2023.108603
M3 - Article
AN - SCOPUS:85171350448
SN - 1270-9638
VL - 142
JO - Aerospace Science and Technology
JF - Aerospace Science and Technology
M1 - 108603
ER -