TY - GEN
T1 - Deep Reinforcement Learning Based 3D Integrated Guidance And Control For Hypersonic Missiles
AU - Xie, Tian
AU - Feng, Xiaoxue
AU - Wen, Yue
AU - Jiang, Xinyi
AU - Pan, Feng
AU - Li, Zhenxu
N1 - Publisher Copyright:
© 2024 IEEE.
PY - 2024
Y1 - 2024
N2 - The terminal guidance of hypersonic missiles in 3D space presents strong nonlinearity and coupling. The traditional dual loop guidance methods cannot meet the missile dynamic constraints with time delays. In response to the above issues, this paper proposes an improved deep reinforcement learning algorithm using a non-perfect classifier, trying to prevent invalid experience from entering the experience pool as much as possible, so that the agent can get rid of them when updating strategies. Firstly, the missile guidance and control problem is modeled as a Markov decision process. Then, a simulation environment is constructed based on the dynamic model of hypersonic missiles, an appropriate state space and a dense reward function based on non-perfect classifier are both designed. The soft actor-critic algorithm is utilized to train the agent. An integrated guidance and control strategy is finally obtained, which can generate real-time rudder angle instructions to hit the target based on the current state. The effectiveness, generality, and robustness of the method have been verified through several simulation experiments.
AB - The terminal guidance of hypersonic missiles in 3D space presents strong nonlinearity and coupling. The traditional dual loop guidance methods cannot meet the missile dynamic constraints with time delays. In response to the above issues, this paper proposes an improved deep reinforcement learning algorithm using a non-perfect classifier, trying to prevent invalid experience from entering the experience pool as much as possible, so that the agent can get rid of them when updating strategies. Firstly, the missile guidance and control problem is modeled as a Markov decision process. Then, a simulation environment is constructed based on the dynamic model of hypersonic missiles, an appropriate state space and a dense reward function based on non-perfect classifier are both designed. The soft actor-critic algorithm is utilized to train the agent. An integrated guidance and control strategy is finally obtained, which can generate real-time rudder angle instructions to hit the target based on the current state. The effectiveness, generality, and robustness of the method have been verified through several simulation experiments.
KW - Deep reinforcement learning
KW - Integrated guidance and control
KW - Non-perfect classifier
KW - Strong Coupling
UR - http://www.scopus.com/inward/record.url?scp=85200317107&partnerID=8YFLogxK
U2 - 10.1109/CCDC62350.2024.10588133
DO - 10.1109/CCDC62350.2024.10588133
M3 - Conference contribution
AN - SCOPUS:85200317107
T3 - Proceedings of the 36th Chinese Control and Decision Conference, CCDC 2024
SP - 3535
EP - 3540
BT - Proceedings of the 36th Chinese Control and Decision Conference, CCDC 2024
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 36th Chinese Control and Decision Conference, CCDC 2024
Y2 - 25 May 2024 through 27 May 2024
ER -