Abstract
This paper examines the potential of emerging deep reinforcement learning techniques in missile guidance applications. To this end, a Markov decision process is formulated that enables reinforcement learning theory to be applied to the guidance problem. A reward function is shaped heuristically to trade off guidance accuracy, energy consumption, and interception time. The state-of-the-art deep deterministic policy gradient algorithm is used to learn an action policy that maps the observed engagement states to a guidance command. Extensive numerical simulations are performed to validate the proposed computational guidance algorithm.
Original language | English |
---|---|
Pages (from-to) | 571-582 |
Number of pages | 12 |
Journal | Journal of Aerospace Information Systems |
Volume | 18 |
Issue | 8 |
DOI | |
Publication status | Published - 2021 |