Computational missile guidance: a deep reinforcement learning approach

Shaoming He; Hyo Sang Shin; Antonios Tsourdos

doi:10.2514/1.I010970

Computational missile guidance: a deep reinforcement learning approach

Shaoming He, Hyo Sang Shin, Antonios Tsourdos

宇航学院

Cranfield University

科研成果: 期刊稿件 › 文章 › 同行评审

73 引用（Scopus）

摘要

This paper aims to examine the potential of using the emerging deep reinforcement learning techniques in missile guidance applications. To this end, a Markovian decision process that enables the application of reinforcement learning theory to solve the guidance problem is formulated. A heuristic way is used to shape a proper reward function that has tradeoff between guidance accuracy, energy consumption, and interception time. The state-of-the-art deep deterministic policy gradient algorithm is used to learn an action policy that maps the observed engagements states to a guidance command. Extensive empirical numerical simulations are performed to validate the proposed computational guidance algorithm.

源语言	英语
页（从-至）	571-582
页数	12
期刊	Journal of Aerospace Information Systems
卷	18
期	8
DOI	https://doi.org/10.2514/1.I010970
出版状态	已出版 - 2021

联合国可持续发展目标

此成果有助于实现下列可持续发展目标：

访问文件

10.2514/1.I010970

其它文件与链接

链接到 Scopus 的出版物

引用此

He, S., Shin, H. S., & Tsourdos, A. (2021). Computational missile guidance: a deep reinforcement learning approach. Journal of Aerospace Information Systems, 18(8), 571-582. https://doi.org/10.2514/1.I010970

@article{dfe766709f6a44d2902d17ee27b0568d,

title = "Computational missile guidance: a deep reinforcement learning approach",

abstract = "This paper aims to examine the potential of using the emerging deep reinforcement learning techniques in missile guidance applications. To this end, a Markovian decision process that enables the application of reinforcement learning theory to solve the guidance problem is formulated. A heuristic way is used to shape a proper reward function that has tradeoff between guidance accuracy, energy consumption, and interception time. The state-of-the-art deep deterministic policy gradient algorithm is used to learn an action policy that maps the observed engagements states to a guidance command. Extensive empirical numerical simulations are performed to validate the proposed computational guidance algorithm.",

author = "Shaoming He and Shin, {Hyo Sang} and Antonios Tsourdos",

note = "Publisher Copyright: {\textcopyright} 2021 by Hyo-Sang Shin. Published by the American Institute of Aeronautics and Astronautics, Inc., with permission.",

year = "2021",

doi = "10.2514/1.I010970",

language = "English",

volume = "18",

pages = "571--582",

journal = "Journal of Aerospace Information Systems",

issn = "1940-3151",

publisher = "American Institute of Aeronautics and Astronautics Inc. (AIAA)",

number = "8",

}

TY - JOUR

T1 - Computational missile guidance

T2 - a deep reinforcement learning approach

AU - He, Shaoming

AU - Shin, Hyo Sang

AU - Tsourdos, Antonios

PY - 2021

Y1 - 2021

N2 - This paper aims to examine the potential of using the emerging deep reinforcement learning techniques in missile guidance applications. To this end, a Markovian decision process that enables the application of reinforcement learning theory to solve the guidance problem is formulated. A heuristic way is used to shape a proper reward function that has tradeoff between guidance accuracy, energy consumption, and interception time. The state-of-the-art deep deterministic policy gradient algorithm is used to learn an action policy that maps the observed engagements states to a guidance command. Extensive empirical numerical simulations are performed to validate the proposed computational guidance algorithm.

AB - This paper aims to examine the potential of using the emerging deep reinforcement learning techniques in missile guidance applications. To this end, a Markovian decision process that enables the application of reinforcement learning theory to solve the guidance problem is formulated. A heuristic way is used to shape a proper reward function that has tradeoff between guidance accuracy, energy consumption, and interception time. The state-of-the-art deep deterministic policy gradient algorithm is used to learn an action policy that maps the observed engagements states to a guidance command. Extensive empirical numerical simulations are performed to validate the proposed computational guidance algorithm.

UR - http://www.scopus.com/inward/record.url?scp=85114317291&partnerID=8YFLogxK

U2 - 10.2514/1.I010970

DO - 10.2514/1.I010970

M3 - Article

AN - SCOPUS:85114317291

SN - 1940-3151

VL - 18

SP - 571

EP - 582

JO - Journal of Aerospace Information Systems

JF - Journal of Aerospace Information Systems

IS - 8

ER -

Computational missile guidance: a deep reinforcement learning approach

摘要

联合国可持续发展目标

访问文件

其它文件与链接

指纹

引用此