Computational missile guidance: a deep reinforcement learning approach

Shaoming He; Hyo Sang Shin; Antonios Tsourdos

doi:10.2514/1.I010970

School of Aerospace Engineering

Cranfield University

Research output: Contribution to journal › Article › peer-review

61 Citations (Scopus)

Abstract

This paper examines the potential of emerging deep reinforcement learning techniques in missile guidance applications. To this end, a Markovian decision process is formulated that enables reinforcement learning theory to be applied to the guidance problem. A reward function is shaped heuristically to trade off guidance accuracy, energy consumption, and interception time. The state-of-the-art deep deterministic policy gradient algorithm is used to learn an action policy that maps the observed engagement states to a guidance command. Extensive numerical simulations are performed to validate the proposed computational guidance algorithm.

Original language	English
Pages (from-to)	571-582
Number of pages	12
Journal	Journal of Aerospace Information Systems
Volume	18
Issue number	8
DOIs	https://doi.org/10.2514/1.I010970
Publication status	Published - 2021

Cite this

@article{dfe766709f6a44d2902d17ee27b0568d,
  title = "Computational missile guidance: a deep reinforcement learning approach",
  abstract = "This paper aims to examine the potential of using the emerging deep reinforcement learning techniques in missile guidance applications. To this end, a Markovian decision process that enables the application of reinforcement learning theory to solve the guidance problem is formulated. A heuristic way is used to shape a proper reward function that has tradeoff between guidance accuracy, energy consumption, and interception time. The state-of-the-art deep deterministic policy gradient algorithm is used to learn an action policy that maps the observed engagements states to a guidance command. Extensive empirical numerical simulations are performed to validate the proposed computational guidance algorithm.",
  author = "Shaoming He and Shin, {Hyo Sang} and Antonios Tsourdos",
  note = "Publisher Copyright: {\textcopyright} 2021 by Hyo-Sang Shin. Published by the American Institute of Aeronautics and Astronautics, Inc., with permission.",
  year = "2021",
  doi = "10.2514/1.I010970",
  language = "English",
  volume = "18",
  pages = "571--582",
  journal = "Journal of Aerospace Information Systems",
  issn = "1940-3151",
  publisher = "American Institute of Aeronautics and Astronautics Inc. (AIAA)",
  number = "8",
}

TY  - JOUR
T1  - Computational missile guidance
T2  - a deep reinforcement learning approach
AU  - He, Shaoming
AU  - Shin, Hyo Sang
AU  - Tsourdos, Antonios
PY  - 2021
Y1  - 2021
N2  - This paper aims to examine the potential of using the emerging deep reinforcement learning techniques in missile guidance applications. To this end, a Markovian decision process that enables the application of reinforcement learning theory to solve the guidance problem is formulated. A heuristic way is used to shape a proper reward function that has tradeoff between guidance accuracy, energy consumption, and interception time. The state-of-the-art deep deterministic policy gradient algorithm is used to learn an action policy that maps the observed engagements states to a guidance command. Extensive empirical numerical simulations are performed to validate the proposed computational guidance algorithm.
AB  - This paper aims to examine the potential of using the emerging deep reinforcement learning techniques in missile guidance applications. To this end, a Markovian decision process that enables the application of reinforcement learning theory to solve the guidance problem is formulated. A heuristic way is used to shape a proper reward function that has tradeoff between guidance accuracy, energy consumption, and interception time. The state-of-the-art deep deterministic policy gradient algorithm is used to learn an action policy that maps the observed engagements states to a guidance command. Extensive empirical numerical simulations are performed to validate the proposed computational guidance algorithm.
UR  - http://www.scopus.com/inward/record.url?scp=85114317291&partnerID=8YFLogxK
U2  - 10.2514/1.I010970
DO  - 10.2514/1.I010970
M3  - Article
AN  - SCOPUS:85114317291
SN  - 1940-3151
VL  - 18
SP  - 571
EP  - 582
JO  - Journal of Aerospace Information Systems
JF  - Journal of Aerospace Information Systems
IS  - 8
ER  -
