Abstract
The ever-changing battlefield environment requires robust and adaptive technologies integrated into a reliable platform. Unmanned combat aerial vehicles (UCAVs) aim to integrate such advanced technologies while increasing the tactical capabilities of combat aircraft. Conventional UCAVs use a neural-network fitting strategy to obtain attack-area values. However, this simple strategy cannot cope with complex environmental changes or autonomously optimize decision-making. To address this, this paper proposes a new deep deterministic policy gradient (DDPG) strategy, based on deep reinforcement learning, for fitting the attack areas of UCAVs on the future battlefield. Simulation results show that the new DDPG algorithm improves the autonomy and environmental adaptability of UCAVs and that the training process converges quickly. With the well-trained deep network, the optimal attack-area values can be obtained in real time throughout the flight.
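The core of DDPG is an actor-critic pair: the critic is regressed toward a temporal-difference target, and the deterministic actor is updated by ascending the critic's gradient with respect to the action. The sketch below illustrates only that update rule on a hypothetical one-step toy problem (reward peaks when the action matches the state), not the paper's attack-area model; it uses single-step episodes, so the target networks of full DDPG are omitted, and all names and hyperparameters are illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)

def features(s, a):
    # Quadratic features so a linear critic can represent r = -(a - s)^2.
    return np.array([s * s, a * a, s * a, s, a, 1.0])

w = np.zeros(6)       # critic weights: Q(s, a) = w . features(s, a)
theta = 0.0           # actor: deterministic policy mu(s) = theta * s
alpha_c, alpha_a = 0.1, 0.05

buffer = []           # replay buffer of (state, action, reward) tuples
for step in range(3000):
    s = rng.uniform(-1, 1)
    a = theta * s + 0.3 * rng.normal()   # exploration noise on the actor
    r = -(a - s) ** 2                    # toy reward: best when a == s
    buffer.append((s, a, r))

    # Sample a minibatch uniformly from the replay buffer.
    batch = [buffer[i] for i in rng.integers(0, len(buffer), 16)]
    for bs, ba, br in batch:
        # Critic update: episodes are one step long, so the TD target is r.
        phi = features(bs, ba)
        w += alpha_c * (br - w @ phi) * phi / 16
        # Actor update: deterministic policy gradient,
        # grad_theta J = grad_a Q(s, a)|a=mu(s) * grad_theta mu(s).
        mu = theta * bs
        dq_da = 2 * w[1] * mu + w[2] * bs + w[4]
        theta += alpha_a * dq_da * bs / 16

print(theta)  # should approach 1.0, i.e. mu(s) ~ s
```

Because the critic's quadratic features can represent the true reward exactly, the actor parameter converges to the optimal policy; in the paper's setting, deep networks play the roles of both the linear critic and linear actor here.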
| Original language | English |
|---|---|
| Pages (from-to) | 734-742 |
| Number of pages | 9 |
| Journal | Journal of Systems Engineering and Electronics |
| Volume | 31 |
| Issue number | 4 |
| DOIs | |
| Publication status | Published - Aug 2020 |
Keywords
- attack area
- deep deterministic policy gradient (DDPG)
- neural network
- unmanned combat aerial vehicle (UCAV)