Learning-Based Policy Optimization for Adversarial Missile-Target Assignment

Weilin Luo, Jinhu Lu*, Kexin Liu, Lei Chen

*此作品的通讯作者

科研成果: 期刊稿件文章同行评审

36 引用 (Scopus)

摘要

The missile-target assignment (MTA) is a typical weapon-target assignment problem in Command and Control of modern warfare. Despite the significance of the problem, traditional algorithms still lack efficiency, solution quality, and practicability in the adversarial environment. In this article, we propose a data-driven policy optimization with deep reinforcement learning (PODRL) for the adversarial MTA. We design a comprehensive reward function to motivate the optimization of assignment policy. As such, the learned policy can implicitly model the penetration of missiles under an adversarial environment in a data-driven way. We also present a fair sample strategy to improve the sample efficiency and accelerate the policy optimization. Experimental results show that PODRL can adaptively generate satisfactory solutions in both small-scale and large-scale instances. Furthermore, we evaluate the effectiveness of PODRL in a multiobjective scenario. The result demonstrates that a well-optimized policy can achieve high-quality allocation and demand forecast of the missile resources simultaneously.

源语言英语
页(从-至)4426-4437
页数12
期刊IEEE Transactions on Systems, Man, and Cybernetics: Systems
52
7
DOI
出版状态已出版 - 1 7月 2022

指纹

探究 'Learning-Based Policy Optimization for Adversarial Missile-Target Assignment' 的科研主题。它们共同构成独一无二的指纹。

引用此