跳到主要导航 跳到搜索 跳到主要内容

Supervision-Enhanced Deep Reinforcement Learning for Hybrid Action in BVR Engagement

  • Haoran Han
  • , Jian Cheng*
  • , Maolong Lv*
  • , Li Mo
  • *此作品的通讯作者
  • University of Electronic Science and Technology of China
  • Air Force Engineering University Xian
  • Beijing Institute of Technology

科研成果: 期刊稿件文章同行评审

摘要

Deep reinforcement learning has gained increasing popularity in the beyond visual range engagement of unmanned combat aerial vehicles. Regardless of the success of the maneuvering decision, existing methods either utilize sparse rewards to optimize fire control or adopt a predefined fire policy, severely limiting the engagement performance. This paper presents a supervision-enhanced technique to address the challenges of the hybrid action space. A dual-head policy is developed, with one head dedicated to maneuvering decisions and the other to fire control. Alongside the reinforcement learning memory that stores state transitions, supervised learning memory is introduced to store pairs of states where missiles are launched and the hit outcomes. The maneuvering head is trained with the soft policy gradient, while the fire control head is optimized with the binary cross-entropy loss. This method enables agents to make maneuvering decisions and predict missile hit probabilities simultaneously without prior expert knowledge. Simulation demonstrates the superiority of the proposed method.

源语言英语
页(从-至)1138-1145
页数8
期刊IEEE Robotics and Automation Letters
11
2
DOI
出版状态已出版 - 2026
已对外发布

指纹

探究 'Supervision-Enhanced Deep Reinforcement Learning for Hybrid Action in BVR Engagement' 的科研主题。它们共同构成独一无二的指纹。

引用此