Skip to main navigation Skip to search Skip to main content

Reinforcement Learning-Based Method for Collaborative Target Assignment Against Heterogeneous UAV Swarms

  • Beijing Institute of Technology
  • Shenzhen MSU-BIT University

Research output: Contribution to journalArticlepeer-review

Abstract

To address the challenge posed by saturated attacks of drone swarms to air defense systems, and to achieve the winning goal of “using swarms to counter swarms”, a cooperative target assignment method based on proximal policy optimization (PPO) was proposed. The approach incorporated an attention mechanism to capture interaction features between intercepting and target clusters, enhancing the model’s situational awareness. A hierarchical masking mechanism was also introduced to handle variable-scale target clusters, dynamically screen available interceptors, and avoid fire overlap, thereby satisfying cooperative constraints. Experiments demonstrate that the method maintains good generalization and robustness in complex adversarial scenarios, offering a new solution for intelligent target assignment under dynamic threats.

Translated title of the contribution基于强化学习的反异构无人机集群协同目标分配方法
Original languageEnglish
Pages (from-to)527-533
Number of pages7
JournalBeijing Ligong Daxue Xuebao/Transaction of Beijing Institute of Technology
Volume46
Issue number5
DOIs
Publication statusPublished - 2026
Externally publishedYes

Keywords

  • air defense interception
  • dynamic target assignment
  • proximal policy optimization (PPO)
  • self-attention mechanism

Fingerprint

Dive into the research topics of 'Reinforcement Learning-Based Method for Collaborative Target Assignment Against Heterogeneous UAV Swarms'. Together they form a unique fingerprint.

Cite this