Abstract
To address the challenge posed by saturated attacks of drone swarms to air defense systems, and to achieve the winning goal of “using swarms to counter swarms”, a cooperative target assignment method based on proximal policy optimization (PPO) was proposed. The approach incorporated an attention mechanism to capture interaction features between intercepting and target clusters, enhancing the model’s situational awareness. A hierarchical masking mechanism was also introduced to handle variable-scale target clusters, dynamically screen available interceptors, and avoid fire overlap, thereby satisfying cooperative constraints. Experiments demonstrate that the method maintains good generalization and robustness in complex adversarial scenarios, offering a new solution for intelligent target assignment under dynamic threats.
| Translated title of the contribution | 基于强化学习的反异构无人机集群协同目标分配方法 |
|---|---|
| Original language | English |
| Pages (from-to) | 527-533 |
| Number of pages | 7 |
| Journal | Beijing Ligong Daxue Xuebao/Transaction of Beijing Institute of Technology |
| Volume | 46 |
| Issue number | 5 |
| DOIs | |
| Publication status | Published - 2026 |
| Externally published | Yes |
Keywords
- air defense interception
- dynamic target assignment
- proximal policy optimization (PPO)
- self-attention mechanism
Fingerprint
Dive into the research topics of 'Reinforcement Learning-Based Method for Collaborative Target Assignment Against Heterogeneous UAV Swarms'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver