A-MAPPO: Attention-Enhanced Multi-Agent Proximal Policy Optimization

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

In the domain of multi-agent reinforcement learning, the scalability of multi-agent systems presents challenges for conventional policy-based methods. As the scale increases, these methods struggle due to the growing state space and partially observable Markov decision process, which are further exacerbated by the interference between observations. This paper introduces a novel framework for enhancing multi-agent proximal policy optimization with a hard attention network. All of the features in the observation vector of one particular agent can be re-sorted according to their calculated attention values, and only those are relatively important are preserved and aggregated for decision making. Within the resorting and pruning manipulations based on hard attention, the input space of actor network is efficiently reduced, leading to faster and more stable learning for policy and critics. Our framework outperforms the vanilla multi-agent proximal policy optimization algorithm on cluster confrontation tasks of various scales and ensures training success even under extreme observation interference.

Original languageEnglish
Title of host publicationProceedings of the 2nd Aerospace Frontiers Conference, AFC 2025 - Volume V
PublisherSpringer Science and Business Media Deutschland GmbH
Pages281-292
Number of pages12
ISBN (Print)9789819529971
DOIs
Publication statusPublished - 2026
Event2nd Aerospace Frontiers Conference, AFC 2025 - Beijing, China
Duration: 11 Apr 202514 Apr 2025

Publication series

NameLecture Notes in Mechanical Engineering
ISSN (Print)2195-4356
ISSN (Electronic)2195-4364

Conference

Conference2nd Aerospace Frontiers Conference, AFC 2025
Country/TerritoryChina
CityBeijing
Period11/04/2514/04/25

Keywords

  • Deep reinforcement learning
  • hard attention mechanism
  • multi-agent coordination
  • multi-agent reinforcement learning
  • multi-agent systems

Fingerprint

Dive into the research topics of 'A-MAPPO: Attention-Enhanced Multi-Agent Proximal Policy Optimization'. Together they form a unique fingerprint.

Cite this