TY - GEN
T1 - BGC
T2 - 3rd International Conference on Distributed Artificial Intelligence, DAI 2021
AU - Zhou, Tianze
AU - Zhang, Fubiao
AU - Tang, Pan
AU - Wang, Chenfei
N1 - Publisher Copyright:
© 2022, Springer Nature Switzerland AG.
PY - 2022
Y1 - 2022
N2 - Recent advances have witnessed that value decomposed-based multi-agent reinforcement learning methods make an efficient performance in coordination tasks. Most current methods assume that agents can communicate to assist decisions, which is impractical in some real situations. In this paper, we propose an observation-to-cognition method to enable agents to realize high efficient coordination without communication. Inspired by the neighborhood cognitive consistency (NCC), we introduce the group concept to help agents learn a belief, a type of consensus, to realize that adjacent agents tend to accomplish similar sub-tasks to achieve cooperation. We propose a novel agent structure named Belief in Graph Clustering (BGC) via Graph Attention Network (GAT) to generate agent group belief. In this module, we further utilize an MLP-based module to characterize special agent features to express the unique characteristics of each agent. Besides, to overcome the consistent agent problem of NCC, a split loss is introduced to distinguish different agents and reduce the number of groups. Results reveal that the proposed method makes excellent coordination and achieves a significant improvement in the SMAC benchmark. Due to the group concept, our approach maintains excellent performance with an increase in the number of agents.
AB - Recent advances have witnessed that value decomposed-based multi-agent reinforcement learning methods make an efficient performance in coordination tasks. Most current methods assume that agents can communicate to assist decisions, which is impractical in some real situations. In this paper, we propose an observation-to-cognition method to enable agents to realize high efficient coordination without communication. Inspired by the neighborhood cognitive consistency (NCC), we introduce the group concept to help agents learn a belief, a type of consensus, to realize that adjacent agents tend to accomplish similar sub-tasks to achieve cooperation. We propose a novel agent structure named Belief in Graph Clustering (BGC) via Graph Attention Network (GAT) to generate agent group belief. In this module, we further utilize an MLP-based module to characterize special agent features to express the unique characteristics of each agent. Besides, to overcome the consistent agent problem of NCC, a split loss is introduced to distinguish different agents and reduce the number of groups. Results reveal that the proposed method makes excellent coordination and achieves a significant improvement in the SMAC benchmark. Due to the group concept, our approach maintains excellent performance with an increase in the number of agents.
KW - Graph attention network
KW - Group concept
KW - Multi-agent reinforcement learning
UR - http://www.scopus.com/inward/record.url?scp=85123415913&partnerID=8YFLogxK
U2 - 10.1007/978-3-030-94662-3_4
DO - 10.1007/978-3-030-94662-3_4
M3 - Conference contribution
AN - SCOPUS:85123415913
SN - 9783030946616
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 52
EP - 63
BT - Distributed Artificial Intelligence - 3rd International Conference, DAI 2021, Proceedings
A2 - Chen, Jie
A2 - Lang, Jérôme
A2 - Amato, Christopher
A2 - Zhao, Dengji
PB - Springer Science and Business Media Deutschland GmbH
Y2 - 17 December 2021 through 18 December 2021
ER -