Multi-agent Reinforcement Learning with Multi-head Attention

Ke Ni, Jing Chen*, Jian Wang, Bo Liu, Ting Lei

*此作品的通讯作者

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Multi-agent reinforcement learning(MARL) methods have become an important approach to solving the decision making problems of agents. As the environment's complexity increases, the attention model can effectively solve the problem of information redundancy. However, the introduction of attention models in reinforcement learning may also lead to over-focusing and neglecting other potentially useful information. Moreover, the presence of attention would slow the convergence in the early stages of training. To address the above problem, we propose a divided attention reinforcement learning approach: (i) the involvement of an attention regularization term to make agents more divergent in their focus on different directions; (ii) the use of a layer normalization network structure and the use of a Pre-Layer Normalization(Pre-LN) network structure for the attention optimization in the initialization phase of training. It allows the agents to have a more stable and smooth gradient descent in the early stages of learning. Our approach has been tested in several multi-agent environment tasks. Compared to other related multi-agent methods, our method obtains higher final rewards and training efficiency.

源语言英语
主期刊名ITNEC 2023 - IEEE 6th Information Technology, Networking, Electronic and Automation Control Conference
编辑Bing Xu
出版商Institute of Electrical and Electronics Engineers Inc.
1311-1315
页数5
ISBN(电子版)9781665460033
DOI
出版状态已出版 - 2023
活动6th IEEE Information Technology, Networking, Electronic and Automation Control Conference, ITNEC 2023 - Chongqing, 中国
期限: 24 2月 202326 2月 2023

出版系列

姓名ITNEC 2023 - IEEE 6th Information Technology, Networking, Electronic and Automation Control Conference

会议

会议6th IEEE Information Technology, Networking, Electronic and Automation Control Conference, ITNEC 2023
国家/地区中国
Chongqing
时期24/02/2326/02/23

指纹

探究 'Multi-agent Reinforcement Learning with Multi-head Attention' 的科研主题。它们共同构成独一无二的指纹。

引用此