HiMacMic: Hierarchical Multi-Agent Deep Reinforcement Learning with Dynamic Asynchronous Macro Strategy

Hancheng Zhang; Guozheng Li; Chi Harold Liu; Guoren Wang; Jian Tang

doi:10.1145/3580305.3599379

HiMacMic: Hierarchical Multi-Agent Deep Reinforcement Learning with Dynamic Asynchronous Macro Strategy

Hancheng Zhang, Guozheng Li^*, Chi Harold Liu, Guoren Wang, Jian Tang

^*此作品的通讯作者

计算机学院

科研成果: 书/报告/会议事项章节 › 会议稿件 › 同行评审

摘要

Multi-agent deep reinforcement learning (MADRL) has been widely used in many scenarios such as robotics and game AI. However, existing methods mainly focus on the optimization of agents' micro policies without considering the macro strategy. As a result, they cannot perform well in complex or sparse reward scenarios like the StarCraft Multi-Agent Challenge (SMAC) and Google Research Football (GRF). To this end, we propose a hierarchical MADRL framework called "HiMacMic"with dynamic asynchronous macro strategy. Spatially, HiMacMic determines a critical position by using a positional heat map. Temporally, the macro strategy dynamically decides its deadline and updates it asynchronously among agents. We validate HiMacMic in four widely used benchmarks, namely: Overcooked, GRF, SMAC and SMAC-v2 with nine chosen scenarios. Results show that HiMacMic not only converges faster and achieves higher results than ten existing approaches, but also shows its adaptability to different environment settings.

源语言	英语
主期刊名	KDD 2023 - Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining
出版商	Association for Computing Machinery
页	3239-3248
页数	10
ISBN（电子版）	9798400701030
DOI	https://doi.org/10.1145/3580305.3599379
出版状态	已出版 - 6 8月 2023
活动	29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, KDD 2023 - Long Beach, 美国期限: 6 8月 2023 → 10 8月 2023

出版系列

姓名	Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining

会议

会议	29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, KDD 2023
国家/地区	美国
市	Long Beach
时期	6/08/23 → 10/08/23

访问文件

10.1145/3580305.3599379

其它文件与链接

链接到 Scopus 的出版物

引用此

Zhang, H., Li, G., Liu, C. H., Wang, G., & Tang, J. (2023). HiMacMic: Hierarchical Multi-Agent Deep Reinforcement Learning with Dynamic Asynchronous Macro Strategy. 在 KDD 2023 - Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (页码 3239-3248). (Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining). Association for Computing Machinery. https://doi.org/10.1145/3580305.3599379

Zhang, Hancheng ; Li, Guozheng ; Liu, Chi Harold 等. / HiMacMic : Hierarchical Multi-Agent Deep Reinforcement Learning with Dynamic Asynchronous Macro Strategy. KDD 2023 - Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining. Association for Computing Machinery, 2023. 页码 3239-3248 (Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining).

@inproceedings{bd4d8d21a69344b3b427f0cb6aaf75b8,

title = "HiMacMic: Hierarchical Multi-Agent Deep Reinforcement Learning with Dynamic Asynchronous Macro Strategy",

abstract = "Multi-agent deep reinforcement learning (MADRL) has been widely used in many scenarios such as robotics and game AI. However, existing methods mainly focus on the optimization of agents' micro policies without considering the macro strategy. As a result, they cannot perform well in complex or sparse reward scenarios like the StarCraft Multi-Agent Challenge (SMAC) and Google Research Football (GRF). To this end, we propose a hierarchical MADRL framework called {"}HiMacMic{"}with dynamic asynchronous macro strategy. Spatially, HiMacMic determines a critical position by using a positional heat map. Temporally, the macro strategy dynamically decides its deadline and updates it asynchronously among agents. We validate HiMacMic in four widely used benchmarks, namely: Overcooked, GRF, SMAC and SMAC-v2 with nine chosen scenarios. Results show that HiMacMic not only converges faster and achieves higher results than ten existing approaches, but also shows its adaptability to different environment settings.",

keywords = "macro strategy, multi-agent deep reinforcement learning",

author = "Hancheng Zhang and Guozheng Li and Liu, {Chi Harold} and Guoren Wang and Jian Tang",

note = "Publisher Copyright: {\textcopyright} 2023 ACM.; 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, KDD 2023 ; Conference date: 06-08-2023 Through 10-08-2023",

year = "2023",

month = aug,

day = "6",

doi = "10.1145/3580305.3599379",

language = "English",

series = "Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining",

publisher = "Association for Computing Machinery",

pages = "3239--3248",

booktitle = "KDD 2023 - Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining",

}

Zhang, H, Li, G , Liu, CH , Wang, G & Tang, J 2023, HiMacMic: Hierarchical Multi-Agent Deep Reinforcement Learning with Dynamic Asynchronous Macro Strategy. 在 KDD 2023 - Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining. Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Association for Computing Machinery, 页码 3239-3248, 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, KDD 2023, Long Beach, 美国, 6/08/23. https://doi.org/10.1145/3580305.3599379

HiMacMic: Hierarchical Multi-Agent Deep Reinforcement Learning with Dynamic Asynchronous Macro Strategy. / Zhang, Hancheng; Li, Guozheng ; Liu, Chi Harold 等.
KDD 2023 - Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining. Association for Computing Machinery, 2023. 页码 3239-3248 (Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining).

科研成果: 书/报告/会议事项章节 › 会议稿件 › 同行评审

TY - GEN

T1 - HiMacMic

T2 - 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, KDD 2023

AU - Zhang, Hancheng

AU - Li, Guozheng

AU - Liu, Chi Harold

AU - Wang, Guoren

AU - Tang, Jian

PY - 2023/8/6

Y1 - 2023/8/6

N2 - Multi-agent deep reinforcement learning (MADRL) has been widely used in many scenarios such as robotics and game AI. However, existing methods mainly focus on the optimization of agents' micro policies without considering the macro strategy. As a result, they cannot perform well in complex or sparse reward scenarios like the StarCraft Multi-Agent Challenge (SMAC) and Google Research Football (GRF). To this end, we propose a hierarchical MADRL framework called "HiMacMic"with dynamic asynchronous macro strategy. Spatially, HiMacMic determines a critical position by using a positional heat map. Temporally, the macro strategy dynamically decides its deadline and updates it asynchronously among agents. We validate HiMacMic in four widely used benchmarks, namely: Overcooked, GRF, SMAC and SMAC-v2 with nine chosen scenarios. Results show that HiMacMic not only converges faster and achieves higher results than ten existing approaches, but also shows its adaptability to different environment settings.

AB - Multi-agent deep reinforcement learning (MADRL) has been widely used in many scenarios such as robotics and game AI. However, existing methods mainly focus on the optimization of agents' micro policies without considering the macro strategy. As a result, they cannot perform well in complex or sparse reward scenarios like the StarCraft Multi-Agent Challenge (SMAC) and Google Research Football (GRF). To this end, we propose a hierarchical MADRL framework called "HiMacMic"with dynamic asynchronous macro strategy. Spatially, HiMacMic determines a critical position by using a positional heat map. Temporally, the macro strategy dynamically decides its deadline and updates it asynchronously among agents. We validate HiMacMic in four widely used benchmarks, namely: Overcooked, GRF, SMAC and SMAC-v2 with nine chosen scenarios. Results show that HiMacMic not only converges faster and achieves higher results than ten existing approaches, but also shows its adaptability to different environment settings.

KW - macro strategy

KW - multi-agent deep reinforcement learning

UR - http://www.scopus.com/inward/record.url?scp=85171346810&partnerID=8YFLogxK

U2 - 10.1145/3580305.3599379

DO - 10.1145/3580305.3599379

M3 - Conference contribution

AN - SCOPUS:85171346810

T3 - Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining

SP - 3239

EP - 3248

BT - KDD 2023 - Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining

PB - Association for Computing Machinery

Y2 - 6 August 2023 through 10 August 2023

ER -

Zhang H, Li G , Liu CH , Wang G, Tang J. HiMacMic: Hierarchical Multi-Agent Deep Reinforcement Learning with Dynamic Asynchronous Macro Strategy. 在 KDD 2023 - Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining. Association for Computing Machinery. 2023. 页码 3239-3248. (Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining). doi: 10.1145/3580305.3599379

HiMacMic: Hierarchical Multi-Agent Deep Reinforcement Learning with Dynamic Asynchronous Macro Strategy

摘要

出版系列

会议

访问文件

其它文件与链接

指纹

引用此