OB-HPPO: An Option and Intrinsic Curiosity Based Hierarchical Reinforcement Learning Approach for Real-Time Strategy Games

Ruilin Jiang, Yanlong Zhai*, Yan Zheng, You Li, Yanglin Liu

*此作品的通讯作者

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

The multi-agent real-time strategy game problem is a classic problem in the field of reinforcement learning, and solving such a problem is of high instructive significance to the economic and military fields in real society. In recent years, researchers from many countries have made breakthroughs in the related problems, but most related technologies target specific environments or require high computing power platforms. This leads to an exponential increase in the time and resources consumed in training models when the complexity and scope of a task increases. In this paper, we proposed OB-HPPO, an option and intrinsic curiosity based hierarchical reinforcement learning framework to address these challenges. Our approach hierarchically decomposes a huge action space into several self-explainable options, simplifying atomic action decisions into a series of action decisions. OB-HPPO also introduces an intrinsic curiosity module (ICM) based on the Proximal Policy Optimization (PPO) algorithm to improve the efficiency of model training and exploration. Experimental results show that OB-HPPO takes less training time and accumulates more rewards than non-hierarchical models. We also test OB-HPPO against some representative AI models of the μRTS environment, and OB-HPPO's winning rate is significantly improved.

源语言英语
主期刊名Advanced Intelligent Computing Technology and Applications - 20th International Conference, ICIC 2024, Proceedings
编辑De-Shuang Huang, Yijie Pan, Xiankun Zhang
出版商Springer Science and Business Media Deutschland GmbH
443-454
页数12
ISBN(印刷版)9789819755806
DOI
出版状态已出版 - 2024
活动20th International Conference on Intelligent Computing, ICIC 2024 - Tianjin, 中国
期限: 5 8月 20248 8月 2024

出版系列

姓名Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
14863 LNCS
ISSN(印刷版)0302-9743
ISSN(电子版)1611-3349

会议

会议20th International Conference on Intelligent Computing, ICIC 2024
国家/地区中国
Tianjin
时期5/08/248/08/24

指纹

探究 'OB-HPPO: An Option and Intrinsic Curiosity Based Hierarchical Reinforcement Learning Approach for Real-Time Strategy Games' 的科研主题。它们共同构成独一无二的指纹。

引用此