Screening goals and selecting policies in hierarchical reinforcement learning

Junyan Zhou, Jing Chen*, Yanfeng Tong, Junrui Zhang

*此作品的通讯作者

科研成果: 期刊稿件文章同行评审

1 引用 (Scopus)

摘要

Hierarchical Reinforcement Learning (HRL) is primarily proposed for addressing problems with sparse reward signals and a long time horizon. Many existing HRL algorithms use neural networks to automatically produce goals, which have not taken into account that not all goals advance the task. Some methods have addressed the optimization of goal generation, while the goal is represented by the specific value of the state. In this paper, we propose a novel HRL algorithm for automatically discovering goals, which solves the problem for the optimization of goal generation in the latent state space by screening goals and selecting policies. We compare our approach with the state-of-the-art algorithms on Atari 2600 games and the results show that it can speed up learning and improve performance.

源语言英语
页(从-至)18049-18060
页数12
期刊Applied Intelligence
52
15
DOI
出版状态已出版 - 12月 2022

指纹

探究 'Screening goals and selecting policies in hierarchical reinforcement learning' 的科研主题。它们共同构成独一无二的指纹。

引用此