跳到主要导航 跳到搜索 跳到主要内容

Weighted Bootstrapped DQN: Efficient Exploration via Uncertainty Quantification

  • Beijing Institute of Technology

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Uncertainty quantification is an essential method for sample-efficient deep reinforcement learning. There is a growing literature on uncertainty-based deep reinforcement learning algorithms, but many of the previous approaches failed to capture different sources of uncertainty. We highlight why this can be a crucial shortcoming for sample-efficient algorithms and provide a sophisticated analysis of the uncertainty in the interaction between agent and environment. Based on that, we propose Weighted Bootstrapped DQN, an exploration-efficient method that combines network ensembles and variance weighting. We use aleatoric uncertainty estimation together with epistemic uncertainty to improve the exploration ability of the algorithm. We prove that our new approach has a significant improvement in sample efficiency on different gym tasks, even compared with the previous state-of-the-art approaches.

源语言英语
主期刊名International Conference on Algorithms, High Performance Computing, and Artificial Intelligence, AHPCAI 2023
编辑Sandeep Saxena, Cairong Zhao
出版商SPIE
ISBN(电子版)9781510671881
DOI
出版状态已出版 - 2023
活动2023 International Conference on Algorithms, High Performance Computing, and Artificial Intelligence, AHPCAI 2023 - Yinchuan, 中国
期限: 18 8月 202319 8月 2023

出版系列

姓名Proceedings of SPIE - The International Society for Optical Engineering
12941
ISSN(印刷版)0277-786X
ISSN(电子版)1996-756X

会议

会议2023 International Conference on Algorithms, High Performance Computing, and Artificial Intelligence, AHPCAI 2023
国家/地区中国
Yinchuan
时期18/08/2319/08/23

指纹

探究 'Weighted Bootstrapped DQN: Efficient Exploration via Uncertainty Quantification' 的科研主题。它们共同构成独一无二的指纹。

引用此