Weighted Bootstrapped DQN: Efficient Exploration via Uncertainty Quantification

Jinhui Pang*, Zicong Feng

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Uncertainty quantification is an essential method for sample-efficient deep reinforcement learning. There is a growing literature on uncertainty-based deep reinforcement learning algorithms, but many of the previous approaches failed to capture different sources of uncertainty. We highlight why this can be a crucial shortcoming for sample-efficient algorithms and provide a sophisticated analysis of the uncertainty in the interaction between agent and environment. Based on that, we propose Weighted Bootstrapped DQN, an exploration-efficient method that combines network ensembles and variance weighting. We use aleatoric uncertainty estimation together with epistemic uncertainty to improve the exploration ability of the algorithm. We prove that our new approach has a significant improvement in sample efficiency on different gym tasks, even compared with the previous state-of-the-art approaches.

Original languageEnglish
Title of host publicationInternational Conference on Algorithms, High Performance Computing, and Artificial Intelligence, AHPCAI 2023
EditorsSandeep Saxena, Cairong Zhao
PublisherSPIE
ISBN (Electronic)9781510671881
DOIs
Publication statusPublished - 2023
Event2023 International Conference on Algorithms, High Performance Computing, and Artificial Intelligence, AHPCAI 2023 - Yinchuan, China
Duration: 18 Aug 202319 Aug 2023

Publication series

NameProceedings of SPIE - The International Society for Optical Engineering
Volume12941
ISSN (Print)0277-786X
ISSN (Electronic)1996-756X

Conference

Conference2023 International Conference on Algorithms, High Performance Computing, and Artificial Intelligence, AHPCAI 2023
Country/TerritoryChina
CityYinchuan
Period18/08/2319/08/23

Keywords

  • Deep Reinforcement Learning
  • Exploration
  • Uncertainty Quantification

Fingerprint

Dive into the research topics of 'Weighted Bootstrapped DQN: Efficient Exploration via Uncertainty Quantification'. Together they form a unique fingerprint.

Cite this