Weighted Bootstrapped DQN: Efficient Exploration via Uncertainty Quantification

Jinhui Pang; Zicong Feng

doi:10.1117/12.3012024

Weighted Bootstrapped DQN: Efficient Exploration via Uncertainty Quantification

Jinhui Pang^*, Zicong Feng

^*Corresponding author for this work

School of Computer Science and Technology

Beijing Institute of Technology

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

Abstract

Uncertainty quantification is an essential method for sample-efficient deep reinforcement learning. There is a growing literature on uncertainty-based deep reinforcement learning algorithms, but many of the previous approaches failed to capture different sources of uncertainty. We highlight why this can be a crucial shortcoming for sample-efficient algorithms and provide a sophisticated analysis of the uncertainty in the interaction between agent and environment. Based on that, we propose Weighted Bootstrapped DQN, an exploration-efficient method that combines network ensembles and variance weighting. We use aleatoric uncertainty estimation together with epistemic uncertainty to improve the exploration ability of the algorithm. We prove that our new approach has a significant improvement in sample efficiency on different gym tasks, even compared with the previous state-of-the-art approaches.

Original language	English
Title of host publication	International Conference on Algorithms, High Performance Computing, and Artificial Intelligence, AHPCAI 2023
Editors	Sandeep Saxena, Cairong Zhao
Publisher	SPIE
ISBN (Electronic)	9781510671881
DOIs	https://doi.org/10.1117/12.3012024
Publication status	Published - 2023
Event	2023 International Conference on Algorithms, High Performance Computing, and Artificial Intelligence, AHPCAI 2023 - Yinchuan, China Duration: 18 Aug 2023 → 19 Aug 2023

Publication series

Name	Proceedings of SPIE - The International Society for Optical Engineering
Volume	12941
ISSN (Print)	0277-786X
ISSN (Electronic)	1996-756X

Conference

Conference	2023 International Conference on Algorithms, High Performance Computing, and Artificial Intelligence, AHPCAI 2023
Country/Territory	China
City	Yinchuan
Period	18/08/23 → 19/08/23

Keywords

Deep Reinforcement Learning
Exploration
Uncertainty Quantification

Access to Document

10.1117/12.3012024

Cite this

Pang, J., & Feng, Z. (2023). Weighted Bootstrapped DQN: Efficient Exploration via Uncertainty Quantification. In S. Saxena, & C. Zhao (Eds.), International Conference on Algorithms, High Performance Computing, and Artificial Intelligence, AHPCAI 2023 Article 129411J (Proceedings of SPIE - The International Society for Optical Engineering; Vol. 12941). SPIE. https://doi.org/10.1117/12.3012024

@inproceedings{1c0a7cebbac24d53977755b8f0fa9114,

title = "Weighted Bootstrapped DQN: Efficient Exploration via Uncertainty Quantification",

abstract = "Uncertainty quantification is an essential method for sample-efficient deep reinforcement learning. There is a growing literature on uncertainty-based deep reinforcement learning algorithms, but many of the previous approaches failed to capture different sources of uncertainty. We highlight why this can be a crucial shortcoming for sample-efficient algorithms and provide a sophisticated analysis of the uncertainty in the interaction between agent and environment. Based on that, we propose Weighted Bootstrapped DQN, an exploration-efficient method that combines network ensembles and variance weighting. We use aleatoric uncertainty estimation together with epistemic uncertainty to improve the exploration ability of the algorithm. We prove that our new approach has a significant improvement in sample efficiency on different gym tasks, even compared with the previous state-of-the-art approaches.",

keywords = "Deep Reinforcement Learning, Exploration, Uncertainty Quantification",

author = "Jinhui Pang and Zicong Feng",

note = "Publisher Copyright: {\textcopyright} 2023 SPIE. All rights reserved.; 2023 International Conference on Algorithms, High Performance Computing, and Artificial Intelligence, AHPCAI 2023 ; Conference date: 18-08-2023 Through 19-08-2023",

year = "2023",

doi = "10.1117/12.3012024",

language = "English",

series = "Proceedings of SPIE - The International Society for Optical Engineering",

publisher = "SPIE",

editor = "Sandeep Saxena and Cairong Zhao",

booktitle = "International Conference on Algorithms, High Performance Computing, and Artificial Intelligence, AHPCAI 2023",

address = "United States",

}

Pang, J & Feng, Z 2023, Weighted Bootstrapped DQN: Efficient Exploration via Uncertainty Quantification. in S Saxena & C Zhao (eds), International Conference on Algorithms, High Performance Computing, and Artificial Intelligence, AHPCAI 2023., 129411J, Proceedings of SPIE - The International Society for Optical Engineering, vol. 12941, SPIE, 2023 International Conference on Algorithms, High Performance Computing, and Artificial Intelligence, AHPCAI 2023, Yinchuan, China, 18/08/23. https://doi.org/10.1117/12.3012024

Weighted Bootstrapped DQN: Efficient Exploration via Uncertainty Quantification. / Pang, Jinhui; Feng, Zicong.
International Conference on Algorithms, High Performance Computing, and Artificial Intelligence, AHPCAI 2023. ed. / Sandeep Saxena; Cairong Zhao. SPIE, 2023. 129411J (Proceedings of SPIE - The International Society for Optical Engineering; Vol. 12941).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

TY - GEN

T1 - Weighted Bootstrapped DQN

T2 - 2023 International Conference on Algorithms, High Performance Computing, and Artificial Intelligence, AHPCAI 2023

AU - Pang, Jinhui

AU - Feng, Zicong

PY - 2023

Y1 - 2023

N2 - Uncertainty quantification is an essential method for sample-efficient deep reinforcement learning. There is a growing literature on uncertainty-based deep reinforcement learning algorithms, but many of the previous approaches failed to capture different sources of uncertainty. We highlight why this can be a crucial shortcoming for sample-efficient algorithms and provide a sophisticated analysis of the uncertainty in the interaction between agent and environment. Based on that, we propose Weighted Bootstrapped DQN, an exploration-efficient method that combines network ensembles and variance weighting. We use aleatoric uncertainty estimation together with epistemic uncertainty to improve the exploration ability of the algorithm. We prove that our new approach has a significant improvement in sample efficiency on different gym tasks, even compared with the previous state-of-the-art approaches.

AB - Uncertainty quantification is an essential method for sample-efficient deep reinforcement learning. There is a growing literature on uncertainty-based deep reinforcement learning algorithms, but many of the previous approaches failed to capture different sources of uncertainty. We highlight why this can be a crucial shortcoming for sample-efficient algorithms and provide a sophisticated analysis of the uncertainty in the interaction between agent and environment. Based on that, we propose Weighted Bootstrapped DQN, an exploration-efficient method that combines network ensembles and variance weighting. We use aleatoric uncertainty estimation together with epistemic uncertainty to improve the exploration ability of the algorithm. We prove that our new approach has a significant improvement in sample efficiency on different gym tasks, even compared with the previous state-of-the-art approaches.

KW - Deep Reinforcement Learning

KW - Exploration

KW - Uncertainty Quantification

UR - http://www.scopus.com/inward/record.url?scp=85180125037&partnerID=8YFLogxK

U2 - 10.1117/12.3012024

DO - 10.1117/12.3012024

M3 - Conference contribution

AN - SCOPUS:85180125037

T3 - Proceedings of SPIE - The International Society for Optical Engineering

BT - International Conference on Algorithms, High Performance Computing, and Artificial Intelligence, AHPCAI 2023

A2 - Saxena, Sandeep

A2 - Zhao, Cairong

PB - SPIE

Y2 - 18 August 2023 through 19 August 2023

ER -

Weighted Bootstrapped DQN: Efficient Exploration via Uncertainty Quantification

Abstract

Publication series

Conference

Keywords

Access to Document

Other files and links

Fingerprint

Cite this