TY - GEN
T1 - Brain Inspired Episodic Memory Deep Q-Networks for Sparse Reward
AU - Wu, Xinyu
AU - Fan, Chaoqiong
AU - Jia, Tianyuan
AU - Wu, Xia
N1 - Publisher Copyright:
© 2023 IEEE.
PY - 2023
Y1 - 2023
N2 - Although deep reinforcement learning has achieved great success in recent years, it still suffers from slow convergence, low sample efficiency, and large computational resources due to the existence of reward sparsity in many decision problems. Therefore, exploring more effective algorithms that can cope with sparse rewards is of great importance. Episodic memory reinforcement learning has received a lot of attention for its ability to collect past dominant strategies, which can serve as successful experiences to efficiently guide agents in sparse reward environments. In this paper, the episodic memory deep Q-networks, which incorporates the fast convergence property of episodic memory into neural networks, is employed to solve decision making problem with sparse rewards. Atari games are the test beds. Experiments show that the episodic memory deep Q-networks outperforms the deep Q-networks and the prioritized experience replay, which demonstrates the sample efficiency and the effectiveness of episodic memory deep Q-networks for the sparse reward problem.
AB - Although deep reinforcement learning has achieved great success in recent years, it still suffers from slow convergence, low sample efficiency, and large computational resources due to the existence of reward sparsity in many decision problems. Therefore, exploring more effective algorithms that can cope with sparse rewards is of great importance. Episodic memory reinforcement learning has received a lot of attention for its ability to collect past dominant strategies, which can serve as successful experiences to efficiently guide agents in sparse reward environments. In this paper, the episodic memory deep Q-networks, which incorporates the fast convergence property of episodic memory into neural networks, is employed to solve decision making problem with sparse rewards. Atari games are the test beds. Experiments show that the episodic memory deep Q-networks outperforms the deep Q-networks and the prioritized experience replay, which demonstrates the sample efficiency and the effectiveness of episodic memory deep Q-networks for the sparse reward problem.
KW - deep q-networks
KW - episodic memory
KW - reinforcement learning
KW - sparse reward
UR - http://www.scopus.com/inward/record.url?scp=85189749500&partnerID=8YFLogxK
U2 - 10.1109/ICNC59488.2023.10462817
DO - 10.1109/ICNC59488.2023.10462817
M3 - Conference contribution
AN - SCOPUS:85189749500
T3 - 2023 International Conference on Neuromorphic Computing, ICNC 2023
SP - 430
EP - 434
BT - 2023 International Conference on Neuromorphic Computing, ICNC 2023
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2023 International Conference on Neuromorphic Computing, ICNC 2023
Y2 - 15 December 2023 through 17 December 2023
ER -