TY - JOUR
T1 - Connecting Model-Based and Model-Free Control with Emotion Modulation in Learning Systems
AU - Huang, Xiao
AU - Wu, Wei
AU - Qiao, Hong
N1 - Publisher Copyright:
© 2013 IEEE.
PY - 2021/8
Y1 - 2021/8
N2 - This article proposes a novel decision-making framework that bridges a gap between model-based (MB) and model-free (MF) control processes through only adjusting the planning horizon. Specifically, the output policy is obtained by solving a model predictive control problem with a locally optimal state value as terminal constraints. When the planning horizon decreases to zero, the MB control will transform into the MF control smoothly. Meanwhile, inspired by the neural mechanism of emotion modulation on decision-making, we build a biologically plausible computational model of emotion processing. This model can generate an uncertainty-related emotional response on the basis of the state prediction error and reward prediction error, and then dynamically modulates the planning horizon in the tasks. The simulation results demonstrate that the proposed decision-making framework can produce better policies than traditional methods. Emotion modulation can shift the MB and MF control well to improve the learning efficiency and the speed of decision-making.
AB - This article proposes a novel decision-making framework that bridges a gap between model-based (MB) and model-free (MF) control processes through only adjusting the planning horizon. Specifically, the output policy is obtained by solving a model predictive control problem with a locally optimal state value as terminal constraints. When the planning horizon decreases to zero, the MB control will transform into the MF control smoothly. Meanwhile, inspired by the neural mechanism of emotion modulation on decision-making, we build a biologically plausible computational model of emotion processing. This model can generate an uncertainty-related emotional response on the basis of the state prediction error and reward prediction error, and then dynamically modulates the planning horizon in the tasks. The simulation results demonstrate that the proposed decision-making framework can produce better policies than traditional methods. Emotion modulation can shift the MB and MF control well to improve the learning efficiency and the speed of decision-making.
KW - Brain-inspired computing
KW - decision-making
KW - emotion modulation
KW - emotion-cognition interactions
KW - reinforcement learning
UR - http://www.scopus.com/inward/record.url?scp=85110542095&partnerID=8YFLogxK
U2 - 10.1109/TSMC.2019.2933152
DO - 10.1109/TSMC.2019.2933152
M3 - Article
AN - SCOPUS:85110542095
SN - 2168-2216
VL - 51
SP - 4624
EP - 4638
JO - IEEE Transactions on Systems, Man, and Cybernetics: Systems
JF - IEEE Transactions on Systems, Man, and Cybernetics: Systems
IS - 8
M1 - 8876861
ER -