Connecting Model-Based and Model-Free Control with Emotion Modulation in Learning Systems

Xiao Huang, Wei Wu, Hong Qiao*

*Corresponding author for this work

Research output: Contribution to journal › Article › peer-review

14 citations (Scopus)

Abstract

This article proposes a novel decision-making framework that bridges the gap between model-based (MB) and model-free (MF) control by adjusting only the planning horizon. Specifically, the output policy is obtained by solving a model predictive control problem that uses a locally optimal state value as its terminal constraint. As the planning horizon decreases to zero, MB control transitions smoothly into MF control. In addition, inspired by the neural mechanism by which emotion modulates decision-making, we build a biologically plausible computational model of emotion processing. This model generates an uncertainty-related emotional response from the state prediction error and the reward prediction error, and then dynamically modulates the planning horizon during a task. Simulation results demonstrate that the proposed framework produces better policies than traditional methods, and that emotion modulation shifts effectively between MB and MF control, improving both learning efficiency and decision-making speed.
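The idea of a single planning horizon linking the two control modes can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not taken from the article: the six-state chain task, the cached table `Q`, the helper names `model`, `lookahead_value`, `act`, and `modulated_horizon`, and in particular the mapping from prediction errors to a horizon, which is only a placeholder and not the authors' emotion model. The cached value is used here as a bootstrapped terminal value, a simplification of the terminal constraint mentioned in the abstract.

```python
GAMMA = 0.9
N_STATES = 6            # toy chain of states 0..5; state 5 is an absorbing goal
GOAL = N_STATES - 1
ACTIONS = (-1, +1)      # move left / move right

# Hypothetical cached model-free values Q[s][i] for ACTIONS[i].
# The entry for state 2 is deliberately misleading (it prefers "left").
Q = [[0.0, 0.0], [0.0, 0.0], [0.1, 0.0], [0.0, 0.0], [0.0, 1.0], [0.0, 0.0]]


def model(s, a):
    """Assumed known dynamics: deterministic chain, reward 1 on entering the goal."""
    if s == GOAL:
        return s, 0.0
    s_next = min(max(s + a, 0), GOAL)
    return s_next, (1.0 if s_next == GOAL else 0.0)


def lookahead_value(s, horizon):
    """Best discounted return over `horizon` model steps, closed by the cached value."""
    if horizon == 0:
        return max(Q[s])  # terminal value taken from the model-free estimate
    best = float("-inf")
    for a in ACTIONS:
        s_next, r = model(s, a)
        best = max(best, r + GAMMA * lookahead_value(s_next, horizon - 1))
    return best


def act(s, horizon):
    """horizon == 0 is purely model-free; larger horizons rely more on the model."""
    if horizon == 0:
        return max(ACTIONS, key=lambda a: Q[s][ACTIONS.index(a)])

    def score(a):
        s_next, r = model(s, a)
        return r + GAMMA * lookahead_value(s_next, horizon - 1)

    return max(ACTIONS, key=score)


def modulated_horizon(state_pred_error, reward_pred_error, h_max=3):
    """Placeholder modulation (an assumption, not the paper's emotion model):
    plan deeper when cached values look unreliable (large reward prediction
    error) and shallower when the model looks unreliable (large state
    prediction error)."""
    trust_model = 1.0 / (1.0 + abs(state_pred_error))
    distrust_value = abs(reward_pred_error) / (1.0 + abs(reward_pred_error))
    return round(h_max * trust_model * distrust_value)
```

With this table, `act(2, 0)` follows the misleading cached value and moves left, whereas `act(2, 2)` looks two steps ahead through the model, reaches the reliable value at state 4, and moves right. Calling `act(s, modulated_horizon(spe, rpe))` would then switch between the two regimes as the error signals change.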

Original language: English
Article number: 8876861
Pages (from-to): 4624-4638
Number of pages: 15
Journal: IEEE Transactions on Systems, Man, and Cybernetics: Systems
Volume: 51
Issue number: 8
Publication status: Published - Aug 2021
Externally published: Yes
