强化学习与自适应动态规划: 从基础理论到多智能体系统中的应用进展综述

Translated title of the contribution: Reinforcement learning and adaptive/approximate dynamic programming: A survey from theory to applications in multi-agent systems

Guang Hui Wen, Tao Yang*, Jia Ling Zhou, Jun Jie Fu, Lei Xu

*Corresponding author for this work

Research output: Contribution to journalReview articlepeer-review

7 Citations (Scopus)

Abstract

Reinforcement learning (RL) and adaptive/approximate dynamic programming (ADP) algorithms have recently received much attention from various scientific fields (e.g., artificial intelligence, systems and control, and applied mathematics). This is partly due to their successful applications in a series of challenging problems, such as the sequential decision and optimal coordination control problems of large-scale multi-agent systems. In this paper, some preliminaries on RL and ADP algorithms are firstly introduced, and then the developments of these two closely related algorithms in different research fields are reviewed respectively, with emphasis on the developments from solving the sequential decision (optimal control) problem for single agent (control plant) to the sequential decision (optimal coordination control) problem of multi-agent systems by utilizing these two algorithms. Furthermore, after briefly surveying the structure evolution of the ADP algorithm in the last decades and the recent development of the ADP algorithm from model-based offline programming framework to model-free online learning framework, the research progress of the ADP algorithm in solving the optimal coordination control problem of multi-agent systems is reviewed. Finally, some interesting yet challenging issues on MARL algorithms and using ADP algorithms to solve optimal coordination control problem of multi-agent systems are suggested.

Translated title of the contributionReinforcement learning and adaptive/approximate dynamic programming: A survey from theory to applications in multi-agent systems
Original languageChinese (Traditional)
Pages (from-to)1200-1230
Number of pages31
JournalKongzhi yu Juece/Control and Decision
Volume38
Issue number5
DOIs
Publication statusPublished - May 2023

Fingerprint

Dive into the research topics of 'Reinforcement learning and adaptive/approximate dynamic programming: A survey from theory to applications in multi-agent systems'. Together they form a unique fingerprint.

Cite this