Dual Optimization-Based Distributed Tracking Control Under Completely Unknown Dynamics

Di Mei, Jian Sun, Yong Xu, Lihua Dou

Research output: Contribution to journal › Article › peer-review

Abstract

Existing results on output tracking control in multi-agent systems mainly rely on the relative state among agents, yet in many cases the system state is not available or measurable. In this paper, we investigate model-free learning-based distributed optimal tracking control for heterogeneous multi-agent systems with dynamic output feedback under a switching reinforcement learning algorithm. First, a relative-output-based distributed observer is developed that requires no exchange of state information, which significantly reduces the interaction load and broadens the range of applications. Then, a distributed feedback-feedforward controller is proposed, in which the optimal feedback and feedforward gain matrices are learned online by solving two optimization problems with policy iteration-based reinforcement learning, rather than relying on the leader's state as in existing studies. Subsequently, the policy iteration (PI) algorithm is modified into a value iteration (VI) learning algorithm, which relaxes the requirement for an initial stabilizing control policy and does not depend on a known dynamical model. Additionally, a switching reinforcement learning algorithm is put forward that integrates the merits of both methods: it removes the initial stabilizing assumption while guaranteeing convergence in a model-free fashion. Finally, a simulation example is provided to illustrate the theoretical analysis.

Note to Practitioners: This paper investigates model-free learning-based distributed optimal tracking control for heterogeneous multi-agent systems with dynamic output feedback under a switching reinforcement learning algorithm. The proposed distributed cooperative control algorithms can be applied to multiple ground, air, and underwater vehicles. Existing results on output tracking control rely on accurate system dynamics and ignore transient performance, which makes the designed controllers far from optimal in potential applications. Moreover, existing optimal learning algorithms strictly rely on an initial stabilizing control policy tied to accurate system dynamics, and they may fail when the system model is completely unknown. To overcome these issues, we develop a data-driven, model-free reinforcement learning algorithm that addresses distributed output tracking control with relative output information by collecting system data instead of using an accurate system model. The new algorithm not only removes the initial stabilizing assumption but also guarantees convergence in a model-free fashion. Potential applications of the proposed control algorithms include cooperative formation control and secondary control of microgrids.
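The abstract contrasts policy iteration, which needs an initial stabilizing control policy, with value iteration, which does not. The following minimal sketch illustrates that distinction on a standard discrete-time LQR problem using known, hypothetical matrices A, B, Q, R; it is not the paper's data-driven algorithm, which learns the gains from measured data without knowing A and B, but the iteration structure is analogous.

```python
# Minimal sketch (not the paper's algorithm): policy iteration (PI) vs. value
# iteration (VI) for a discrete-time LQR problem, showing why PI needs an
# initial stabilizing gain while VI can start from P = 0.
# A, B, Q, R below are hypothetical example matrices.
import numpy as np
from scipy.linalg import solve_discrete_lyapunov

A = np.array([[1.0, 0.1], [0.0, 1.0]])   # assumed example dynamics
B = np.array([[0.0], [0.1]])
Q = np.eye(2)
R = np.eye(1)

def policy_iteration(K0, iters=50):
    """Hewer-style PI: requires K0 such that A - B @ K0 is Schur stable."""
    K = K0
    for _ in range(iters):
        Ac = A - B @ K
        # Policy evaluation: solve the Lyapunov equation
        #   P = Ac^T P Ac + Q + K^T R K
        P = solve_discrete_lyapunov(Ac.T, Q + K.T @ R @ K)
        # Policy improvement: K = (R + B^T P B)^{-1} B^T P A
        K = np.linalg.solve(R + B.T @ P @ B, B.T @ P @ A)
    return K, P

def value_iteration(iters=500):
    """Riccati recursion: converges from P = 0, no stabilizing policy needed."""
    P = np.zeros_like(Q)
    for _ in range(iters):
        P = Q + A.T @ P @ A - A.T @ P @ B @ np.linalg.solve(
            R + B.T @ P @ B, B.T @ P @ A)
    K = np.linalg.solve(R + B.T @ P @ B, B.T @ P @ A)
    return K, P

K_vi, _ = value_iteration()
K_pi, _ = policy_iteration(K0=K_vi)  # a stabilizing gain from VI seeds PI
print("VI gain:", K_vi)
print("PI gain:", K_pi)
```

Seeding PI with a gain obtained from VI, as in the last lines above, loosely mirrors the switching idea described in the abstract: use a VI-type phase to obtain a stabilizing policy without any model knowledge, then switch to a PI-type phase for fast convergence.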

Original language: English
Pages (from-to): 1-11
Number of pages: 11
Journal: IEEE Transactions on Automation Science and Engineering
DOIs
Publication status: Accepted/In press - 2024

Keywords

  • Control systems
  • Heterogeneous multi-agent systems (HMASs)
  • Heuristic algorithms
  • Multi-agent systems
  • Observers
  • Regulation
  • Reinforcement learning
  • Switches
  • distributed cooperative control
  • unknown dynamics
