TY - GEN
T1 - An Extensive Application of Model Predictive Control Combined with Policy Search to Multi-agent Agile UAV Flight
AU - Xu, Huaxing
AU - Yang, Chengwei
AU - Li, Juan
AU - Liu, Chang
AU - Yang, Yu
N1 - Publisher Copyright:
© 2023, The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
PY - 2023
Y1 - 2023
N2 - Reinforcement Learning (RL) methods can automatically learn complex policies with minimum prior knowledge about the task. Meanwhile, Model Predictive Control (MPC) can achieve excellent control performance with convincing safety and interpretation. Thus, there is a growing amount of research combining the advantages of both RL methods and MPC so that high accuracy and adaptability to highly dynamic environments, or in a word agility, can be achieved during various unmanned aerial vehicles (UAVs) flight tasks. However, current studies mostly solve the control problem for a single UAV. This paper extends such a framework to the problem of controlling a multi-agent system with an MPC controller whose decision variables represented as high-level policies are chosen by policy search. We validate the improved method by flying two drones through a moving gate simultaneously. Experiments in simulation demonstrate that the improved controller can preserve robust, in-time control performance and further avoid collision when there are multiple UAVs, showing a promising aspect for realizing multi-agent agile UAV flight.
AB - Reinforcement Learning (RL) methods can automatically learn complex policies with minimum prior knowledge about the task. Meanwhile, Model Predictive Control (MPC) can achieve excellent control performance with convincing safety and interpretation. Thus, there is a growing amount of research combining the advantages of both RL methods and MPC so that high accuracy and adaptability to highly dynamic environments, or in a word agility, can be achieved during various unmanned aerial vehicles (UAVs) flight tasks. However, current studies mostly solve the control problem for a single UAV. This paper extends such a framework to the problem of controlling a multi-agent system with an MPC controller whose decision variables represented as high-level policies are chosen by policy search. We validate the improved method by flying two drones through a moving gate simultaneously. Experiments in simulation demonstrate that the improved controller can preserve robust, in-time control performance and further avoid collision when there are multiple UAVs, showing a promising aspect for realizing multi-agent agile UAV flight.
KW - Agile UAV flight
KW - Model predictive control
KW - Multi-agent system
KW - Policy search
UR - http://www.scopus.com/inward/record.url?scp=85151140381&partnerID=8YFLogxK
U2 - 10.1007/978-981-19-6613-2_135
DO - 10.1007/978-981-19-6613-2_135
M3 - Conference contribution
AN - SCOPUS:85151140381
SN - 9789811966125
T3 - Lecture Notes in Electrical Engineering
SP - 1367
EP - 1378
BT - Advances in Guidance, Navigation and Control - Proceedings of 2022 International Conference on Guidance, Navigation and Control
A2 - Yan, Liang
A2 - Duan, Haibin
A2 - Deng, Yimin
A2 - Yan, Liang
PB - Springer Science and Business Media Deutschland GmbH
T2 - International Conference on Guidance, Navigation and Control, ICGNC 2022
Y2 - 5 August 2022 through 7 August 2022
ER -