An Extensive Application of Model Predictive Control Combined with Policy Search to Multi-agent Agile UAV Flight

Huaxing Xu, Chengwei Yang, Juan Li*, Chang Liu, Yu Yang

*此作品的通讯作者

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Reinforcement Learning (RL) methods can automatically learn complex policies with minimum prior knowledge about the task. Meanwhile, Model Predictive Control (MPC) can achieve excellent control performance with convincing safety and interpretation. Thus, there is a growing amount of research combining the advantages of both RL methods and MPC so that high accuracy and adaptability to highly dynamic environments, or in a word agility, can be achieved during various unmanned aerial vehicles (UAVs) flight tasks. However, current studies mostly solve the control problem for a single UAV. This paper extends such a framework to the problem of controlling a multi-agent system with an MPC controller whose decision variables represented as high-level policies are chosen by policy search. We validate the improved method by flying two drones through a moving gate simultaneously. Experiments in simulation demonstrate that the improved controller can preserve robust, in-time control performance and further avoid collision when there are multiple UAVs, showing a promising aspect for realizing multi-agent agile UAV flight.

源语言英语
主期刊名Advances in Guidance, Navigation and Control - Proceedings of 2022 International Conference on Guidance, Navigation and Control
编辑Liang Yan, Haibin Duan, Yimin Deng, Liang Yan
出版商Springer Science and Business Media Deutschland GmbH
1367-1378
页数12
ISBN(印刷版)9789811966125
DOI
出版状态已出版 - 2023
活动International Conference on Guidance, Navigation and Control, ICGNC 2022 - Harbin, 中国
期限: 5 8月 20227 8月 2022

出版系列

姓名Lecture Notes in Electrical Engineering
845 LNEE
ISSN(印刷版)1876-1100
ISSN(电子版)1876-1119

会议

会议International Conference on Guidance, Navigation and Control, ICGNC 2022
国家/地区中国
Harbin
时期5/08/227/08/22

指纹

探究 'An Extensive Application of Model Predictive Control Combined with Policy Search to Multi-agent Agile UAV Flight' 的科研主题。它们共同构成独一无二的指纹。

引用此