Learning Aerial Docking via Offline-to-Online Reinforcement Learning

Yang Tao, Feng Yuting, Yushu Yu*

*此作品的通讯作者

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

In this study, we explore the intricate task of executing a docking operation between two quadrotors, utilizing a blend of offline and online reinforcement learning techniques. This multifaceted task is fraught with the catastrophic forgetting problem due to its multiple objectives: it necessitates precise control over the trajectory of the quadrotors, ensures their stability during the docking process, and maintains stable control post-docking. The complexity of these objectives presents significant challenges when applying reinforcement learning directly, often leading to unsuccessful outcomes. To navigate these challenges, we initially developed a rule-based expert controller and amassed a substantial dataset. Subsequently, we employed offline reinforcement learning to train a guided policy, which was then fine-tuned using online reinforcement learning. This approach effectively addressed the out-of-distribution issues typically encountered in online reinforcement learning with guided policies. Notably, our methodology significantly enhanced the success rate of the expert strategy, boosting it from 40% to an impressive 95%1,.

源语言英语
主期刊名2024 4th International Conference on Computer, Control and Robotics, ICCCR 2024
出版商Institute of Electrical and Electronics Engineers Inc.
305-309
页数5
ISBN(电子版)9798350373141
DOI
出版状态已出版 - 2024
活动4th International Conference on Computer, Control and Robotics, ICCCR 2024 - Shanghai, 中国
期限: 19 4月 202421 4月 2024

出版系列

姓名2024 4th International Conference on Computer, Control and Robotics, ICCCR 2024

会议

会议4th International Conference on Computer, Control and Robotics, ICCCR 2024
国家/地区中国
Shanghai
时期19/04/2421/04/24

指纹

探究 'Learning Aerial Docking via Offline-to-Online Reinforcement Learning' 的科研主题。它们共同构成独一无二的指纹。

引用此