Learning Aerial Docking via Offline-to-Online Reinforcement Learning

Yang Tao, Feng Yuting, Yushu Yu*

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

In this study, we explore the intricate task of executing a docking operation between two quadrotors, utilizing a blend of offline and online reinforcement learning techniques. This multifaceted task is fraught with the catastrophic forgetting problem due to its multiple objectives: it necessitates precise control over the trajectory of the quadrotors, ensures their stability during the docking process, and maintains stable control post-docking. The complexity of these objectives presents significant challenges when applying reinforcement learning directly, often leading to unsuccessful outcomes. To navigate these challenges, we initially developed a rule-based expert controller and amassed a substantial dataset. Subsequently, we employed offline reinforcement learning to train a guided policy, which was then fine-tuned using online reinforcement learning. This approach effectively addressed the out-of-distribution issues typically encountered in online reinforcement learning with guided policies. Notably, our methodology significantly enhanced the success rate of the expert strategy, boosting it from 40% to an impressive 95%1,.

Original languageEnglish
Title of host publication2024 4th International Conference on Computer, Control and Robotics, ICCCR 2024
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages305-309
Number of pages5
ISBN (Electronic)9798350373141
DOIs
Publication statusPublished - 2024
Event4th International Conference on Computer, Control and Robotics, ICCCR 2024 - Shanghai, China
Duration: 19 Apr 202421 Apr 2024

Publication series

Name2024 4th International Conference on Computer, Control and Robotics, ICCCR 2024

Conference

Conference4th International Conference on Computer, Control and Robotics, ICCCR 2024
Country/TerritoryChina
CityShanghai
Period19/04/2421/04/24

Keywords

  • control
  • quadrotor
  • reinforcement learning

Fingerprint

Dive into the research topics of 'Learning Aerial Docking via Offline-to-Online Reinforcement Learning'. Together they form a unique fingerprint.

Cite this