TY - GEN
T1 - Zero-Shot Sim-To-Real Transfer of Robust and Generic Quadrotor Controller by Deep Reinforcement Learning
AU - Zhang, Meina
AU - Li, Mingyang
AU - Wang, Kaidi
AU - Yang, Tao
AU - Feng, Yuting
AU - Yu, Yushu
N1 - Publisher Copyright:
© 2024, The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
PY - 2024
Y1 - 2024
N2 - The goal of this paper is to develop a controller that can be trained in a simulation environment and seamlessly applied to different types of real-world quadrotors without requiring any additional adaptation or fine-tuning. First, a training environment framework for a generic quadrotor based on the high-fidelity dynamics model is designed. The input for the training environment consists of angular velocity and thrust. Next, the policy network and the detailed policy learning procedure are presented. The training process includes investigating and mitigating differences in dynamics, sensor noise, and environmental conditions between the simulation and real-world quadrotor systems. Efforts are also made to increase the continuity of the action output from the policy during training. The efficiency of the proposed approach is demonstrated through a series of real-world experiments. The trained controller exhibits remarkable robustness and versatility across different quadrotor models, successfully completing flight tasks in real-world scenarios without requiring additional training or modifications. These results highlight the potential of deep reinforcement learning for achieving zero-shot sim-to-real transfer in the domain of quadrotor control.
AB - The goal of this paper is to develop a controller that can be trained in a simulation environment and seamlessly applied to different types of real-world quadrotors without requiring any additional adaptation or fine-tuning. First, a training environment framework for a generic quadrotor based on the high-fidelity dynamics model is designed. The input for the training environment consists of angular velocity and thrust. Next, the policy network and the detailed policy learning procedure are presented. The training process includes investigating and mitigating differences in dynamics, sensor noise, and environmental conditions between the simulation and real-world quadrotor systems. Efforts are also made to increase the continuity of the action output from the policy during training. The efficiency of the proposed approach is demonstrated through a series of real-world experiments. The trained controller exhibits remarkable robustness and versatility across different quadrotor models, successfully completing flight tasks in real-world scenarios without requiring additional training or modifications. These results highlight the potential of deep reinforcement learning for achieving zero-shot sim-to-real transfer in the domain of quadrotor control.
KW - Quadrotor Control
KW - Reinforcement Learning
KW - Sim-to-real Transfer
UR - http://www.scopus.com/inward/record.url?scp=85176922977&partnerID=8YFLogxK
U2 - 10.1007/978-981-99-8021-5_3
DO - 10.1007/978-981-99-8021-5_3
M3 - Conference contribution
AN - SCOPUS:85176922977
SN - 9789819980208
T3 - Communications in Computer and Information Science
SP - 27
EP - 43
BT - Cognitive Systems and Information Processing - 8th International Conference, ICCSIP 2023, Revised Selected Papers
A2 - Sun, Fuchun
A2 - Fang, Bin
A2 - Meng, Qinghu
A2 - Fu, Zhumu
PB - Springer Science and Business Media Deutschland GmbH
T2 - 8th International Conference on Cognitive Systems and Information Processing, ICCSIP 2023
Y2 - 10 August 2023 through 12 August 2023
ER -