Time-attenuating Twin Delayed DDPG for Quadrotor Tracking Control

Boyuan Deng, Jian Sun*, Zhuo Li, Gang Wang

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Continuous trajectory tracking control of quadrotors is challenging when considering noise from the environment. Due to the difficulty in modeling the environmental dynamics, tracking methodologies based on conventional control theory, such as model predictive control, have limitations on tracking accuracy and response time. We propose a time-attenuating twin delayed DDPG, a model-free algorithm that is robust to noise, to better handle the trajectory tracking task. A deep reinforcement learning framework is constructed, where a time decay strategy is designed to avoid trapping into local optima. The experimental results show that the tracking error is significantly small, and the operation time is one-tenth of that of a traditional algorithm. The OpenAI Mujoco[1] tool is used to verify the proposed algorithm, and the simulation results show that, the proposed method can significantly improve the training efficiency and effectively improve the accuracy and convergence stability.

Original languageEnglish
Title of host publication2023 42nd Chinese Control Conference, CCC 2023
PublisherIEEE Computer Society
Pages2323-2328
Number of pages6
ISBN (Electronic)9789887581543
DOIs
Publication statusPublished - 2023
Event42nd Chinese Control Conference, CCC 2023 - Tianjin, China
Duration: 24 Jul 202326 Jul 2023

Publication series

NameChinese Control Conference, CCC
Volume2023-July
ISSN (Print)1934-1768
ISSN (Electronic)2161-2927

Conference

Conference42nd Chinese Control Conference, CCC 2023
Country/TerritoryChina
CityTianjin
Period24/07/2326/07/23

Keywords

  • deep reinforcement learning
  • quadrotor
  • trajectory tracking control

Fingerprint

Dive into the research topics of 'Time-attenuating Twin Delayed DDPG for Quadrotor Tracking Control'. Together they form a unique fingerprint.

Cite this