Time-attenuating Twin Delayed DDPG for Quadrotor Tracking Control

Boyuan Deng; Jian Sun; Zhuo Li; Gang Wang

doi:10.23919/CCC58697.2023.10241100

Time-attenuating Twin Delayed DDPG for Quadrotor Tracking Control

Boyuan Deng, Jian Sun^*, Zhuo Li, Gang Wang

^*此作品的通讯作者

自动化学院

Beijing Institute of Technology

科研成果: 书/报告/会议事项章节 › 会议稿件 › 同行评审

摘要

Continuous trajectory tracking control of quadrotors is challenging when considering noise from the environment. Due to the difficulty in modeling the environmental dynamics, tracking methodologies based on conventional control theory, such as model predictive control, have limitations on tracking accuracy and response time. We propose a time-attenuating twin delayed DDPG, a model-free algorithm that is robust to noise, to better handle the trajectory tracking task. A deep reinforcement learning framework is constructed, where a time decay strategy is designed to avoid trapping into local optima. The experimental results show that the tracking error is significantly small, and the operation time is one-tenth of that of a traditional algorithm. The OpenAI Mujoco[1] tool is used to verify the proposed algorithm, and the simulation results show that, the proposed method can significantly improve the training efficiency and effectively improve the accuracy and convergence stability.

源语言	英语
主期刊名	2023 42nd Chinese Control Conference, CCC 2023
出版商	IEEE Computer Society
页	2323-2328
页数	6
ISBN（电子版）	9789887581543
DOI	https://doi.org/10.23919/CCC58697.2023.10241100
出版状态	已出版 - 2023
活动	42nd Chinese Control Conference, CCC 2023 - Tianjin, 中国期限: 24 7月 2023 → 26 7月 2023

出版系列

姓名	Chinese Control Conference, CCC
卷	2023-July
ISSN（印刷版）	1934-1768
ISSN（电子版）	2161-2927

会议

会议	42nd Chinese Control Conference, CCC 2023
国家/地区	中国
市	Tianjin
时期	24/07/23 → 26/07/23

访问文件

10.23919/CCC58697.2023.10241100

其它文件与链接

链接到 Scopus 的出版物

引用此

@inproceedings{47155073f6924a63944052312509cb13,

title = "Time-attenuating Twin Delayed DDPG for Quadrotor Tracking Control",

abstract = "Continuous trajectory tracking control of quadrotors is challenging when considering noise from the environment. Due to the difficulty in modeling the environmental dynamics, tracking methodologies based on conventional control theory, such as model predictive control, have limitations on tracking accuracy and response time. We propose a time-attenuating twin delayed DDPG, a model-free algorithm that is robust to noise, to better handle the trajectory tracking task. A deep reinforcement learning framework is constructed, where a time decay strategy is designed to avoid trapping into local optima. The experimental results show that the tracking error is significantly small, and the operation time is one-tenth of that of a traditional algorithm. The OpenAI Mujoco[1] tool is used to verify the proposed algorithm, and the simulation results show that, the proposed method can significantly improve the training efficiency and effectively improve the accuracy and convergence stability.",

keywords = "deep reinforcement learning, quadrotor, trajectory tracking control",

author = "Boyuan Deng and Jian Sun and Zhuo Li and Gang Wang",

note = "Publisher Copyright: {\textcopyright} 2023 Technical Committee on Control Theory, Chinese Association of Automation.; 42nd Chinese Control Conference, CCC 2023 ; Conference date: 24-07-2023 Through 26-07-2023",

year = "2023",

doi = "10.23919/CCC58697.2023.10241100",

language = "English",

series = "Chinese Control Conference, CCC",

publisher = "IEEE Computer Society",

pages = "2323--2328",

booktitle = "2023 42nd Chinese Control Conference, CCC 2023",

address = "United States",

}

TY - GEN

T1 - Time-attenuating Twin Delayed DDPG for Quadrotor Tracking Control

AU - Deng, Boyuan

AU - Sun, Jian

AU - Li, Zhuo

AU - Wang, Gang

PY - 2023

Y1 - 2023

N2 - Continuous trajectory tracking control of quadrotors is challenging when considering noise from the environment. Due to the difficulty in modeling the environmental dynamics, tracking methodologies based on conventional control theory, such as model predictive control, have limitations on tracking accuracy and response time. We propose a time-attenuating twin delayed DDPG, a model-free algorithm that is robust to noise, to better handle the trajectory tracking task. A deep reinforcement learning framework is constructed, where a time decay strategy is designed to avoid trapping into local optima. The experimental results show that the tracking error is significantly small, and the operation time is one-tenth of that of a traditional algorithm. The OpenAI Mujoco[1] tool is used to verify the proposed algorithm, and the simulation results show that, the proposed method can significantly improve the training efficiency and effectively improve the accuracy and convergence stability.

AB - Continuous trajectory tracking control of quadrotors is challenging when considering noise from the environment. Due to the difficulty in modeling the environmental dynamics, tracking methodologies based on conventional control theory, such as model predictive control, have limitations on tracking accuracy and response time. We propose a time-attenuating twin delayed DDPG, a model-free algorithm that is robust to noise, to better handle the trajectory tracking task. A deep reinforcement learning framework is constructed, where a time decay strategy is designed to avoid trapping into local optima. The experimental results show that the tracking error is significantly small, and the operation time is one-tenth of that of a traditional algorithm. The OpenAI Mujoco[1] tool is used to verify the proposed algorithm, and the simulation results show that, the proposed method can significantly improve the training efficiency and effectively improve the accuracy and convergence stability.

KW - deep reinforcement learning

KW - quadrotor

KW - trajectory tracking control

UR - http://www.scopus.com/inward/record.url?scp=85175535581&partnerID=8YFLogxK

U2 - 10.23919/CCC58697.2023.10241100

DO - 10.23919/CCC58697.2023.10241100

M3 - Conference contribution

AN - SCOPUS:85175535581

T3 - Chinese Control Conference, CCC

SP - 2323

EP - 2328

BT - 2023 42nd Chinese Control Conference, CCC 2023

PB - IEEE Computer Society

T2 - 42nd Chinese Control Conference, CCC 2023

Y2 - 24 July 2023 through 26 July 2023

ER -

Time-attenuating Twin Delayed DDPG for Quadrotor Tracking Control

摘要

出版系列

会议

访问文件

其它文件与链接

指纹

引用此