TY - GEN
T1 - Time-attenuating Twin Delayed DDPG for Quadrotor Tracking Control
AU - Deng, Boyuan
AU - Sun, Jian
AU - Li, Zhuo
AU - Wang, Gang
N1 - Publisher Copyright:
© 2023 Technical Committee on Control Theory, Chinese Association of Automation.
PY - 2023
Y1 - 2023
N2 - Continuous trajectory tracking control of quadrotors is challenging when considering noise from the environment. Due to the difficulty in modeling the environmental dynamics, tracking methodologies based on conventional control theory, such as model predictive control, have limitations on tracking accuracy and response time. We propose a time-attenuating twin delayed DDPG, a model-free algorithm that is robust to noise, to better handle the trajectory tracking task. A deep reinforcement learning framework is constructed, where a time decay strategy is designed to avoid trapping into local optima. The experimental results show that the tracking error is significantly small, and the operation time is one-tenth of that of a traditional algorithm. The OpenAI Mujoco[1] tool is used to verify the proposed algorithm, and the simulation results show that, the proposed method can significantly improve the training efficiency and effectively improve the accuracy and convergence stability.
AB - Continuous trajectory tracking control of quadrotors is challenging when considering noise from the environment. Due to the difficulty in modeling the environmental dynamics, tracking methodologies based on conventional control theory, such as model predictive control, have limitations on tracking accuracy and response time. We propose a time-attenuating twin delayed DDPG, a model-free algorithm that is robust to noise, to better handle the trajectory tracking task. A deep reinforcement learning framework is constructed, where a time decay strategy is designed to avoid trapping into local optima. The experimental results show that the tracking error is significantly small, and the operation time is one-tenth of that of a traditional algorithm. The OpenAI Mujoco[1] tool is used to verify the proposed algorithm, and the simulation results show that, the proposed method can significantly improve the training efficiency and effectively improve the accuracy and convergence stability.
KW - deep reinforcement learning
KW - quadrotor
KW - trajectory tracking control
UR - http://www.scopus.com/inward/record.url?scp=85175535581&partnerID=8YFLogxK
U2 - 10.23919/CCC58697.2023.10241100
DO - 10.23919/CCC58697.2023.10241100
M3 - Conference contribution
AN - SCOPUS:85175535581
T3 - Chinese Control Conference, CCC
SP - 2323
EP - 2328
BT - 2023 42nd Chinese Control Conference, CCC 2023
PB - IEEE Computer Society
T2 - 42nd Chinese Control Conference, CCC 2023
Y2 - 24 July 2023 through 26 July 2023
ER -