TY - JOUR
T1 - Design of Parameter-Self-Tuning Controller Based on Reinforcement Learning for Tracking Noncooperative Targets in Space
AU - Wang, Xiao
AU - Shi, Peng
AU - Wen, Changxuan
AU - Zhao, Yushan
N1 - Publisher Copyright:
© 1965-2011 IEEE.
PY - 2020/12
Y1 - 2020/12
N2 - Tracking space noncooperative targets, including disabled and mobile spacecrafts, remains a challenging problem. This article develops two reinforcement-learning-based parameter-self-tuning controllers for the following two different tracking cases: case A, tracking a disabled target, and case B, tracking a mobile target. An adaptive controller consisting of five model uncertainties is adopted for case A, and a modified PD controller is derived for case B. The actor-critic framework is employed to reduce the initial control accelerations for case A and to improve the terminal tracking accuracy for case B. Relations between control parameters and tracking errors are found through the fuzzy inference system. Finally, the reinforcement learning is used to select suitable control parameters for achieving desired purposes. Numerical experimental results validate the effectiveness of the proposed algorithms on reducing initial control accelerations for case A and improving the terminal tracking accuracy for case B.
AB - Tracking space noncooperative targets, including disabled and mobile spacecrafts, remains a challenging problem. This article develops two reinforcement-learning-based parameter-self-tuning controllers for the following two different tracking cases: case A, tracking a disabled target, and case B, tracking a mobile target. An adaptive controller consisting of five model uncertainties is adopted for case A, and a modified PD controller is derived for case B. The actor-critic framework is employed to reduce the initial control accelerations for case A and to improve the terminal tracking accuracy for case B. Relations between control parameters and tracking errors are found through the fuzzy inference system. Finally, the reinforcement learning is used to select suitable control parameters for achieving desired purposes. Numerical experimental results validate the effectiveness of the proposed algorithms on reducing initial control accelerations for case A and improving the terminal tracking accuracy for case B.
KW - Actor-critic learning
KW - noncooperative target
KW - parameter-tuning
KW - tracking control
UR - http://www.scopus.com/inward/record.url?scp=85097741998&partnerID=8YFLogxK
U2 - 10.1109/TAES.2020.2988170
DO - 10.1109/TAES.2020.2988170
M3 - Article
AN - SCOPUS:85097741998
SN - 0018-9251
VL - 56
SP - 4192
EP - 4208
JO - IEEE Transactions on Aerospace and Electronic Systems
JF - IEEE Transactions on Aerospace and Electronic Systems
IS - 6
M1 - 9076607
ER -