基于深度强化学习的驾驶仪参数快速整定方法

Qitian Wan; Baogang Lu; Yaxin Zhao; Qiuqiu Wen

doi:10.12305/j.issn.1001-506X.2022.10.23

基于深度强化学习的驾驶仪参数快速整定方法

Qitian Wan, Baogang Lu, Yaxin Zhao, Qiuqiu Wen^*

^*此作品的通讯作者

宇航学院

科研成果: 期刊稿件 › 文章 › 同行评审

3 引用（Scopus）

摘要

Aiming at the problem of slow training speed and poor convergence of deep reinforcement learning method for the autopilot control parameters training, an intelligent training method that converts three-dimensional control parameters into one-dimensional design parameters is proposed with the three-loop autopilot pole placement method as the core. The intelligent control architecture of offline deep reinforcement learning training and online multi-layer perceptron neural network real-time calculation is constructed, which improves the efficiency and convergence of deep reinforcement learning algorithm and ensures the rapid online tuning of control parameters under the condition of large-scale flight state changes. Taking a typical reentry aircraft as an example, the deep reinforcement learning training and neural network deployment are accomplished. The simulation results show that the training efficiency of the simplified reinforcement learning action space is higher, and the tracking error of the controller to the control command is less than 1.2% by the proposed parameter rapid tuning method based on deep reinforcement learning.

投稿的翻译标题	Autopilot parameter rapid tuning method based on deep reinforcement learning
源语言	繁体中文
页（从-至）	3190-3199
页数	10
期刊	Xi Tong Gong Cheng Yu Dian Zi Ji Shu/Systems Engineering and Electronics
卷	44
期	10
DOI	https://doi.org/10.12305/j.issn.1001-506X.2022.10.23
出版状态	已出版 - 10月 2022

关键词

autopilot
intelligent control
normalization
parameter tuning
reinforcement learning

访问文件

10.12305/j.issn.1001-506X.2022.10.23

其它文件与链接

链接到 Scopus 的出版物

引用此

Wan, Q., Lu, B., Zhao, Y., & Wen, Q. (2022). 基于深度强化学习的驾驶仪参数快速整定方法. Xi Tong Gong Cheng Yu Dian Zi Ji Shu/Systems Engineering and Electronics, 44(10), 3190-3199. https://doi.org/10.12305/j.issn.1001-506X.2022.10.23

@article{00739519d2ad4d39b41bf09f9b6bb683,

title = "基于深度强化学习的驾驶仪参数快速整定方法",

abstract = "Aiming at the problem of slow training speed and poor convergence of deep reinforcement learning method for the autopilot control parameters training, an intelligent training method that converts three-dimensional control parameters into one-dimensional design parameters is proposed with the three-loop autopilot pole placement method as the core. The intelligent control architecture of offline deep reinforcement learning training and online multi-layer perceptron neural network real-time calculation is constructed, which improves the efficiency and convergence of deep reinforcement learning algorithm and ensures the rapid online tuning of control parameters under the condition of large-scale flight state changes. Taking a typical reentry aircraft as an example, the deep reinforcement learning training and neural network deployment are accomplished. The simulation results show that the training efficiency of the simplified reinforcement learning action space is higher, and the tracking error of the controller to the control command is less than 1.2% by the proposed parameter rapid tuning method based on deep reinforcement learning.",

keywords = "autopilot, intelligent control, normalization, parameter tuning, reinforcement learning",

author = "Qitian Wan and Baogang Lu and Yaxin Zhao and Qiuqiu Wen",

year = "2022",

month = oct,

doi = "10.12305/j.issn.1001-506X.2022.10.23",

language = "繁体中文",

volume = "44",

pages = "3190--3199",

journal = "Xi Tong Gong Cheng Yu Dian Zi Ji Shu/Systems Engineering and Electronics",

issn = "1001-506X",

publisher = "Chinese Institute of Electronics",

number = "10",

}

TY - JOUR

T1 - 基于深度强化学习的驾驶仪参数快速整定方法

AU - Wan, Qitian

AU - Lu, Baogang

AU - Zhao, Yaxin

AU - Wen, Qiuqiu

PY - 2022/10

Y1 - 2022/10

N2 - Aiming at the problem of slow training speed and poor convergence of deep reinforcement learning method for the autopilot control parameters training, an intelligent training method that converts three-dimensional control parameters into one-dimensional design parameters is proposed with the three-loop autopilot pole placement method as the core. The intelligent control architecture of offline deep reinforcement learning training and online multi-layer perceptron neural network real-time calculation is constructed, which improves the efficiency and convergence of deep reinforcement learning algorithm and ensures the rapid online tuning of control parameters under the condition of large-scale flight state changes. Taking a typical reentry aircraft as an example, the deep reinforcement learning training and neural network deployment are accomplished. The simulation results show that the training efficiency of the simplified reinforcement learning action space is higher, and the tracking error of the controller to the control command is less than 1.2% by the proposed parameter rapid tuning method based on deep reinforcement learning.

AB - Aiming at the problem of slow training speed and poor convergence of deep reinforcement learning method for the autopilot control parameters training, an intelligent training method that converts three-dimensional control parameters into one-dimensional design parameters is proposed with the three-loop autopilot pole placement method as the core. The intelligent control architecture of offline deep reinforcement learning training and online multi-layer perceptron neural network real-time calculation is constructed, which improves the efficiency and convergence of deep reinforcement learning algorithm and ensures the rapid online tuning of control parameters under the condition of large-scale flight state changes. Taking a typical reentry aircraft as an example, the deep reinforcement learning training and neural network deployment are accomplished. The simulation results show that the training efficiency of the simplified reinforcement learning action space is higher, and the tracking error of the controller to the control command is less than 1.2% by the proposed parameter rapid tuning method based on deep reinforcement learning.

KW - autopilot

KW - intelligent control

KW - normalization

KW - parameter tuning

KW - reinforcement learning

UR - http://www.scopus.com/inward/record.url?scp=85143069698&partnerID=8YFLogxK

U2 - 10.12305/j.issn.1001-506X.2022.10.23

DO - 10.12305/j.issn.1001-506X.2022.10.23

M3 - 文章

AN - SCOPUS:85143069698

SN - 1001-506X

VL - 44

SP - 3190

EP - 3199

JO - Xi Tong Gong Cheng Yu Dian Zi Ji Shu/Systems Engineering and Electronics

JF - Xi Tong Gong Cheng Yu Dian Zi Ji Shu/Systems Engineering and Electronics

IS - 10

ER -

基于深度强化学习的驾驶仪参数快速整定方法

摘要

关键词

访问文件

其它文件与链接

指纹

引用此