基于深度强化学习的驾驶仪参数快速整定方法

Qitian Wan, Baogang Lu, Yaxin Zhao, Qiuqiu Wen*

*此作品的通讯作者

科研成果: 期刊稿件文章同行评审

2 引用 (Scopus)

摘要

Aiming at the problem of slow training speed and poor convergence of deep reinforcement learning method for the autopilot control parameters training, an intelligent training method that converts three-dimensional control parameters into one-dimensional design parameters is proposed with the three-loop autopilot pole placement method as the core. The intelligent control architecture of offline deep reinforcement learning training and online multi-layer perceptron neural network real-time calculation is constructed, which improves the efficiency and convergence of deep reinforcement learning algorithm and ensures the rapid online tuning of control parameters under the condition of large-scale flight state changes. Taking a typical reentry aircraft as an example, the deep reinforcement learning training and neural network deployment are accomplished. The simulation results show that the training efficiency of the simplified reinforcement learning action space is higher, and the tracking error of the controller to the control command is less than 1.2% by the proposed parameter rapid tuning method based on deep reinforcement learning.

投稿的翻译标题Autopilot parameter rapid tuning method based on deep reinforcement learning
源语言繁体中文
页(从-至)3190-3199
页数10
期刊Xi Tong Gong Cheng Yu Dian Zi Ji Shu/Systems Engineering and Electronics
44
10
DOI
出版状态已出版 - 10月 2022

关键词

  • autopilot
  • intelligent control
  • normalization
  • parameter tuning
  • reinforcement learning

指纹

探究 '基于深度强化学习的驾驶仪参数快速整定方法' 的科研主题。它们共同构成独一无二的指纹。

引用此