Collaborative Deep Reinforcement Learning Method for Multi-Objective Parameter Tuning (面向多目标参数整定的协同深度强化学习方法)

Senlin Luo, Jixun Wei, Xiaoshuang Liu, Limin Pan^*

^*Corresponding author of this work

doi:10.15918/j.tbit1001-0645.2021.218

School of Information and Electronics

Beijing Institute of Technology

Research output: Contribution to journal › Article › Peer-reviewed

Abstract

The joint optimization and tuning of multi-objective control parameters is a key issue in keeping an automation system running efficiently and stably. Reinforcement learning is often used to build an automated parameter-adjustment agent that can replace experts in parameter tuning. Existing methods use fixed weights to linearly combine multiple optimization objectives into a single objective and train a single agent model with fixed tuning knowledge. When the actual relationship between the objectives does not match the initial weighting, the agent cannot perceive the mismatch or adapt its decisions, which limits the effect of parameter tuning. To solve this problem, a collaborative deep reinforcement learning method is proposed for multi-objective parameter tuning. First, offline simulation is used to learn tuning knowledge for the objectives and to establish multiple Double-DQN agents. Then, tuning-effect feedback is established online to perceive the actual relationship between the objectives and to adjust the agents' coordination strategy, achieving effective multi-objective parameter tuning. Experimental results on automatic train operation parameter tuning show that the proposed method performs better on the two goals of parking error and comfort, adapts to different track performance, and continues to optimize, giving it great practical value.
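As a rough illustration of the two ideas the abstract contrasts, the sketch below shows a fixed-weight linear combination of per-objective rewards and the standard Double-DQN bootstrap target. It is not the authors' implementation: the function names, the weights, the discount factor, and the sample Q-values are all hypothetical, and the paper's online coordination strategy is not reproduced here.

```python
from typing import Sequence


def scalarize(rewards: Sequence[float], weights: Sequence[float]) -> float:
    """Fixed-weight linear combination of per-objective rewards.

    This is the single-objective baseline the abstract describes;
    the weights are hypothetical and never change after being set.
    """
    if len(rewards) != len(weights):
        raise ValueError("rewards and weights must have the same length")
    return sum(w * r for w, r in zip(weights, rewards))


def double_dqn_target(
    reward: float,
    next_q_online: Sequence[float],
    next_q_target: Sequence[float],
    gamma: float = 0.9,
    done: bool = False,
) -> float:
    """Standard Double-DQN target for one transition.

    The online network picks the next action; the target network
    evaluates it. Both Q-value lists are indexed by action.
    """
    if done:
        # Terminal transition: no bootstrapping.
        return reward
    best_action = max(range(len(next_q_online)), key=lambda a: next_q_online[a])
    return reward + gamma * next_q_target[best_action]


# Hypothetical numbers for two objectives (e.g. parking error, comfort).
combined = scalarize([1.0, -0.5], [0.5, 0.5])
target = double_dqn_target(1.0, [0.2, 0.8], [0.5, 0.3], gamma=0.9)
```

In this sketch the online network prefers action 1, so the target uses the target network's value for action 1 rather than its own maximum, which is the decoupling that distinguishes Double-DQN from plain DQN.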

Translated title of the contribution	Collaborative Deep Reinforcement Learning Method for Multi-Objective Parameter Tuning
Original language	Chinese (Traditional)
Pages (from-to)	969-975
Number of pages	7
Journal	Beijing Ligong Daxue Xuebao/Transaction of Beijing Institute of Technology
Volume	42
Issue number	9
DOI	https://doi.org/10.15918/j.tbit1001-0645.2021.218
Publication status	Published - Sep 2022

Keywords

automation system
coordination
multi-objective
parameter tuning
reinforcement learning

Access to document

10.15918/j.tbit1001-0645.2021.218

Other files and links

Link to publication in Scopus

Cite this

@article{577a2c5bd3b94d938d9579aaa2e2fcf1,

title = "面向多目标参数整定的协同深度强化学习方法",

keywords = "automation system, coordination, multi-objective, parameter tuning, reinforcement learning",

author = "Senlin Luo and Jixun Wei and Xiaoshuang Liu and Limin Pan",

year = "2022",

month = sep,

doi = "10.15918/j.tbit1001-0645.2021.218",

language = "Chinese (Traditional)",

volume = "42",

pages = "969--975",

journal = "Beijing Ligong Daxue Xuebao/Transaction of Beijing Institute of Technology",

issn = "1001-0645",

publisher = "Beijing Institute of Technology",

number = "9",

}

TY - JOUR

T1 - 面向多目标参数整定的协同深度强化学习方法

AU - Luo, Senlin

AU - Wei, Jixun

AU - Liu, Xiaoshuang

AU - Pan, Limin

PY - 2022/9

Y1 - 2022/9

KW - automation system

KW - coordination

KW - multi-objective

KW - parameter tuning

KW - reinforcement learning

UR - http://www.scopus.com/inward/record.url?scp=85140214758&partnerID=8YFLogxK

U2 - 10.15918/j.tbit1001-0645.2021.218

DO - 10.15918/j.tbit1001-0645.2021.218

M3 - Article

AN - SCOPUS:85140214758

SN - 1001-0645

VL - 42

SP - 969

EP - 975

JO - Beijing Ligong Daxue Xuebao/Transaction of Beijing Institute of Technology

JF - Beijing Ligong Daxue Xuebao/Transaction of Beijing Institute of Technology

IS - 9

ER -
