TY - JOUR
T1 - Weighted double deep Q-network based reinforcement learning for bi-objective multi-workflow scheduling in the cloud
AU - Li, Huifang
AU - Huang, Jianghang
AU - Wang, Binyang
AU - Fan, Yushun
N1 - Publisher Copyright:
© 2021, The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature.
PY - 2022/4
Y1 - 2022/4
AB - As a promising distributed paradigm, cloud computing provides a cost-effective deployment environment for hosting scientific applications because it provisions elastic, heterogeneous resources in a pay-per-use model. More and more applications modeled as workflows are being moved to the cloud, making execution time and cost important concerns. However, scheduling workflows remains challenging due to their large scale and complexity, as well as the cloud’s dynamic characteristics and diverse pricing schemes. In this work, we propose a Weighted Double Deep Q-Network-based Reinforcement Learning algorithm (WDDQN-RL) for scheduling multiple workflows, which obtains near-optimal solutions in a relatively short time while minimizing both makespan and cost. Specifically, we first introduce a dynamic coefficient-based adaptive balancing method into WDDQN to improve the accuracy of target value estimation by trading off Deep Q-Network (DQN) overestimation against Double Deep Q-Network (DDQN) underestimation. Second, we design pointer-network-based agents and a two-level scheduling strategy, in which pointer networks process a variable-sized candidate task set at the first level and the selected task is fed to agents at the second level for resource allocation. Third, we present a dynamic sensing mechanism that adjusts the model’s attention to each individual objective, increasing the diversity of solutions while guaranteeing their quality. Experimental results show that our algorithm outperforms benchmark approaches on various indicators.
KW - Cloud computing
KW - Multi-objective workflow scheduling
KW - Reinforcement learning
KW - Weighted double deep Q-networks
UR - http://www.scopus.com/inward/record.url?scp=85118409341&partnerID=8YFLogxK
U2 - 10.1007/s10586-021-03454-6
DO - 10.1007/s10586-021-03454-6
M3 - Article
AN - SCOPUS:85118409341
SN - 1386-7857
VL - 25
SP - 751
EP - 768
JO - Cluster Computing
JF - Cluster Computing
IS - 2
ER -