Weighted double deep Q-network based reinforcement learning for bi-objective multi-workflow scheduling in the cloud

Huifang Li*, Jianghang Huang, Binyang Wang, Yushun Fan

*Corresponding author for this work

Research output: Contribution to journal › Article › peer-review

18 Citations (Scopus)

Abstract

As a promising distributed paradigm, cloud computing offers a cost-effective environment for hosting scientific applications because it provisions elastic, heterogeneous resources under a pay-per-use model. More and more applications modeled as workflows are being moved to the cloud, making execution time and cost key concerns. However, scheduling workflows remains challenging because of their large scale and complexity, as well as the cloud’s dynamic characteristics and differing price quotations. In this work, we propose a Weighted Double Deep Q-Network-based Reinforcement Learning algorithm (WDDQN-RL) for scheduling multiple workflows that obtains near-optimal solutions in a relatively short time while minimizing both makespan and cost. Specifically, we first introduce a dynamic coefficient-based adaptive balancing method into WDDQN to improve the accuracy of target value estimation by trading off Deep Q-Network (DQN) overestimation against Double Deep Q-Network (DDQN) underestimation. Second, we design pointer network-based agents and a two-level scheduling strategy: at the first level, pointer networks process a variable-size set of candidate tasks, and at the second level, the selected task is fed to agents that allocate resources to it. Third, we present a dynamic sensing mechanism that adjusts the model’s attention to each individual objective, increasing the diversity of solutions while guaranteeing their quality. Experimental results show that our algorithm outperforms the benchmark approaches on various indicators.
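The balancing idea in the first contribution can be illustrated with a minimal sketch. The abstract does not give the dynamic coefficient's update rule, so the blend weight below is a fixed, hypothetical parameter `beta`, and the function name `weighted_target` and its arguments are illustrative assumptions rather than the authors' implementation. The sketch only shows how a target value can interpolate between the DQN estimate (which tends to overestimate) and the Double DQN estimate (which tends to underestimate).

```python
from typing import Sequence


def weighted_target(
    reward: float,
    q_online_next: Sequence[float],
    q_target_next: Sequence[float],
    beta: float,
    gamma: float = 0.99,
    done: bool = False,
) -> float:
    """Blend the DQN and Double DQN bootstrap targets.

    beta is a hypothetical fixed weight in [0, 1]; the paper adapts its
    coefficient dynamically, and that rule is not reproduced here.
    """
    if not 0.0 <= beta <= 1.0:
        raise ValueError("beta must lie in [0, 1]")
    if done:
        # Terminal transition: nothing to bootstrap from.
        return reward
    # DQN: the target network both selects and evaluates the next action.
    dqn_value = max(q_target_next)
    # Double DQN: the online network selects, the target network evaluates.
    best_action = max(range(len(q_online_next)), key=lambda a: q_online_next[a])
    ddqn_value = q_target_next[best_action]
    # beta = 1 recovers the DQN target, beta = 0 the Double DQN target.
    blended = beta * dqn_value + (1.0 - beta) * ddqn_value
    return reward + gamma * blended
```

With `beta` between 0 and 1 the target lies between the two estimates, which is the trade-off the abstract describes. How `beta` should change during training is left open in this sketch.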

Original language: English
Pages (from-to): 751-768
Number of pages: 18
Journal: Cluster Computing
Volume: 25
Issue number: 2
Publication status: Published - Apr 2022

Keywords

  • Cloud computing
  • Multi-objective workflow scheduling
  • Reinforcement learning
  • Weighted double deep Q-networks
