Temporal Fusion Pointer network-based Reinforcement Learning algorithm for Multi-Objective Workflow Scheduling in the cloud

Binyang Wang; Huifang Li; Zhiwei Lin; Yuanqing Xia

doi:10.1109/IJCNN48605.2020.9207151

School of Automation

Beijing Institute of Technology

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

8 Citations (Scopus)

Abstract

Cloud computing is emerging as a promising deployment environment for hosting the exponentially growing number of scientific and social media applications, and executing these applications efficiently depends mainly on workflow scheduling. However, scheduling workflows in the cloud is an NP-hard problem, and existing solutions have limitations when applied to real-world scenarios. In this paper, a Temporal Fusion Pointer network-based Reinforcement Learning algorithm for multi-objective workflow scheduling (TFP-RL) is proposed. By adopting reinforcement learning, our algorithm can discover its own heuristics over time, learning continuously from the rewards that good scheduling solutions produce. To make more comprehensive scheduling decisions that account for the influence of historical actions, a novel temporal fusion pointer network (TFP) is designed for the reinforcement learning agent; it improves the quality of the resulting solutions and the ability of the algorithm to handle a variety of workflow applications. To decrease convergence time, we train the proposed TFP-RL model independently with the Asynchronous Advantage Actor-Critic method and use the resulting model for scheduling workflows. Finally, under a multi-agent reinforcement learning framework, a Pareto dominance-oriented criterion for reasonable action selection is established for the multi-objective optimization scenario. We first train our TFP-RL model on randomly generated workflows to validate its effectiveness in scheduling, then compare the trained model with existing scheduling approaches on practical compute- and data-intensive workflows. Experimental results demonstrate that the proposed algorithm outperforms the benchmark algorithms across different metrics.
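
The abstract does not spell out the Pareto dominance-oriented criterion, so the following is only a minimal sketch of the general idea it builds on: filtering candidate actions down to those that no other candidate Pareto-dominates. It assumes two objectives that are both minimized (for illustration, makespan and cost, which the abstract does not name), and the function names `dominates` and `non_dominated_actions` are hypothetical, not taken from the paper.

```python
from typing import List, Sequence


def dominates(a: Sequence[float], b: Sequence[float]) -> bool:
    """Return True if objective vector a Pareto-dominates b.

    Assumption: every objective is minimized. a dominates b when it is
    no worse in all objectives and strictly better in at least one.
    """
    no_worse = all(x <= y for x, y in zip(a, b))
    strictly_better = any(x < y for x, y in zip(a, b))
    return no_worse and strictly_better


def non_dominated_actions(candidates: List[Sequence[float]]) -> List[int]:
    """Return indices of candidates that no other candidate dominates."""
    return [
        i
        for i, ci in enumerate(candidates)
        if not any(dominates(cj, ci) for j, cj in enumerate(candidates) if j != i)
    ]


if __name__ == "__main__":
    # Hypothetical (makespan, cost) estimates for three candidate actions.
    candidates = [(10.0, 5.0), (12.0, 4.0), (11.0, 6.0)]
    print(non_dominated_actions(candidates))  # prints [0, 1]
```

The third candidate is worse than the first in both objectives and is filtered out, while the first two trade makespan against cost and both remain. How the paper's agents choose among the remaining actions is not stated in the abstract.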

Original language	English
Title of host publication	2020 International Joint Conference on Neural Networks, IJCNN 2020 - Proceedings
Publisher	Institute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)	9781728169262
DOIs	https://doi.org/10.1109/IJCNN48605.2020.9207151
Publication status	Published - Jul 2020
Event	2020 International Joint Conference on Neural Networks, IJCNN 2020 - Virtual, Glasgow, United Kingdom (Duration: 19 Jul 2020 → 24 Jul 2020)

Publication series

Name	Proceedings of the International Joint Conference on Neural Networks

Conference

Conference	2020 International Joint Conference on Neural Networks, IJCNN 2020
Country/Territory	United Kingdom
City	Virtual, Glasgow
Period	19/07/20 → 24/07/20

Keywords

Cloud computing
Multi-objective workflow scheduling
Neural networks
Reinforcement Learning

Access to Document

10.1109/IJCNN48605.2020.9207151

Cite this

Wang, B., Li, H., Lin, Z., & Xia, Y. (2020). Temporal Fusion Pointer network-based Reinforcement Learning algorithm for Multi-Objective Workflow Scheduling in the cloud. In 2020 International Joint Conference on Neural Networks, IJCNN 2020 - Proceedings, Article 9207151 (Proceedings of the International Joint Conference on Neural Networks). Institute of Electrical and Electronics Engineers Inc. https://doi.org/10.1109/IJCNN48605.2020.9207151

Wang, Binyang ; Li, Huifang ; Lin, Zhiwei et al. / Temporal Fusion Pointer network-based Reinforcement Learning algorithm for Multi-Objective Workflow Scheduling in the cloud. 2020 International Joint Conference on Neural Networks, IJCNN 2020 - Proceedings. Institute of Electrical and Electronics Engineers Inc., 2020. (Proceedings of the International Joint Conference on Neural Networks).

@inproceedings{bdc62aff917f4b74a8f1eb75c8483670,
title = "Temporal Fusion Pointer network-based Reinforcement Learning algorithm for Multi-Objective Workflow Scheduling in the cloud",
keywords = "Cloud computing, Multi-objective workflow scheduling, Neural networks, Reinforcement Learning",
author = "Binyang Wang and Huifang Li and Zhiwei Lin and Yuanqing Xia",
note = "Publisher Copyright: {\textcopyright} 2020 IEEE.; 2020 International Joint Conference on Neural Networks, IJCNN 2020 ; Conference date: 19-07-2020 Through 24-07-2020",
year = "2020",
month = jul,
doi = "10.1109/IJCNN48605.2020.9207151",
language = "English",
series = "Proceedings of the International Joint Conference on Neural Networks",
publisher = "Institute of Electrical and Electronics Engineers Inc.",
booktitle = "2020 International Joint Conference on Neural Networks, IJCNN 2020 - Proceedings",
address = "United States",
}

Wang, B, Li, H, Lin, Z & Xia, Y 2020, Temporal Fusion Pointer network-based Reinforcement Learning algorithm for Multi-Objective Workflow Scheduling in the cloud. in 2020 International Joint Conference on Neural Networks, IJCNN 2020 - Proceedings, 9207151, Proceedings of the International Joint Conference on Neural Networks, Institute of Electrical and Electronics Engineers Inc., 2020 International Joint Conference on Neural Networks, IJCNN 2020, Virtual, Glasgow, United Kingdom, 19/07/20. https://doi.org/10.1109/IJCNN48605.2020.9207151

Temporal Fusion Pointer network-based Reinforcement Learning algorithm for Multi-Objective Workflow Scheduling in the cloud. / Wang, Binyang; Li, Huifang; Lin, Zhiwei et al.
2020 International Joint Conference on Neural Networks, IJCNN 2020 - Proceedings. Institute of Electrical and Electronics Engineers Inc., 2020. 9207151 (Proceedings of the International Joint Conference on Neural Networks).

TY - GEN
T1 - Temporal Fusion Pointer network-based Reinforcement Learning algorithm for Multi-Objective Workflow Scheduling in the cloud
AU - Wang, Binyang
AU - Li, Huifang
AU - Lin, Zhiwei
AU - Xia, Yuanqing
PY - 2020/7
Y1 - 2020/7
KW - Cloud computing
KW - Multi-objective workflow scheduling
KW - Neural networks
KW - Reinforcement Learning
UR - http://www.scopus.com/inward/record.url?scp=85093816795&partnerID=8YFLogxK
U2 - 10.1109/IJCNN48605.2020.9207151
DO - 10.1109/IJCNN48605.2020.9207151
M3 - Conference contribution
AN - SCOPUS:85093816795
T3 - Proceedings of the International Joint Conference on Neural Networks
BT - 2020 International Joint Conference on Neural Networks, IJCNN 2020 - Proceedings
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2020 International Joint Conference on Neural Networks, IJCNN 2020
Y2 - 19 July 2020 through 24 July 2020
ER -

Wang B, Li H, Lin Z, Xia Y. Temporal Fusion Pointer network-based Reinforcement Learning algorithm for Multi-Objective Workflow Scheduling in the cloud. In 2020 International Joint Conference on Neural Networks, IJCNN 2020 - Proceedings. Institute of Electrical and Electronics Engineers Inc. 2020. 9207151. (Proceedings of the International Joint Conference on Neural Networks). doi: 10.1109/IJCNN48605.2020.9207151
