Octopus: An End-to-end Multi-DAG Scheduling Method Based on Deep Reinforcement Learning

Yi Chang*, Haosong Peng, Yufeng Zhan, Yuanqing Xia

*此作品的通讯作者

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

With the rapid growth of cloud computing, more and more vendors are deploying their services to the cloud. Efficient job scheduling is essential for enhancing system operation performance. These services, represented as Directed Acyclic Graphs (DAGs), usually have intricate dependencies. Existing research has limitations in solving the multi-DAG job scheduling problem and often overlooks end-to-end scheduling directly from tasks to servers. For example, scheduling each job individually without considering the overall information of all jobs might lead to an extended total completion time. To address these issues, this paper proposes Octopus, an intelligent end-to-end multi-DAG jobs scheduling algorithm based on deep reinforcement learning. Octopus is designed to address the challenges of dynamic and large input dimensions in the multi-DAG scheduling problem. A graph neural network feature extraction module is designed to extract the topological structure of multi-DAG jobs. The improved kernel-based network is then used to handle dynamic inputs. Simulation experiments conducted on different scales of DAG jobs and servers demonstrate that our approach can reduce the overall completion time of multi-DAG jobs up to 30% compared to traditional scheduling methods.

源语言英语
主期刊名Proceedings of the 43rd Chinese Control Conference, CCC 2024
编辑Jing Na, Jian Sun
出版商IEEE Computer Society
2588-2593
页数6
ISBN(电子版)9789887581581
DOI
出版状态已出版 - 2024
活动43rd Chinese Control Conference, CCC 2024 - Kunming, 中国
期限: 28 7月 202431 7月 2024

出版系列

姓名Chinese Control Conference, CCC
ISSN(印刷版)1934-1768
ISSN(电子版)2161-2927

会议

会议43rd Chinese Control Conference, CCC 2024
国家/地区中国
Kunming
时期28/07/2431/07/24

指纹

探究 'Octopus: An End-to-end Multi-DAG Scheduling Method Based on Deep Reinforcement Learning' 的科研主题。它们共同构成独一无二的指纹。

引用此