新型分布式计算系统中的异构任务调度框架

Translated title of the contribution: Heterogeneous Task Scheduling Framework in Emerging Distributed Computing Systems

Rui Qi Liu, Bo Yang Li*, Yu Jin Gao, Chang Sheng Li, Heng Tai Zhao, Fu Sheng Jin, Rong Hua Li, Guo Ren Wang

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

2 Citations (Scopus)

Abstract

With the rapid development of big data and machine learning, the distributed big data computing engine for machine learning have emerged. These systems can support both batch distributed learning and incremental learning and verification, with low latency and high performance. However, some of them adopt a random task scheduling strategy, ignoring the performance differences of nodes, which easily lead to uneven load and performance degradation. At the same time, for some tasks, if the resource requirements are not met, the scheduling will fail. In response to these problems, a heterogeneous task scheduling framework is proposed, which can ensure the efficient execution and execution of tasks. Specifically, for the task scheduling module, the proposed framework proposes a probabilistic random scheduling strategy resource-Pick_kx and a definite smooth weighted round-robin algorithm around the heterogeneous computing resources of nodes. The resource-Pick_kx al-gorithm calculates the probability according to the performance of the node, and performs random scheduling with probability. The higher the probability of a node with high performance, the higher the possibility of task scheduling to this node. The smooth weighted round-robin algorithm sets the weights according to the node performance at the beginning, and smoothly weights during the scheduling process, so that the task is scheduled to the node with the highest performance. In addition, for task scenarios where resources do not meet the requirements, a container-based vertical expansion mechanism is proposed to customize task resources, create nodes to join the cluster, and complete task scheduling again. The performance of the framework is tested on benchmarks and public data sets through ex-periments. Compared with the current strategy, the performance of the proposed frame is improved by 10% to 20%.

Translated title of the contributionHeterogeneous Task Scheduling Framework in Emerging Distributed Computing Systems
Original languageChinese (Traditional)
Pages (from-to)1005-1017
Number of pages13
JournalRuan Jian Xue Bao/Journal of Software
Volume33
Issue number3
DOIs
Publication statusPublished - Mar 2022

Fingerprint

Dive into the research topics of 'Heterogeneous Task Scheduling Framework in Emerging Distributed Computing Systems'. Together they form a unique fingerprint.

Cite this