新型分布式计算系统中的异构任务调度框架

Rui Qi Liu, Bo Yang Li*, Yu Jin Gao, Chang Sheng Li, Heng Tai Zhao, Fu Sheng Jin, Rong Hua Li, Guo Ren Wang

*此作品的通讯作者

科研成果: 期刊稿件文章同行评审

2 引用 (Scopus)

摘要

With the rapid development of big data and machine learning, the distributed big data computing engine for machine learning have emerged. These systems can support both batch distributed learning and incremental learning and verification, with low latency and high performance. However, some of them adopt a random task scheduling strategy, ignoring the performance differences of nodes, which easily lead to uneven load and performance degradation. At the same time, for some tasks, if the resource requirements are not met, the scheduling will fail. In response to these problems, a heterogeneous task scheduling framework is proposed, which can ensure the efficient execution and execution of tasks. Specifically, for the task scheduling module, the proposed framework proposes a probabilistic random scheduling strategy resource-Pick_kx and a definite smooth weighted round-robin algorithm around the heterogeneous computing resources of nodes. The resource-Pick_kx al-gorithm calculates the probability according to the performance of the node, and performs random scheduling with probability. The higher the probability of a node with high performance, the higher the possibility of task scheduling to this node. The smooth weighted round-robin algorithm sets the weights according to the node performance at the beginning, and smoothly weights during the scheduling process, so that the task is scheduled to the node with the highest performance. In addition, for task scenarios where resources do not meet the requirements, a container-based vertical expansion mechanism is proposed to customize task resources, create nodes to join the cluster, and complete task scheduling again. The performance of the framework is tested on benchmarks and public data sets through ex-periments. Compared with the current strategy, the performance of the proposed frame is improved by 10% to 20%.

投稿的翻译标题Heterogeneous Task Scheduling Framework in Emerging Distributed Computing Systems
源语言繁体中文
页(从-至)1005-1017
页数13
期刊Ruan Jian Xue Bao/Journal of Software
33
3
DOI
出版状态已出版 - 3月 2022

关键词

  • Autoscale
  • Distributed computing
  • Heterogeneous task
  • Load balance
  • Task scheduling

指纹

探究 '新型分布式计算系统中的异构任务调度框架' 的科研主题。它们共同构成独一无二的指纹。

引用此