面向Flink迭代作业的动态资源分配策略

Xiao Fei Yue, Lan Shi, Yu Hai Zhao*, Hang Xu Ji, Guo Ren Wang

*此作品的通讯作者

科研成果: 期刊稿件文章同行评审

1 引用 (Scopus)

摘要

Apache Flink, an emerging distributed computing framework, supports the execution of large-scale iterative programs on the cluster, but its default static resource allocation mechanism makes it impossible to carry out reasonable resource allocation to make iterative jobs complete on time. In response to this problem, that users should be relied on to actively express performance constraints rather than passively retain resources. RABORP, a dynamic resource allocation strategy based on runtime prediction is proposed to develop and implement a dynamic resource allocation plan for Flink iterative jobs with clear runtime limits. The main idea is to predict the runtime of each iteration superstep, and then the initial allocation and dynamic adjustment of resources are performed at the time of the iterative job submission and the synchronization barrier between the supersteps according to the predicted results, to ensure that the minimum set of resources can be used to complete the iterative job within the runtime limit specified by the user. A variety of typical Flink iterative jobs were executed under the dataset to carry out relevant comparative experiments. Experimental results show that the established runtime prediction model can accurately predict the runtime of each superstep, and compared with the current state-of-the-art algorithms, the proposed dynamic resource allocation strategy used in single-job and multi-job scenarios has improved various performance indicators.

投稿的翻译标题Dynamic Resource Allocation Strategy for Flink Iterative Jobs
源语言繁体中文
页(从-至)985-1004
页数20
期刊Ruan Jian Xue Bao/Journal of Software
33
3
DOI
出版状态已出版 - 3月 2022

关键词

  • Apache Flink
  • Iterative job
  • Resource allocation
  • Runtime limit
  • Runtime prediction

指纹

探究 '面向Flink迭代作业的动态资源分配策略' 的科研主题。它们共同构成独一无二的指纹。

引用此