基于多维度特征融合的云工作流任务执行时间预测方法

Hui Fang Li, Jiang Hang Huang, Guang Hao Xu, Yuan Qing Xia

科研成果: 期刊稿件文章同行评审

1 引用 (Scopus)

摘要

Task runtime estimation is a prerequisite for workflow scheduling in cloud data centers. However, the existing runtime prediction methods for workflow activities fail to effectively extract categorical and numerical features. In this paper, we propose a multi-dimensional feature fusion-based runtime prediction approach for workflow tasks. Firstly, we construct a stacked residual recurrent neural network with an attention mechanism for mapping categorical data from high-dimensional sparse space to low-dimensional dense space so as to enlarge its capability of parsing categorical data for categorical feature extraction. Secondly, extreme gradient boosting is introduced to discretize the numerical data and enhance the nonlinear representation capability for numerical features through sparsely processing the input vectors within dense space. Thirdly, we design a heterogeneous multi-dimensional feature fusion strategy, and then blend the extracted features with original inputs to mine comprehensive knowledge for runtime prediction. Finally, based on the resulting multi-dimensional fused features, a prediction model is developed to fully utilize these features as well as its corresponding hidden knowledge and then to forecast the runtimes accurately for cloud workflow tasks. To verify the effectiveness and superiority of the proposed method, we conduct extensive experiments on a cluster dataset from a real cloud data center. The experimental results show that, our approach outperforms the existing algorithms and can be applied in big data-driven runtime prediction for workflow activities in the cloud.

投稿的翻译标题Multi-dimensional Feature Fusion-based Runtime Prediction Approach for Cloud Workflow Tasks
源语言繁体中文
页(从-至)67-78
页数12
期刊Zidonghua Xuebao/Acta Automatica Sinica
49
1
DOI
出版状态已出版 - 1月 2023

关键词

  • Cloud data centers
  • ensemble learning
  • execution time prediction
  • feature fusion
  • workflows

指纹

探究 '基于多维度特征融合的云工作流任务执行时间预测方法' 的科研主题。它们共同构成独一无二的指纹。

引用此