Fast Modeling of Analytics Workloads for Big Data Services

Lin Yang, Changsheng Li, Liya Fan, Jingmin Xu

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Building models to predict analytics workloads' execution is a foundational capability that enables key scenarios for big data services, like SLA-driven service provisioning and elastic auto scaling. Given the various infrastructure and workload characteristics, it's more preferable to build the models in a 'black-box' fashion, for example, by leveraging machine learning techniques. However, this approach has assumptions on the volume and quality of workloads' existing records to learn from, which require sophisticate benchmark or long time monitoring. In this paper, we present a method to accelerate the modeling process of an analytics workload for its quick time-to-value in the context of big data services. Specifically, clustering and transfer learning techniques are leveraged for this acceleration by shifting the data collection from the online service phase to the offline preparation phase. This paper focuses on the conceived service model and fast modeling techniques. Their feasibility is demonstrated by experiments.

Original languageEnglish
Title of host publicationProceedings - 2014 International Conference on Service Sciences, ICSS 2014
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages101-105
Number of pages5
ISBN (Electronic)9781479943302
DOIs
Publication statusPublished - 28 Oct 2015
Externally publishedYes
EventInternational Conference on Service Sciences, ICSS 2014 - Wuxi, Jiangsu, China
Duration: 22 May 201423 May 2014

Publication series

NameProceedings of International Conference on Service Science, ICSS
Volume2015-October
ISSN (Print)2165-3836
ISSN (Electronic)2165-3828

Conference

ConferenceInternational Conference on Service Sciences, ICSS 2014
Country/TerritoryChina
CityWuxi, Jiangsu
Period22/05/1423/05/14

Keywords

  • MapReduce
  • analytics
  • big data
  • cloud computing
  • machine learning
  • modeling
  • service

Fingerprint

Dive into the research topics of 'Fast Modeling of Analytics Workloads for Big Data Services'. Together they form a unique fingerprint.

Cite this