WASC: Adapting scheduler configurations for heterogeneous mapreduce workloads

Siyi Wang, Fan Zhang*, Rui Han

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

MapReduce has emerged as a popular programming paradigm for data intensive computing in both scientific and commercial applications. On a MapReduce cluster, modern resource negotiation frameworks like Hadoop YARN and Mesos support scheduling of jobs submitted by multiple tenants. However, existing job schedulers lacks the automatic adaption to workload variations in their scheduling configuration, which is crucial for the jobs’ latencies because it determines how to share resources among the latest jobs in the system. The major challenge here is, to a MapReduce cluster scheduler, The performance of different configurations depends not only on the number of jobs in different queues, but also on their workload characteristics, which refer to the type and size of jobs. We introduce a workload-adaptive scheduling configuration (WASC) framework for heterogeneous MapReduce jobs. WASC identifies the optimal configuration for them by reasoning about their performances under different configurations.

Original languageEnglish
Title of host publicationBig Scientific Data Benchmarks, Architecture, and Systems - 1st Workshop, SDBA 2018, Revised Selected Papers
EditorsRui Ren, Chen Zheng, Jianfeng Zhan
PublisherSpringer Verlag
Pages45-54
Number of pages10
ISBN (Print)9789811359095
DOIs
Publication statusPublished - 2019
Externally publishedYes
Event1st Workshop on Big Scientific Data Benchmarks, Architecture, and Systems, SDBA 2018 - Beijing, China
Duration: 12 Jun 201812 Jun 2018

Publication series

NameCommunications in Computer and Information Science
Volume911
ISSN (Print)1865-0929

Conference

Conference1st Workshop on Big Scientific Data Benchmarks, Architecture, and Systems, SDBA 2018
Country/TerritoryChina
CityBeijing
Period12/06/1812/06/18

Keywords

  • Cluster schedulers
  • Configurations
  • MapReduce
  • Workload heterogeneous

Fingerprint

Dive into the research topics of 'WASC: Adapting scheduler configurations for heterogeneous mapreduce workloads'. Together they form a unique fingerprint.

Cite this