BigDataBench-MT: A benchmark tool for generating realistic mixed data center workloads

Rui Han*, Shulin Zhan, Chenrong Shao, Junwei Wang, Lizy K. John, Jiangtao Xu, Gang Lu, Lei Wang

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

4 Citations (Scopus)
Plum Print visual indicator of research metrics
  • Citations
    • Citation Indexes: 4
  • Captures
    • Readers: 19
see details

Abstract

Long-running service workloads (e.g. web search engine) and short-term data analysis workloads (e.g. Hadoop MapReduce jobs) colocate in today’s data centers. Developing realistic benchmarks to reflect such practical scenario of mixed workload is a key problem to produce trustworthy results when evaluating and comparing data center systems. This requires using actual workloads as well as guaranteeing their submissions to follow patterns hidden in real-world traces. However, existing benchmarks either generate actual workloads based on probability models, or replay real-world workload traces using basic I/O operations. To fill this gap, we propose a benchmark tool that is a first step towards generating a mix of actual service and data analysis workloads on the basis of real workload traces. Our tool includes a combiner that enables the replaying of actual workloads according to the workload traces, and a multi-tenant generator that flexibly scales the workloads up and down according to users’ requirements. Based on this, our demo illustrates the workload customization and generation process using a visual interface. The proposed tool, called BigDataBench-MT, is a multitenant version of our comprehensive benchmark suite BigDataBench and it is publicly available from http://prof.ict.ac.cn/BigDataBench/ multi-tenancyversion/.

Original languageEnglish
Title of host publicationBig Data Benchmarks, Performance Optimization, and Emerging Hardware - 6th Workshop, BPOE 2015, Revised Selected Papers
EditorsRoberto V. Zicari, Jianfeng Zhan, Rui Han
PublisherSpringer Verlag
Pages10-21
Number of pages12
ISBN (Print)9783319290058
DOIs
Publication statusPublished - 2016
Externally publishedYes
Event6th Workshop on Big Data Benchmarks, Performance Optimization, and Emerging Hardware, BPOE 2015 - Kohala, United States
Duration: 31 Aug 20154 Sept 2015

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume9495
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference6th Workshop on Big Data Benchmarks, Performance Optimization, and Emerging Hardware, BPOE 2015
Country/TerritoryUnited States
CityKohala
Period31/08/154/09/15

Keywords

  • Benchmark
  • Data center
  • Mixed workloads
  • Workload trace

Fingerprint

Dive into the research topics of 'BigDataBench-MT: A benchmark tool for generating realistic mixed data center workloads'. Together they form a unique fingerprint.

Cite this

Han, R., Zhan, S., Shao, C., Wang, J., John, L. K., Xu, J., Lu, G., & Wang, L. (2016). BigDataBench-MT: A benchmark tool for generating realistic mixed data center workloads. In R. V. Zicari, J. Zhan, & R. Han (Eds.), Big Data Benchmarks, Performance Optimization, and Emerging Hardware - 6th Workshop, BPOE 2015, Revised Selected Papers (pp. 10-21). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 9495). Springer Verlag. https://doi.org/10.1007/978-3-319-29006-5_2