Benchmarking Big Data Systems: A Review

Rui Han*, Lizy Kurian John, Jianfeng Zhan

*此作品的通讯作者

科研成果: 期刊稿件文章同行评审

62 引用 (Scopus)
Plum Print visual indicator of research metrics
  • Citations
    • Citation Indexes: 61
    • Policy Citations: 1
  • Captures
    • Readers: 103
see details

摘要

With the fast development of big data systems in recent years, a variety of open-source benchmarks have been built to evaluate and compare the workloads on these systems, and to promote their technology improvement. However, to date no comprehensive survey has been written on this topic. This paper attempts to fill the void by presenting a review of the state-of-the-art big data benchmarking efforts. The paper first gives an overview of popular open-source benchmarks from the point of view of big data systems. It then reviews the three important aspects of benchmarking-workload generation techniques, workload input data generation techniques, and metrics used to assess systems. For each aspect, the paper divides the surveyed benchmarks into different categories and describes some representative benchmarks, rather than all benchmarks listed, in each category, following the discussion of potential research directions to motivate future work in this area.

源语言英语
页(从-至)580-597
页数18
期刊IEEE Transactions on Services Computing
11
3
DOI
出版状态已出版 - 1 5月 2018
已对外发布

指纹

探究 'Benchmarking Big Data Systems: A Review' 的科研主题。它们共同构成独一无二的指纹。

引用此

Han, R., John, L. K., & Zhan, J. (2018). Benchmarking Big Data Systems: A Review. IEEE Transactions on Services Computing, 11(3), 580-597. https://doi.org/10.1109/TSC.2017.2730882