Abstract
With the rapid development of big data systems in recent years, a variety of open-source benchmarks have been built to evaluate and compare workloads on these systems and to promote improvements in their technology. However, to date no comprehensive survey has been written on this topic. This paper attempts to fill that gap by presenting a review of state-of-the-art big data benchmarking efforts. The paper first gives an overview of popular open-source benchmarks from the point of view of big data systems. It then reviews three important aspects of benchmarking: workload generation techniques, workload input data generation techniques, and the metrics used to assess systems. For each aspect, the paper divides the surveyed benchmarks into categories and describes some representative benchmarks in each category, rather than all of the benchmarks listed, followed by a discussion of potential research directions to motivate future work in this area.
Original language | English |
---|---|
Pages (from-to) | 580-597 |
Number of pages | 18 |
Journal | IEEE Transactions on Services Computing |
Volume | 11 |
Issue | 3 |
DOI | |
Publication status | Published - 1 May 2018 |
Externally published | Yes |