CLAP: Component-Level Approximate Processing for Low Tail Latency and High Result Accuracy in Cloud Online Services

Rui Han, Siguang Huang, Zhentao Wang, Jianfeng Zhan

科研成果: 期刊稿件文章同行评审

5 引用 (Scopus)
Plum Print visual indicator of research metrics
  • Citations
    • Citation Indexes: 5
  • Captures
    • Readers: 13
see details

摘要

Modern latency-critical online services such as search engines often process requests by consulting large input data spanning massive parallel components. Hence the tail latency of these components determines the service latency. To trade off result accuracy for tail latency reduction, existing techniques use the components responding before a specified deadline to produce approximate results. However, they skip a large proportion of components when load gets heavier, thus incurring large accuracy losses. In this paper, we propose CLAP to enable component-level approximate processing of requests for low tail latency and small accuracy losses. CLAP aggregates information of input data to create small aggregated data points. Using these points, CLAP reduces latency variance of parallel components and allows them to produce initial results quickly; CLAP also identifies the parts of input data most related to requests' result accuracies, thus first using these parts to improve the produced results to minimize accuracy losses. We evaluated CLAP using real services and datasets. The results show: (i) CLAP reduces tail latency by 6.46 times with accuracy losses of 2.2 percent compared to existing exact processing techniques; (ii) when using the same latency, CLAP reduces accuracy losses by 31.58 times compared to existing approximate processing techniques.

源语言英语
文章编号7812758
页(从-至)2190-2203
页数14
期刊IEEE Transactions on Parallel and Distributed Systems
28
8
DOI
出版状态已出版 - 1 8月 2017
已对外发布

指纹

探究 'CLAP: Component-Level Approximate Processing for Low Tail Latency and High Result Accuracy in Cloud Online Services' 的科研主题。它们共同构成独一无二的指纹。

引用此

Han, R., Huang, S., Wang, Z., & Zhan, J. (2017). CLAP: Component-Level Approximate Processing for Low Tail Latency and High Result Accuracy in Cloud Online Services. IEEE Transactions on Parallel and Distributed Systems, 28(8), 2190-2203. 文章 7812758. https://doi.org/10.1109/TPDS.2017.2650988