TY - GEN
T1 - SARP
T2 - 12th ACM International Conference on Computing Frontiers, CF 2015
AU - Han, Rui
AU - Wang, Junwei
AU - Ge, Fengming
AU - Vazquez-Poletti, Jose Luis
AU - Zhan, Jianfeng
N1 - Publisher Copyright:
© Copyright 2015 ACM.
PY - 2015/5/6
Y1 - 2015/5/6
N2 - Despite the importance of providing fluid responsiveness to user requests for interactive services, such request processing is very resource expensive when dealing with large-scale input data. These often exceed the application owners' budget when services are deployed on a cloud, in which resources are charged in monetary terms. Providing approximate processing results is a feasible solution for such problem that trades off request correctness (quantified by output quality) for response time reduction. However, existing techniques in this area either use partial input data or skip expensive computations to produce approximate results, thus resulting in large losses in output quality on a tight resource budget. In this paper, we propose SARP, a Synopsis-based Approximate Request Processing framework to produce approximate results with small correctness losses even using small amount of resources. To achieve this, SARP conducts full computations over the statistical aggregation of the entire input data using two key ideas: (1) offline synopsis management that generates and maintains a set of synopses that represent the statistical aggregation of original input data at different approximation levels. (2) Online synopsis selection that considers both the current resource allocation and the workload status so as to select the synopsis with the maximal length that can be processed within the required response time. We demonstrate the effectiveness of our approach by testing the recommendation services in E-commerce sites using a large, real-world dataset. Using prediction accuracy as the output quality, the results demonstrate: (i) SARP achieves significant response time reduction with very small quality losses compared to the exact processing results.(ii) Using the same processing time, SARP demonstrates a considerable reduction in quality loss compared to existing approximation techniques.
AB - Despite the importance of providing fluid responsiveness to user requests for interactive services, such request processing is very resource expensive when dealing with large-scale input data. These often exceed the application owners' budget when services are deployed on a cloud, in which resources are charged in monetary terms. Providing approximate processing results is a feasible solution for such problem that trades off request correctness (quantified by output quality) for response time reduction. However, existing techniques in this area either use partial input data or skip expensive computations to produce approximate results, thus resulting in large losses in output quality on a tight resource budget. In this paper, we propose SARP, a Synopsis-based Approximate Request Processing framework to produce approximate results with small correctness losses even using small amount of resources. To achieve this, SARP conducts full computations over the statistical aggregation of the entire input data using two key ideas: (1) offline synopsis management that generates and maintains a set of synopses that represent the statistical aggregation of original input data at different approximation levels. (2) Online synopsis selection that considers both the current resource allocation and the workload status so as to select the synopsis with the maximal length that can be processed within the required response time. We demonstrate the effectiveness of our approach by testing the recommendation services in E-commerce sites using a large, real-world dataset. Using prediction accuracy as the output quality, the results demonstrate: (i) SARP achieves significant response time reduction with very small quality losses compared to the exact processing results.(ii) Using the same processing time, SARP demonstrates a considerable reduction in quality loss compared to existing approximation techniques.
KW - Approximate results
KW - Cloud
KW - Interactive service
KW - Output quality
KW - Result correctness
KW - Synopsis
UR - http://www.scopus.com/inward/record.url?scp=84969835223&partnerID=8YFLogxK
U2 - 10.1145/2742854.2742858
DO - 10.1145/2742854.2742858
M3 - Conference contribution
AN - SCOPUS:84969835223
T3 - Proceedings of the 12th ACM International Conference on Computing Frontiers, CF 2015
BT - Proceedings of the 12th ACM International Conference on Computing Frontiers, CF 2015
PB - Association for Computing Machinery, Inc
Y2 - 18 May 2015 through 21 May 2015
ER -