TY - GEN
T1 - Efficient algorithms for distance-based representative skyline computation in 2D space
AU - Cai, Taotao
AU - Li, Rong Hua
AU - Yu, Jeffrey Xu
AU - Mao, Rui
AU - Cai, Yadi
N1 - Publisher Copyright:
© Springer International Publishing Switzerland 2015.
PY - 2015
Y1 - 2015
N2 - Representative skyline computation is a fundamental issue in database area, which has attracted much attention in recent years. A notable definition of representative skyline is the distance-based representative skyline (DBRS). Given an integer k, a DBRS includes k representative skyline points that aims at minimizing the maximal distance between a non-representative skyline point and its nearest representative. In the 2D space, the state-of-the-art algorithm to compute the DBRS is based on dynamic programming (DP) which takes O(km2) time complexity, where m is the number of skyline points. Clearly, such a DP-based algorithm cannot be used for handling large scale dataset due to the quadratic time cost. To overcome this problem, in this paper, we propose a new approximate algorithm called ARS, and a new exact algorithm named PSRS, based on a carefully-designed parametric search technique. We show that the ARS algorithm can guarantee a solution that is at most ε larger than the optimal solution. The proposed ARS and PSRS algorithms run in O(k log2 mlog(T/ε)) and O(k2 log3 m) time respectively, where T is no more than the maximal distance between any two skyline points. We conduct extensive experimental studies over both synthetic and real-world datasets, and the results demonstrate the efficiency and effectiveness of the proposed algorithms.
AB - Representative skyline computation is a fundamental issue in database area, which has attracted much attention in recent years. A notable definition of representative skyline is the distance-based representative skyline (DBRS). Given an integer k, a DBRS includes k representative skyline points that aims at minimizing the maximal distance between a non-representative skyline point and its nearest representative. In the 2D space, the state-of-the-art algorithm to compute the DBRS is based on dynamic programming (DP) which takes O(km2) time complexity, where m is the number of skyline points. Clearly, such a DP-based algorithm cannot be used for handling large scale dataset due to the quadratic time cost. To overcome this problem, in this paper, we propose a new approximate algorithm called ARS, and a new exact algorithm named PSRS, based on a carefully-designed parametric search technique. We show that the ARS algorithm can guarantee a solution that is at most ε larger than the optimal solution. The proposed ARS and PSRS algorithms run in O(k log2 mlog(T/ε)) and O(k2 log3 m) time respectively, where T is no more than the maximal distance between any two skyline points. We conduct extensive experimental studies over both synthetic and real-world datasets, and the results demonstrate the efficiency and effectiveness of the proposed algorithms.
UR - http://www.scopus.com/inward/record.url?scp=84950260068&partnerID=8YFLogxK
U2 - 10.1007/978-3-319-25255-1_10
DO - 10.1007/978-3-319-25255-1_10
M3 - Conference contribution
AN - SCOPUS:84950260068
SN - 9783319252544
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 116
EP - 128
BT - Web Technologies and Applications - 17th Asia-PacificWeb Conference,APWeb 2015, Proceedings
A2 - Cheng, Reynold
A2 - Cui, Bin
A2 - Zhang, Zhenjie
A2 - Cai, Ruichu
A2 - Xu, Jia
PB - Springer Verlag
T2 - 17th Asia-PacificWeb Conference, APWeb 2015
Y2 - 18 September 2015 through 20 September 2015
ER -