TY - GEN
T1 - PrivTS
T2 - 23rd International Conference on Database Systems for Advanced Applications, DASFAA 2018
AU - Li, Yanhui
AU - Wang, Guoren
AU - Yuan, Ye
AU - Cao, Xin
AU - Yuan, Long
AU - Lin, Xuemin
N1 - Publisher Copyright:
© Springer International Publishing AG, part of Springer Nature 2018.
PY - 2018
Y1 - 2018
N2 - In this paper, we address the problem of mining time-constrained sequential patterns under the differential privacy framework. The mining of time-constrained sequential patterns from the sequence dataset has been widely studied, in which the transition time between adjacent items should not be too large to form frequent sequential patterns. A wide spectrum of applications can greatly benefit from such patterns, such as movement behavior analysis, targeted advertising, and POI recommendation. Improper releasing and use of such patterns could jeopardize the individually’s privacy, which motivates us to apply differential privacy to mining such patterns. It is a challenging task due to the inherent sequentiality and high complexity. Towards this end, we propose a two-phase algorithm PrivTS, which consists of sample-based filtering and count refining modules. The former takes advantage of an improved sparse vector technique to retrieve a set of potentially frequent sequential patterns. Utilizing this information, the latter computes their noisy supports and detects the final frequent patterns. Extensive experiments conducted on real-world datasets demonstrate that our approach maintains high utility while providing privacy guarantees.
AB - In this paper, we address the problem of mining time-constrained sequential patterns under the differential privacy framework. The mining of time-constrained sequential patterns from the sequence dataset has been widely studied, in which the transition time between adjacent items should not be too large to form frequent sequential patterns. A wide spectrum of applications can greatly benefit from such patterns, such as movement behavior analysis, targeted advertising, and POI recommendation. Improper releasing and use of such patterns could jeopardize the individually’s privacy, which motivates us to apply differential privacy to mining such patterns. It is a challenging task due to the inherent sequentiality and high complexity. Towards this end, we propose a two-phase algorithm PrivTS, which consists of sample-based filtering and count refining modules. The former takes advantage of an improved sparse vector technique to retrieve a set of potentially frequent sequential patterns. Utilizing this information, the latter computes their noisy supports and detects the final frequent patterns. Extensive experiments conducted on real-world datasets demonstrate that our approach maintains high utility while providing privacy guarantees.
UR - http://www.scopus.com/inward/record.url?scp=85048945220&partnerID=8YFLogxK
U2 - 10.1007/978-3-319-91458-9_6
DO - 10.1007/978-3-319-91458-9_6
M3 - Conference contribution
AN - SCOPUS:85048945220
SN - 9783319914572
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 92
EP - 111
BT - Database Systems for Advanced Applications - 23rd International Conference, DASFAA 2018, Proceedings
A2 - Pei, Jian
A2 - Sadiq, Shazia
A2 - Li, Jianxin
A2 - Manolopoulos, Yannis
PB - Springer Verlag
Y2 - 21 May 2018 through 24 May 2018
ER -