PrivTS: Differentially private frequent time-constrained sequential pattern mining

Yanhui Li; Guoren Wang; Ye Yuan; Xin Cao; Long Yuan; Xuemin Lin

doi:10.1007/978-3-319-91458-9_6

Corresponding author: Yanhui Li

School of Computer Science and Technology

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

10 Citations (Scopus)

Abstract

In this paper, we address the problem of mining time-constrained sequential patterns under the differential privacy framework. Mining time-constrained sequential patterns from sequence datasets, in which the transition time between adjacent items must not be too large for them to form a frequent sequential pattern, has been widely studied. A wide spectrum of applications can benefit from such patterns, including movement behavior analysis, targeted advertising, and POI recommendation. Improper release and use of such patterns could jeopardize individuals’ privacy, which motivates us to apply differential privacy to mining them. This is a challenging task because of its inherent sequentiality and high complexity. To this end, we propose a two-phase algorithm, PrivTS, which consists of a sample-based filtering module and a count refining module. The former uses an improved sparse vector technique to retrieve a set of potentially frequent sequential patterns. Using this information, the latter computes their noisy supports and detects the final frequent patterns. Extensive experiments on real-world datasets demonstrate that our approach maintains high utility while providing privacy guarantees.
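
The two-phase idea in the abstract can be illustrated with a minimal Python sketch. It is not the authors' PrivTS algorithm: it uses the standard sparse vector technique rather than the paper's improved variant, it omits the sampling step, and every name in it (laplace, contains, support, svt_filter, refine) is hypothetical. It assumes that each individual contributes exactly one sequence, so a support count changes by at most 1 when one sequence is added or removed, and that timestamps within a sequence are strictly increasing.

```python
import random


def laplace(scale, rng):
    """Sample Laplace(0, scale) as the difference of two exponentials."""
    return rng.expovariate(1.0 / scale) - rng.expovariate(1.0 / scale)


def contains(seq, pattern, max_gap):
    """True if `pattern` (non-empty tuple of items) occurs in `seq`, a list of
    (item, timestamp) pairs, with each transition time in (0, max_gap]."""
    # Timestamps at which the current pattern prefix can end.
    ends = [t for item, t in seq if item == pattern[0]]
    for sym in pattern[1:]:
        ends = [t for item, t in seq
                if item == sym and any(0 < t - e <= max_gap for e in ends)]
    return bool(ends)


def support(db, pattern, max_gap):
    """Number of sequences in `db` that contain `pattern`."""
    return sum(contains(seq, pattern, max_gap) for seq in db)


def svt_filter(db, candidates, threshold, max_gap, eps, cutoff, rng):
    """Phase 1 (assumed form): standard sparse vector technique.
    Keeps at most `cutoff` candidates whose noisy support reaches a
    noisy threshold; the budget `eps` is split evenly in two."""
    eps1 = eps2 = eps / 2.0
    noisy_threshold = threshold + laplace(1.0 / eps1, rng)
    kept = []
    for p in candidates:
        noisy = support(db, p, max_gap) + laplace(2.0 * cutoff / eps2, rng)
        if noisy >= noisy_threshold:
            kept.append(p)
            if len(kept) >= cutoff:
                break
    return kept


def refine(db, kept, threshold, max_gap, eps, rng):
    """Phase 2 (assumed form): Laplace-noised supports for the survivors.
    One sequence can affect every kept count, so the scale is len(kept)/eps."""
    if not kept:
        return {}
    scale = len(kept) / eps
    noisy = {p: support(db, p, max_gap) + laplace(scale, rng) for p in kept}
    return {p: s for p, s in noisy.items() if s >= threshold}
```

Under these assumptions, calling svt_filter with budget eps_a and then refine with budget eps_b spends eps_a + eps_b in total by sequential composition. Patterns are tuples of items such as ("a", "b") so that they can serve as dictionary keys.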

Original language	English
Title of host publication	Database Systems for Advanced Applications - 23rd International Conference, DASFAA 2018, Proceedings
Editors	Jian Pei, Shazia Sadiq, Jianxin Li, Yannis Manolopoulos
Publisher	Springer Verlag
Pages	92-111
Number of pages	20
ISBN (Print)	9783319914572
DOIs	https://doi.org/10.1007/978-3-319-91458-9_6
Publication status	Published - 2018
Event	23rd International Conference on Database Systems for Advanced Applications, DASFAA 2018, Gold Coast, Australia, 21 May 2018 → 24 May 2018

Publication series

Name	Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume	10828 LNCS
ISSN (Print)	0302-9743
ISSN (Electronic)	1611-3349

Conference

Conference	23rd International Conference on Database Systems for Advanced Applications, DASFAA 2018
Country/Territory	Australia
City	Gold Coast
Period	21/05/18 → 24/05/18

Access to Document

10.1007/978-3-319-91458-9_6

Cite this

Li, Y., Wang, G., Yuan, Y., Cao, X., Yuan, L., & Lin, X. (2018). PrivTS: Differentially private frequent time-constrained sequential pattern mining. In J. Pei, S. Sadiq, J. Li, & Y. Manolopoulos (Eds.), Database Systems for Advanced Applications - 23rd International Conference, DASFAA 2018, Proceedings (pp. 92-111). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 10828 LNCS). Springer Verlag. https://doi.org/10.1007/978-3-319-91458-9_6

Li, Yanhui ; Wang, Guoren ; Yuan, Ye et al. / PrivTS : Differentially private frequent time-constrained sequential pattern mining. Database Systems for Advanced Applications - 23rd International Conference, DASFAA 2018, Proceedings. editor / Jian Pei ; Shazia Sadiq ; Jianxin Li ; Yannis Manolopoulos. Springer Verlag, 2018. pp. 92-111 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)).

@inproceedings{dc006e4d7339428fb30634a4a6b8a05d,
  title = "PrivTS: Differentially private frequent time-constrained sequential pattern mining",
  abstract = "In this paper, we address the problem of mining time-constrained sequential patterns under the differential privacy framework. Mining time-constrained sequential patterns from sequence datasets, in which the transition time between adjacent items must not be too large for them to form a frequent sequential pattern, has been widely studied. A wide spectrum of applications can benefit from such patterns, including movement behavior analysis, targeted advertising, and POI recommendation. Improper release and use of such patterns could jeopardize individuals{\textquoteright} privacy, which motivates us to apply differential privacy to mining them. This is a challenging task because of its inherent sequentiality and high complexity. To this end, we propose a two-phase algorithm, PrivTS, which consists of a sample-based filtering module and a count refining module. The former uses an improved sparse vector technique to retrieve a set of potentially frequent sequential patterns. Using this information, the latter computes their noisy supports and detects the final frequent patterns. Extensive experiments on real-world datasets demonstrate that our approach maintains high utility while providing privacy guarantees.",
  author = "Yanhui Li and Guoren Wang and Ye Yuan and Xin Cao and Long Yuan and Xuemin Lin",
  note = "Publisher Copyright: {\textcopyright} Springer International Publishing AG, part of Springer Nature 2018.; 23rd International Conference on Database Systems for Advanced Applications, DASFAA 2018 ; Conference date: 21-05-2018 Through 24-05-2018",
  year = "2018",
  doi = "10.1007/978-3-319-91458-9_6",
  language = "English",
  isbn = "9783319914572",
  series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",
  publisher = "Springer Verlag",
  pages = "92--111",
  editor = "Jian Pei and Shazia Sadiq and Jianxin Li and Yannis Manolopoulos",
  booktitle = "Database Systems for Advanced Applications - 23rd International Conference, DASFAA 2018, Proceedings",
  address = "Germany",
}

Li, Y, Wang, G, Yuan, Y, Cao, X, Yuan, L & Lin, X 2018, PrivTS: Differentially private frequent time-constrained sequential pattern mining. in J Pei, S Sadiq, J Li & Y Manolopoulos (eds), Database Systems for Advanced Applications - 23rd International Conference, DASFAA 2018, Proceedings. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 10828 LNCS, Springer Verlag, pp. 92-111, 23rd International Conference on Database Systems for Advanced Applications, DASFAA 2018, Gold Coast, Australia, 21/05/18. https://doi.org/10.1007/978-3-319-91458-9_6

PrivTS: Differentially private frequent time-constrained sequential pattern mining. / Li, Yanhui; Wang, Guoren; Yuan, Ye et al.
Database Systems for Advanced Applications - 23rd International Conference, DASFAA 2018, Proceedings. ed. / Jian Pei; Shazia Sadiq; Jianxin Li; Yannis Manolopoulos. Springer Verlag, 2018. p. 92-111 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 10828 LNCS).

TY  - GEN
T1  - PrivTS: Differentially private frequent time-constrained sequential pattern mining
T2  - 23rd International Conference on Database Systems for Advanced Applications, DASFAA 2018
AU  - Li, Yanhui
AU  - Wang, Guoren
AU  - Yuan, Ye
AU  - Cao, Xin
AU  - Yuan, Long
AU  - Lin, Xuemin
PY  - 2018
Y1  - 2018
AB  - In this paper, we address the problem of mining time-constrained sequential patterns under the differential privacy framework. Mining time-constrained sequential patterns from sequence datasets, in which the transition time between adjacent items must not be too large for them to form a frequent sequential pattern, has been widely studied. A wide spectrum of applications can benefit from such patterns, including movement behavior analysis, targeted advertising, and POI recommendation. Improper release and use of such patterns could jeopardize individuals’ privacy, which motivates us to apply differential privacy to mining them. This is a challenging task because of its inherent sequentiality and high complexity. To this end, we propose a two-phase algorithm, PrivTS, which consists of a sample-based filtering module and a count refining module. The former uses an improved sparse vector technique to retrieve a set of potentially frequent sequential patterns. Using this information, the latter computes their noisy supports and detects the final frequent patterns. Extensive experiments on real-world datasets demonstrate that our approach maintains high utility while providing privacy guarantees.
UR  - http://www.scopus.com/inward/record.url?scp=85048945220&partnerID=8YFLogxK
U2  - 10.1007/978-3-319-91458-9_6
DO  - 10.1007/978-3-319-91458-9_6
M3  - Conference contribution
AN  - SCOPUS:85048945220
SN  - 9783319914572
T3  - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP  - 92
EP  - 111
BT  - Database Systems for Advanced Applications - 23rd International Conference, DASFAA 2018, Proceedings
A2  - Pei, Jian
A2  - Sadiq, Shazia
A2  - Li, Jianxin
A2  - Manolopoulos, Yannis
PB  - Springer Verlag
Y2  - 21 May 2018 through 24 May 2018
ER  -

Li Y, Wang G, Yuan Y, Cao X, Yuan L, Lin X. PrivTS: Differentially private frequent time-constrained sequential pattern mining. In Pei J, Sadiq S, Li J, Manolopoulos Y, editors, Database Systems for Advanced Applications - 23rd International Conference, DASFAA 2018, Proceedings. Springer Verlag. 2018. p. 92-111. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)). doi: 10.1007/978-3-319-91458-9_6
