Enabling Differentially Private in Big Data Machine Learning

Dong Li, Xiaojiang Zuo, Rui Han

科研成果: 书/报告/会议事项章节会议稿件同行评审

1 引用 (Scopus)

摘要

Using the machine learning technology to explore the potential value of Big Data brings us into a smarter world, and the way data is mined through data sharing patterns also threatens the privacy of personal data. Differential privacy is a prevalent mechanism to effectively protect the personal data privacy due to the strict and the provable privacy definition, although there are several achievements have reached by combining the differential privacy and traditional machine learning algorithms in a stand-alone mode, little to talk about the distributed environment. To fill this gap, this paper proposes a method to embed the differential privacy mechanism into distributed platform, respectively implements the DPLloyd, GUPT k-means and GUPT logistic regression on the platform of Spark. The evaluation demonstrates that the approach barely interferes the effect of distributed machine learning algorithms and thus achieves the goal of differential privacy.

源语言英语
主期刊名ICSIDP 2019 - IEEE International Conference on Signal, Information and Data Processing 2019
出版商Institute of Electrical and Electronics Engineers Inc.
ISBN(电子版)9781728123455
DOI
出版状态已出版 - 12月 2019
活动2019 IEEE International Conference on Signal, Information and Data Processing, ICSIDP 2019 - Chongqing, 中国
期限: 11 12月 201913 12月 2019

出版系列

姓名ICSIDP 2019 - IEEE International Conference on Signal, Information and Data Processing 2019

会议

会议2019 IEEE International Conference on Signal, Information and Data Processing, ICSIDP 2019
国家/地区中国
Chongqing
时期11/12/1913/12/19

指纹

探究 'Enabling Differentially Private in Big Data Machine Learning' 的科研主题。它们共同构成独一无二的指纹。

引用此