Accurate differentially private deep learning on the edge

Rui Han; Dong Li; Junyan Ouyang; Chi Harold Liu; Guoren Wang; Dapeng Wu; Lydia Y. Chen

doi:10.1109/TPDS.2021.3064345

Corresponding author: Chi Harold Liu

School of Computer Science and Technology

Research output: Contribution to journal › Article › peer-review

18 Citations (Scopus)

Abstract

Deep learning (DL) models are increasingly built on federated edge participants holding local data. To enable insight extraction without the risk of information leakage, DL training is usually combined with differential privacy (DP). The core idea is to trade learning accuracy for privacy by adding statistically calibrated noise, particularly to the local gradients of edge learners, during model training. However, this privacy guarantee degrades model accuracy because of both the edge learners' local noise and the global noise aggregated at the central server. Existing DP frameworks for the edge focus on local noise calibration via gradient clipping, overlooking the heterogeneity and dynamic changes of local gradients and their aggregated impact on accuracy. In this article, we present a systematic analysis that identifies the factors capable of mitigating local and aggregated noise, and we design PrivateDL to leverage these factors in noise calibration so as to improve model accuracy while fulfilling the privacy guarantee. PrivateDL features (i) sampling-based sensitivity estimation for local noise calibration and (ii) a combination of large batch sizes and critical data identification in global training. We implement PrivateDL on the popular Laplace and Gaussian DP mechanisms and demonstrate its effectiveness using Intel BigDL workloads, improving model accuracy by up to 5x compared with existing DP frameworks.
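For readers unfamiliar with the baseline the abstract refers to, the following is a minimal Python sketch of gradient clipping followed by Gaussian noise on each edge learner, with averaging at a central server. It is not PrivateDL and does not reproduce the paper's sensitivity estimation; all function names are hypothetical, and treating the clipping norm as the L2 sensitivity is a simplifying assumption made only for illustration.

```python
import math
import random


def clip_gradient(grad, clip_norm):
    """Scale grad so that its L2 norm is at most clip_norm."""
    norm = math.sqrt(sum(g * g for g in grad))
    scale = min(1.0, clip_norm / norm) if norm > 0 else 1.0
    return [g * scale for g in grad]


def gaussian_sigma(sensitivity, epsilon, delta):
    """Noise scale of the classic Gaussian mechanism (stated for 0 < epsilon < 1)."""
    return sensitivity * math.sqrt(2.0 * math.log(1.25 / delta)) / epsilon


def privatize_gradient(grad, clip_norm, epsilon, delta, rng):
    """Clip a local gradient, then add Gaussian noise to every coordinate.

    Assumption: clip_norm is used as the L2 sensitivity of the clipped gradient.
    """
    clipped = clip_gradient(grad, clip_norm)
    sigma = gaussian_sigma(clip_norm, epsilon, delta)
    return [g + rng.gauss(0.0, sigma) for g in clipped]


def aggregate(noisy_grads):
    """Average the noisy gradients received from all edge learners."""
    n = len(noisy_grads)
    return [sum(column) / n for column in zip(*noisy_grads)]


if __name__ == "__main__":
    rng = random.Random(0)
    local_grads = [[3.0, 4.0], [0.3, -0.1], [-2.0, 1.0]]  # toy local gradients
    noisy = [privatize_gradient(g, 1.0, 0.5, 1e-5, rng) for g in local_grads]
    print(aggregate(noisy))
```

Because each learner adds independent noise with the same scale, the averaged update over n learners carries noise with standard deviation sigma / sqrt(n) per coordinate, which is one way to see why both local and aggregated noise matter for accuracy.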

Original language	English
Article number	9372811
Pages (from-to)	2231-2247
Number of pages	17
Journal	IEEE Transactions on Parallel and Distributed Systems
Volume	32
Issue number	9
DOIs	https://doi.org/10.1109/TPDS.2021.3064345
Publication status	Published - 1 Sept 2021

Keywords

Deep learning
Differential privacy
Federated learning
Model accuracy

Cite this

Han, R., Li, D., Ouyang, J., Liu, C. H., Wang, G., Wu, D., & Chen, L. Y. (2021). Accurate differentially private deep learning on the edge. IEEE Transactions on Parallel and Distributed Systems, 32(9), 2231-2247. Article 9372811. https://doi.org/10.1109/TPDS.2021.3064345

@article{e588bcb0c3784d61a0c069d48c310959,

title = "Accurate differentially private deep learning on the edge",

abstract = "Deep learning (DL) models are increasingly built on federated edge participants holding local data. To enable insight extractions without the risk of information leakage, DL training is usually combined with differential privacy (DP). The core theme is to tradeoff learning accuracy by adding statistically calibrated noises, particularly to local gradients of edge learners, during model training. However, this privacy guarantee unfortunately degrades model accuracy due to edge learners' local noises, and the global noise aggregated at the central server. Existing DP frameworks for edge focus on local noise calibration via gradient clipping techniques, overlooking the heterogeneity and dynamic changes of local gradients, and their aggregated impact on accuracy. In this article, we present a systematical analysis that unveils the influential factors capable of mitigating local and aggregated noises, and design PrivateDL to leverage these factors in noise calibration so as to improve model accuracy while fulfilling privacy guarantee. PrivateDL features on: (i) sampling-based sensitivity estimation for local noise calibration and (ii) combining large batch sizes and critical data identification in global training. We implement PrivateDL on the popular Laplace/Gaussian DP mechanisms and demonstrate its effectiveness using Intel BigDL workloads, i.e., considerably improving model accuracy by up to 5X when comparing against existing DP frameworks.",

keywords = "Deep learning, Differential privacy, Federated learning, Model accuracy",

author = "Rui Han and Dong Li and Junyan Ouyang and Liu, {Chi Harold} and Guoren Wang and Dapeng Wu and Chen, {Lydia Y.}",

note = "Publisher Copyright: {\textcopyright} 1990-2012 IEEE.",

year = "2021",

month = sep,

day = "1",

doi = "10.1109/TPDS.2021.3064345",

language = "English",

volume = "32",

pages = "2231--2247",

journal = "IEEE Transactions on Parallel and Distributed Systems",

issn = "1045-9219",

publisher = "IEEE Computer Society",

number = "9",

}

TY - JOUR

T1 - Accurate differentially private deep learning on the edge

AU - Han, Rui

AU - Li, Dong

AU - Ouyang, Junyan

AU - Liu, Chi Harold

AU - Wang, Guoren

AU - Wu, Dapeng

AU - Chen, Lydia Y.

PY - 2021/9/1

Y1 - 2021/9/1

N2 - Deep learning (DL) models are increasingly built on federated edge participants holding local data. To enable insight extractions without the risk of information leakage, DL training is usually combined with differential privacy (DP). The core theme is to tradeoff learning accuracy by adding statistically calibrated noises, particularly to local gradients of edge learners, during model training. However, this privacy guarantee unfortunately degrades model accuracy due to edge learners' local noises, and the global noise aggregated at the central server. Existing DP frameworks for edge focus on local noise calibration via gradient clipping techniques, overlooking the heterogeneity and dynamic changes of local gradients, and their aggregated impact on accuracy. In this article, we present a systematical analysis that unveils the influential factors capable of mitigating local and aggregated noises, and design PrivateDL to leverage these factors in noise calibration so as to improve model accuracy while fulfilling privacy guarantee. PrivateDL features on: (i) sampling-based sensitivity estimation for local noise calibration and (ii) combining large batch sizes and critical data identification in global training. We implement PrivateDL on the popular Laplace/Gaussian DP mechanisms and demonstrate its effectiveness using Intel BigDL workloads, i.e., considerably improving model accuracy by up to 5X when comparing against existing DP frameworks.

AB - Deep learning (DL) models are increasingly built on federated edge participants holding local data. To enable insight extractions without the risk of information leakage, DL training is usually combined with differential privacy (DP). The core theme is to tradeoff learning accuracy by adding statistically calibrated noises, particularly to local gradients of edge learners, during model training. However, this privacy guarantee unfortunately degrades model accuracy due to edge learners' local noises, and the global noise aggregated at the central server. Existing DP frameworks for edge focus on local noise calibration via gradient clipping techniques, overlooking the heterogeneity and dynamic changes of local gradients, and their aggregated impact on accuracy. In this article, we present a systematical analysis that unveils the influential factors capable of mitigating local and aggregated noises, and design PrivateDL to leverage these factors in noise calibration so as to improve model accuracy while fulfilling privacy guarantee. PrivateDL features on: (i) sampling-based sensitivity estimation for local noise calibration and (ii) combining large batch sizes and critical data identification in global training. We implement PrivateDL on the popular Laplace/Gaussian DP mechanisms and demonstrate its effectiveness using Intel BigDL workloads, i.e., considerably improving model accuracy by up to 5X when comparing against existing DP frameworks.

KW - Deep learning

KW - Differential privacy

KW - Federated learning

KW - Model accuracy

UR - http://www.scopus.com/inward/record.url?scp=85102648308&partnerID=8YFLogxK

U2 - 10.1109/TPDS.2021.3064345

DO - 10.1109/TPDS.2021.3064345

M3 - Article

AN - SCOPUS:85102648308

SN - 1045-9219

VL - 32

SP - 2231

EP - 2247

JO - IEEE Transactions on Parallel and Distributed Systems

JF - IEEE Transactions on Parallel and Distributed Systems

IS - 9

M1 - 9372811

ER -
