TY - JOUR
T1 - Privacy-Preserving Machine Learning Training in IoT Aggregation Scenarios
AU - Zhu, Liehuang
AU - Tang, Xiangyun
AU - Shen, Meng
AU - Gao, Feng
AU - Zhang, Jie
AU - Du, Xiaojiang
N1 - Publisher Copyright:
© 2014 IEEE.
PY - 2021/8/1
Y1 - 2021/8/1
N2 - In the development of smart cities, the growing popularity of machine learning (ML), which relies on high-quality training data sets generated by diverse Internet-of-Things (IoT) devices, raises natural questions about the privacy guarantees that can be provided in such settings. Privacy-preserving ML training in an aggregation scenario enables a model demander to securely train ML models on sensitive IoT data gathered from IoT devices. Existing solutions are generally server aided, cannot deal with collusion between the servers or between the servers and data owners, and are ill-suited to the constrained environments of IoT. We propose a privacy-preserving ML training framework named Heda, which consists of a library of building blocks based on partially homomorphic encryption. Heda enables the construction of multiple privacy-preserving ML training protocols for the aggregation scenario without the assistance of untrusted servers and remains secure under collusion. Rigorous security analysis demonstrates that the proposed protocols protect the privacy of each participant in the honest-but-curious model and guarantee security under most collusion situations. Extensive experiments validate the efficiency of Heda, which achieves privacy-preserving ML training without loss of model accuracy.
AB - In the development of smart cities, the growing popularity of machine learning (ML), which relies on high-quality training data sets generated by diverse Internet-of-Things (IoT) devices, raises natural questions about the privacy guarantees that can be provided in such settings. Privacy-preserving ML training in an aggregation scenario enables a model demander to securely train ML models on sensitive IoT data gathered from IoT devices. Existing solutions are generally server aided, cannot deal with collusion between the servers or between the servers and data owners, and are ill-suited to the constrained environments of IoT. We propose a privacy-preserving ML training framework named Heda, which consists of a library of building blocks based on partially homomorphic encryption. Heda enables the construction of multiple privacy-preserving ML training protocols for the aggregation scenario without the assistance of untrusted servers and remains secure under collusion. Rigorous security analysis demonstrates that the proposed protocols protect the privacy of each participant in the honest-but-curious model and guarantee security under most collusion situations. Extensive experiments validate the efficiency of Heda, which achieves privacy-preserving ML training without loss of model accuracy.
KW - Homomorphic encryption
KW - Internet-of-Things (IoT) data
KW - machine learning (ML)
KW - modular sequential composition
KW - secure two-party computation
UR - http://www.scopus.com/inward/record.url?scp=85101775417&partnerID=8YFLogxK
U2 - 10.1109/JIOT.2021.3060764
DO - 10.1109/JIOT.2021.3060764
M3 - Article
AN - SCOPUS:85101775417
SN - 2327-4662
VL - 8
SP - 12106
EP - 12118
JO - IEEE Internet of Things Journal
JF - IEEE Internet of Things Journal
IS - 15
M1 - 9359659
ER -