Exploiting Multi-Model Collaborative Inference for Privacy Enhancement in Text Classification

Yong Lin; Peng Jiang; Keke Gai; Liehuang Zhu

doi:10.1109/BigDataSecurity62737.2024.00018

Exploiting Multi-Model Collaborative Inference for Privacy Enhancement in Text Classification

Yong Lin, Peng Jiang^*, Keke Gai, Liehuang Zhu

^*Corresponding author for this work

School of Cyberspace Science and Technology

Beijing Institute of Technology

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

Abstract

Text classification is a foundational task in natural language processing that involves categorizing raw text into pre-defined classes. This task holds significant importance in various applications, including but not limited to sentiment analysis and intent detection. With collaborative inference of multiple models, text classification may achieve an improved performance compared to the single model. However, if multiple models have access to the input text directly, it may create challenges on the privacy of sensitive data or model information. It is not easy to realize collaborative inference while preserving the privacy. This paper presents PPJP, a privacy-preserving joint system that helps achieve private collaborative inference in text classification with machine learning. Our method to instantiate it, is based on secure multiparty computation (MPC) and differential privacy (DP). We fulfill the privacy and scalability of text classification under multiple models inference. Secret-sharing-based MPC is used to protect the input and model parameters, while DP is used to protect against membership inference attack. We implement and evaluate prototype of our PPJP system based on the Twitter dataset. Experimental results show that text classification can guarantee privacy for model owners and clients with 54% inference accuracy. It achieves a balance between privacy and accuracy in case of collaborative inference.

Original language	English
Title of host publication	Proceedings - 2024 IEEE 10th Conference on Big Data Security on Cloud, BigDataSecurity 2024
Publisher	Institute of Electrical and Electronics Engineers Inc.
Pages	58-65
Number of pages	8
ISBN (Electronic)	9798350389524
DOIs	https://doi.org/10.1109/BigDataSecurity62737.2024.00018
Publication status	Published - 2024
Event	10th IEEE Conference on Big Data Security on Cloud, BigDataSecurity 2024 - New York City, United States Duration: 10 May 2024 → 12 May 2024

Publication series

Name	Proceedings - 2024 IEEE 10th Conference on Big Data Security on Cloud, BigDataSecurity 2024

Conference

Conference	10th IEEE Conference on Big Data Security on Cloud, BigDataSecurity 2024
Country/Territory	United States
City	New York City
Period	10/05/24 → 12/05/24

Keywords

Accuracy Optimization
Collaborative Inference
Secure Multiparty Computation
Text Classification
Text and Model Privacy

Access to Document

10.1109/BigDataSecurity62737.2024.00018

Cite this

Lin, Y., Jiang, P., Gai, K., & Zhu, L. (2024). Exploiting Multi-Model Collaborative Inference for Privacy Enhancement in Text Classification. In Proceedings - 2024 IEEE 10th Conference on Big Data Security on Cloud, BigDataSecurity 2024 (pp. 58-65). (Proceedings - 2024 IEEE 10th Conference on Big Data Security on Cloud, BigDataSecurity 2024). Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/BigDataSecurity62737.2024.00018

Lin, Yong ; Jiang, Peng ; Gai, Keke et al. / Exploiting Multi-Model Collaborative Inference for Privacy Enhancement in Text Classification. Proceedings - 2024 IEEE 10th Conference on Big Data Security on Cloud, BigDataSecurity 2024. Institute of Electrical and Electronics Engineers Inc., 2024. pp. 58-65 (Proceedings - 2024 IEEE 10th Conference on Big Data Security on Cloud, BigDataSecurity 2024).

@inproceedings{21f47c9c524c4aaf9ee855ba9da40a1f,

title = "Exploiting Multi-Model Collaborative Inference for Privacy Enhancement in Text Classification",

abstract = "Text classification is a foundational task in natural language processing that involves categorizing raw text into pre-defined classes. This task holds significant importance in various applications, including but not limited to sentiment analysis and intent detection. With collaborative inference of multiple models, text classification may achieve an improved performance compared to the single model. However, if multiple models have access to the input text directly, it may create challenges on the privacy of sensitive data or model information. It is not easy to realize collaborative inference while preserving the privacy. This paper presents PPJP, a privacy-preserving joint system that helps achieve private collaborative inference in text classification with machine learning. Our method to instantiate it, is based on secure multiparty computation (MPC) and differential privacy (DP). We fulfill the privacy and scalability of text classification under multiple models inference. Secret-sharing-based MPC is used to protect the input and model parameters, while DP is used to protect against membership inference attack. We implement and evaluate prototype of our PPJP system based on the Twitter dataset. Experimental results show that text classification can guarantee privacy for model owners and clients with 54% inference accuracy. It achieves a balance between privacy and accuracy in case of collaborative inference.",

keywords = "Accuracy Optimization, Collaborative Inference, Secure Multiparty Computation, Text Classification, Text and Model Privacy",

author = "Yong Lin and Peng Jiang and Keke Gai and Liehuang Zhu",

note = "Publisher Copyright: {\textcopyright} 2024 IEEE.; 10th IEEE Conference on Big Data Security on Cloud, BigDataSecurity 2024 ; Conference date: 10-05-2024 Through 12-05-2024",

year = "2024",

doi = "10.1109/BigDataSecurity62737.2024.00018",

language = "English",

series = "Proceedings - 2024 IEEE 10th Conference on Big Data Security on Cloud, BigDataSecurity 2024",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

pages = "58--65",

booktitle = "Proceedings - 2024 IEEE 10th Conference on Big Data Security on Cloud, BigDataSecurity 2024",

address = "United States",

}

Lin, Y, Jiang, P, Gai, K & Zhu, L 2024, Exploiting Multi-Model Collaborative Inference for Privacy Enhancement in Text Classification. in Proceedings - 2024 IEEE 10th Conference on Big Data Security on Cloud, BigDataSecurity 2024. Proceedings - 2024 IEEE 10th Conference on Big Data Security on Cloud, BigDataSecurity 2024, Institute of Electrical and Electronics Engineers Inc., pp. 58-65, 10th IEEE Conference on Big Data Security on Cloud, BigDataSecurity 2024, New York City, United States, 10/05/24. https://doi.org/10.1109/BigDataSecurity62737.2024.00018

Exploiting Multi-Model Collaborative Inference for Privacy Enhancement in Text Classification. / Lin, Yong; Jiang, Peng; Gai, Keke et al.
Proceedings - 2024 IEEE 10th Conference on Big Data Security on Cloud, BigDataSecurity 2024. Institute of Electrical and Electronics Engineers Inc., 2024. p. 58-65 (Proceedings - 2024 IEEE 10th Conference on Big Data Security on Cloud, BigDataSecurity 2024).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

TY - GEN

T1 - Exploiting Multi-Model Collaborative Inference for Privacy Enhancement in Text Classification

AU - Lin, Yong

AU - Jiang, Peng

AU - Gai, Keke

AU - Zhu, Liehuang

PY - 2024

Y1 - 2024

N2 - Text classification is a foundational task in natural language processing that involves categorizing raw text into pre-defined classes. This task holds significant importance in various applications, including but not limited to sentiment analysis and intent detection. With collaborative inference of multiple models, text classification may achieve an improved performance compared to the single model. However, if multiple models have access to the input text directly, it may create challenges on the privacy of sensitive data or model information. It is not easy to realize collaborative inference while preserving the privacy. This paper presents PPJP, a privacy-preserving joint system that helps achieve private collaborative inference in text classification with machine learning. Our method to instantiate it, is based on secure multiparty computation (MPC) and differential privacy (DP). We fulfill the privacy and scalability of text classification under multiple models inference. Secret-sharing-based MPC is used to protect the input and model parameters, while DP is used to protect against membership inference attack. We implement and evaluate prototype of our PPJP system based on the Twitter dataset. Experimental results show that text classification can guarantee privacy for model owners and clients with 54% inference accuracy. It achieves a balance between privacy and accuracy in case of collaborative inference.

AB - Text classification is a foundational task in natural language processing that involves categorizing raw text into pre-defined classes. This task holds significant importance in various applications, including but not limited to sentiment analysis and intent detection. With collaborative inference of multiple models, text classification may achieve an improved performance compared to the single model. However, if multiple models have access to the input text directly, it may create challenges on the privacy of sensitive data or model information. It is not easy to realize collaborative inference while preserving the privacy. This paper presents PPJP, a privacy-preserving joint system that helps achieve private collaborative inference in text classification with machine learning. Our method to instantiate it, is based on secure multiparty computation (MPC) and differential privacy (DP). We fulfill the privacy and scalability of text classification under multiple models inference. Secret-sharing-based MPC is used to protect the input and model parameters, while DP is used to protect against membership inference attack. We implement and evaluate prototype of our PPJP system based on the Twitter dataset. Experimental results show that text classification can guarantee privacy for model owners and clients with 54% inference accuracy. It achieves a balance between privacy and accuracy in case of collaborative inference.

KW - Accuracy Optimization

KW - Collaborative Inference

KW - Secure Multiparty Computation

KW - Text Classification

KW - Text and Model Privacy

UR - http://www.scopus.com/inward/record.url?scp=85197731152&partnerID=8YFLogxK

U2 - 10.1109/BigDataSecurity62737.2024.00018

DO - 10.1109/BigDataSecurity62737.2024.00018

M3 - Conference contribution

AN - SCOPUS:85197731152

T3 - Proceedings - 2024 IEEE 10th Conference on Big Data Security on Cloud, BigDataSecurity 2024

SP - 58

EP - 65

BT - Proceedings - 2024 IEEE 10th Conference on Big Data Security on Cloud, BigDataSecurity 2024

PB - Institute of Electrical and Electronics Engineers Inc.

T2 - 10th IEEE Conference on Big Data Security on Cloud, BigDataSecurity 2024

Y2 - 10 May 2024 through 12 May 2024

ER -

Lin Y, Jiang P, Gai K, Zhu L. Exploiting Multi-Model Collaborative Inference for Privacy Enhancement in Text Classification. In Proceedings - 2024 IEEE 10th Conference on Big Data Security on Cloud, BigDataSecurity 2024. Institute of Electrical and Electronics Engineers Inc. 2024. p. 58-65. (Proceedings - 2024 IEEE 10th Conference on Big Data Security on Cloud, BigDataSecurity 2024). doi: 10.1109/BigDataSecurity62737.2024.00018

Exploiting Multi-Model Collaborative Inference for Privacy Enhancement in Text Classification

Abstract

Publication series

Conference

Keywords

Access to Document

Other files and links

Fingerprint

Cite this