TY - GEN
T1 - Exploiting Multi-Model Collaborative Inference for Privacy Enhancement in Text Classification
AU - Lin, Yong
AU - Jiang, Peng
AU - Gai, Keke
AU - Zhu, Liehuang
N1 - Publisher Copyright:
© 2024 IEEE.
PY - 2024
Y1 - 2024
N2 - Text classification is a foundational task in natural language processing that involves categorizing raw text into pre-defined classes. This task holds significant importance in various applications, including but not limited to sentiment analysis and intent detection. With collaborative inference of multiple models, text classification may achieve an improved performance compared to the single model. However, if multiple models have access to the input text directly, it may create challenges on the privacy of sensitive data or model information. It is not easy to realize collaborative inference while preserving the privacy. This paper presents PPJP, a privacy-preserving joint system that helps achieve private collaborative inference in text classification with machine learning. Our method to instantiate it, is based on secure multiparty computation (MPC) and differential privacy (DP). We fulfill the privacy and scalability of text classification under multiple models inference. Secret-sharing-based MPC is used to protect the input and model parameters, while DP is used to protect against membership inference attack. We implement and evaluate prototype of our PPJP system based on the Twitter dataset. Experimental results show that text classification can guarantee privacy for model owners and clients with 54% inference accuracy. It achieves a balance between privacy and accuracy in case of collaborative inference.
AB - Text classification is a foundational task in natural language processing that involves categorizing raw text into pre-defined classes. This task holds significant importance in various applications, including but not limited to sentiment analysis and intent detection. With collaborative inference of multiple models, text classification may achieve an improved performance compared to the single model. However, if multiple models have access to the input text directly, it may create challenges on the privacy of sensitive data or model information. It is not easy to realize collaborative inference while preserving the privacy. This paper presents PPJP, a privacy-preserving joint system that helps achieve private collaborative inference in text classification with machine learning. Our method to instantiate it, is based on secure multiparty computation (MPC) and differential privacy (DP). We fulfill the privacy and scalability of text classification under multiple models inference. Secret-sharing-based MPC is used to protect the input and model parameters, while DP is used to protect against membership inference attack. We implement and evaluate prototype of our PPJP system based on the Twitter dataset. Experimental results show that text classification can guarantee privacy for model owners and clients with 54% inference accuracy. It achieves a balance between privacy and accuracy in case of collaborative inference.
KW - Accuracy Optimization
KW - Collaborative Inference
KW - Secure Multiparty Computation
KW - Text Classification
KW - Text and Model Privacy
UR - http://www.scopus.com/inward/record.url?scp=85197731152&partnerID=8YFLogxK
U2 - 10.1109/BigDataSecurity62737.2024.00018
DO - 10.1109/BigDataSecurity62737.2024.00018
M3 - Conference contribution
AN - SCOPUS:85197731152
T3 - Proceedings - 2024 IEEE 10th Conference on Big Data Security on Cloud, BigDataSecurity 2024
SP - 58
EP - 65
BT - Proceedings - 2024 IEEE 10th Conference on Big Data Security on Cloud, BigDataSecurity 2024
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 10th IEEE Conference on Big Data Security on Cloud, BigDataSecurity 2024
Y2 - 10 May 2024 through 12 May 2024
ER -