Exploiting Multi-Model Collaborative Inference for Privacy Enhancement in Text Classification

Yong Lin, Peng Jiang*, Keke Gai, Liehuang Zhu

*此作品的通讯作者

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Text classification is a foundational task in natural language processing that involves categorizing raw text into pre-defined classes. This task holds significant importance in various applications, including but not limited to sentiment analysis and intent detection. With collaborative inference of multiple models, text classification may achieve an improved performance compared to the single model. However, if multiple models have access to the input text directly, it may create challenges on the privacy of sensitive data or model information. It is not easy to realize collaborative inference while preserving the privacy. This paper presents PPJP, a privacy-preserving joint system that helps achieve private collaborative inference in text classification with machine learning. Our method to instantiate it, is based on secure multiparty computation (MPC) and differential privacy (DP). We fulfill the privacy and scalability of text classification under multiple models inference. Secret-sharing-based MPC is used to protect the input and model parameters, while DP is used to protect against membership inference attack. We implement and evaluate prototype of our PPJP system based on the Twitter dataset. Experimental results show that text classification can guarantee privacy for model owners and clients with 54% inference accuracy. It achieves a balance between privacy and accuracy in case of collaborative inference.

源语言英语
主期刊名Proceedings - 2024 IEEE 10th Conference on Big Data Security on Cloud, BigDataSecurity 2024
出版商Institute of Electrical and Electronics Engineers Inc.
58-65
页数8
ISBN(电子版)9798350389524
DOI
出版状态已出版 - 2024
活动10th IEEE Conference on Big Data Security on Cloud, BigDataSecurity 2024 - New York City, 美国
期限: 10 5月 202412 5月 2024

出版系列

姓名Proceedings - 2024 IEEE 10th Conference on Big Data Security on Cloud, BigDataSecurity 2024

会议

会议10th IEEE Conference on Big Data Security on Cloud, BigDataSecurity 2024
国家/地区美国
New York City
时期10/05/2412/05/24

指纹

探究 'Exploiting Multi-Model Collaborative Inference for Privacy Enhancement in Text Classification' 的科研主题。它们共同构成独一无二的指纹。

引用此