Exploiting Multi-Model Collaborative Inference for Privacy Enhancement in Text Classification

Yong Lin, Peng Jiang*, Keke Gai, Liehuang Zhu

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Text classification is a foundational task in natural language processing that involves categorizing raw text into pre-defined classes. This task holds significant importance in various applications, including but not limited to sentiment analysis and intent detection. With collaborative inference of multiple models, text classification may achieve an improved performance compared to the single model. However, if multiple models have access to the input text directly, it may create challenges on the privacy of sensitive data or model information. It is not easy to realize collaborative inference while preserving the privacy. This paper presents PPJP, a privacy-preserving joint system that helps achieve private collaborative inference in text classification with machine learning. Our method to instantiate it, is based on secure multiparty computation (MPC) and differential privacy (DP). We fulfill the privacy and scalability of text classification under multiple models inference. Secret-sharing-based MPC is used to protect the input and model parameters, while DP is used to protect against membership inference attack. We implement and evaluate prototype of our PPJP system based on the Twitter dataset. Experimental results show that text classification can guarantee privacy for model owners and clients with 54% inference accuracy. It achieves a balance between privacy and accuracy in case of collaborative inference.

Original languageEnglish
Title of host publicationProceedings - 2024 IEEE 10th Conference on Big Data Security on Cloud, BigDataSecurity 2024
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages58-65
Number of pages8
ISBN (Electronic)9798350389524
DOIs
Publication statusPublished - 2024
Event10th IEEE Conference on Big Data Security on Cloud, BigDataSecurity 2024 - New York City, United States
Duration: 10 May 202412 May 2024

Publication series

NameProceedings - 2024 IEEE 10th Conference on Big Data Security on Cloud, BigDataSecurity 2024

Conference

Conference10th IEEE Conference on Big Data Security on Cloud, BigDataSecurity 2024
Country/TerritoryUnited States
CityNew York City
Period10/05/2412/05/24

Keywords

  • Accuracy Optimization
  • Collaborative Inference
  • Secure Multiparty Computation
  • Text Classification
  • Text and Model Privacy

Fingerprint

Dive into the research topics of 'Exploiting Multi-Model Collaborative Inference for Privacy Enhancement in Text Classification'. Together they form a unique fingerprint.

Cite this