SVM Training Mechanism for Secure Sharing of Credit Data

Meng Shen, Jie Zhang, Lie Huang Zhu*, Ke Xu, Kai Xiang Zhang, Hui Zhong Li, Xiang Yun Tang

*Corresponding author for this work

Research output: Contribution to journal, Article, peer-reviewed

2 Citations (Scopus)

Abstract

In the credit reporting industry, the richness and diversity of credit data are extremely important for the development of credit evaluation. However, the credit data held by credit reporting agencies, especially small ones, is often incomplete in content, limited in type, and insufficient in number of instances, so data sharing among agencies is very important. In practical scenarios, credit data is highly valuable, strongly private, and easy to copy without authorization, which creates serious security challenges when it is shared. To address this problem, this paper proposes an SVM training mechanism for the secure sharing of credit data and designs a system prototype based on it, as shown in Figure 3 of the manuscript. The mechanism is built on a consortium blockchain and the additively homomorphic encryption scheme Paillier. Because blockchain is decentralized, the mechanism does not need to rely on any trusted third party during model training. At the same time, through secure collaborative computation between credit reporting agencies, it meets the model trainer's credit evaluation needs without revealing private data. First, the shared data is encrypted and stored on the blockchain, so that it is secure and cannot be tampered with. This process is carried out through smart contracts, without a third party acting as a data sharing platform. Second, based on Paillier, this paper implements the secure operations required for SVM training with the stochastic gradient descent algorithm and designs a secure SVM training algorithm that follows the training process. The algorithm flow is shown in Algorithm 2 of the manuscript.
Based on this algorithm, the credit reporting agencies participating in the computation can operate on the shared encrypted data, so that the model trainer can train the credit evaluation model without the original data being leaked. During training, only the data provider and the model trainer take part in the computation. Computation on the encrypted data requires no assistance from a third party, which avoids the privacy leakage risk that introducing one would bring. The proposed mechanism is validated by a security analysis: under the threat model, neither the model trainer's model parameters nor the data provider's original data are leaked. The paper also evaluates the usability and performance of the mechanism through experiments on real-world datasets. The results show that, compared with a model trained on the plaintext dataset, the model trained with the proposed mechanism loses no accuracy and its training time is acceptable. To further assess the scheme, a comparative experiment against other similar privacy-preserving training schemes is carried out; the results show that the computation time of this mechanism on the experimental dataset is less than 5% of that of the comparison mechanism. Thanks to its decentralized training, the scheme also shows promise for practical application scenarios.
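
The additive homomorphism that the abstract relies on can be illustrated with a minimal sketch. This is not the paper's Algorithm 2 or its protocol, which the abstract does not spell out; it only shows, under stated assumptions, how a party holding plaintext SVM weights could compute an encrypted decision value w·x + b from Paillier-encrypted integer features. The function names (`keygen`, `encrypt`, `decrypt`, `add`, `mul_const`, `encrypted_dot`), the toy primes, and the integer encoding of features are all hypothetical choices made for illustration, and the key size here is far too small to be secure.

```python
import math
import random


def keygen(p=1789, q=1861):
    """Toy Paillier key generation. The small primes are an assumption for
    illustration only; real use needs primes of roughly 1024 bits or more."""
    n = p * q
    lam = (p - 1) * (q - 1) // math.gcd(p - 1, q - 1)  # lcm(p-1, q-1)
    g = n + 1  # common simple choice of generator
    # mu = (L(g^lam mod n^2))^-1 mod n, where L(u) = (u - 1) // n
    mu = pow((pow(g, lam, n * n) - 1) // n, -1, n)
    return (n, g), (lam, mu)


def encrypt(pub, m):
    """Encrypt integer m (negative values are encoded modulo n)."""
    n, g = pub
    while True:
        r = random.randrange(1, n)
        if math.gcd(r, n) == 1:
            break
    return (pow(g, m % n, n * n) * pow(r, n, n * n)) % (n * n)


def decrypt(pub, priv, c):
    """Decrypt c and map the result back to a signed integer."""
    n, _ = pub
    lam, mu = priv
    m = ((pow(c, lam, n * n) - 1) // n * mu) % n
    return m - n if m > n // 2 else m


def add(pub, c1, c2):
    """Enc(a) * Enc(b) mod n^2 decrypts to a + b."""
    n, _ = pub
    return (c1 * c2) % (n * n)


def mul_const(pub, c, k):
    """Enc(a)^k mod n^2 decrypts to k * a, for a plaintext integer k."""
    n, _ = pub
    return pow(c, k % n, n * n)


def encrypted_dot(pub, enc_x, w, b):
    """Compute Enc(w . x + b) from encrypted features and plaintext weights."""
    acc = encrypt(pub, b)
    for c, wi in zip(enc_x, w):
        acc = add(pub, acc, mul_const(pub, c, wi))
    return acc


if __name__ == "__main__":
    pub, priv = keygen()
    x = [4, 7, -1]   # hypothetical integer-encoded features of one record
    w = [3, -2, 5]   # hypothetical plaintext SVM weights
    enc_x = [encrypt(pub, v) for v in x]
    score = decrypt(pub, priv, encrypted_dot(pub, enc_x, w, 2))
    print(score)
```

In a hinge-loss SGD step, a value of this form multiplied by the label decides whether a sample contributes to the gradient. How the paper splits these operations between the data provider and the model trainer is described in the manuscript, not here.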

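Translated title of the contribution: SVM Training Mechanism for Secure Sharing of Credit Data
Original language: Traditional Chinese
Pages (from-to): 696-708
Number of pages: 13
Journal: Jisuanji Xuebao/Chinese Journal of Computers
Volume: 44
Issue number: 4
DOI
Publication status: Published - Apr 2021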

Keywords

  • Consortium blockchain
  • Credit data
  • Homomorphic encryption
  • Privacy preserving
  • Support vector machine
