Prediction of loan default based on multi-model fusion

Xingyun Li, Daji Ergu*, Di Zhang, Dafeng Qiu, Ying Cai, Bo Ma

*Corresponding author for this work

Research output: Contribution to journalConference articlepeer-review

12 Citations (Scopus)

Abstract

With the development of Internet technology, online loans continue to enter the public eye, individuals and small businesses must access to more loan opportunities, and it is important for online loan platforms to effectively reduce the credit crisis associated with customer loan defaults. This paper uses the loan default dataset from lending club. The ADASYN (Adaptive synthetic sampling approach) method is adopted to cope with the class imbalance problem of the dataset. In order to improve the prediction accuracy, this paper utilizes the Blending method to fuse three models: Logistic Regression, Random Forest, and CatBoost. After experimental comparison, it is found that the performance of the fusion model proposed in this paper is better than the three models of Logistic Regression, Random Forest, and CatBoost, which can effectively predict the probability of customer loan default through the training of the dataset and reduce the external risk brought by the online loan platform facing customer loan default.

Original languageEnglish
Pages (from-to)757-764
Number of pages8
JournalProcedia Computer Science
Volume199
DOIs
Publication statusPublished - 2021
Externally publishedYes
Event8th International Conference on Information Technology and Quantitative Management, ITQM 2020 and 2021 - Chengdu, China
Duration: 9 Jul 202111 Jul 2021

Keywords

  • Credit crisis
  • Loan default
  • Multi-model fusion

Fingerprint

Dive into the research topics of 'Prediction of loan default based on multi-model fusion'. Together they form a unique fingerprint.

Cite this