TY - JOUR
T1 - Prediction of loan default based on multi-model fusion
AU - Li, Xingyun
AU - Ergu, Daji
AU - Zhang, Di
AU - Qiu, Dafeng
AU - Cai, Ying
AU - Ma, Bo
N1 - Publisher Copyright:
© 2021 The Authors. Published by Elsevier B.V.
PY - 2021
Y1 - 2021
N2 - With the development of Internet technology, online loans continue to enter the public eye, individuals and small businesses must access to more loan opportunities, and it is important for online loan platforms to effectively reduce the credit crisis associated with customer loan defaults. This paper uses the loan default dataset from lending club. The ADASYN (Adaptive synthetic sampling approach) method is adopted to cope with the class imbalance problem of the dataset. In order to improve the prediction accuracy, this paper utilizes the Blending method to fuse three models: Logistic Regression, Random Forest, and CatBoost. After experimental comparison, it is found that the performance of the fusion model proposed in this paper is better than the three models of Logistic Regression, Random Forest, and CatBoost, which can effectively predict the probability of customer loan default through the training of the dataset and reduce the external risk brought by the online loan platform facing customer loan default.
AB - With the development of Internet technology, online loans continue to enter the public eye, individuals and small businesses must access to more loan opportunities, and it is important for online loan platforms to effectively reduce the credit crisis associated with customer loan defaults. This paper uses the loan default dataset from lending club. The ADASYN (Adaptive synthetic sampling approach) method is adopted to cope with the class imbalance problem of the dataset. In order to improve the prediction accuracy, this paper utilizes the Blending method to fuse three models: Logistic Regression, Random Forest, and CatBoost. After experimental comparison, it is found that the performance of the fusion model proposed in this paper is better than the three models of Logistic Regression, Random Forest, and CatBoost, which can effectively predict the probability of customer loan default through the training of the dataset and reduce the external risk brought by the online loan platform facing customer loan default.
KW - Credit crisis
KW - Loan default
KW - Multi-model fusion
UR - http://www.scopus.com/inward/record.url?scp=85124982891&partnerID=8YFLogxK
U2 - 10.1016/j.procs.2022.01.094
DO - 10.1016/j.procs.2022.01.094
M3 - Conference article
AN - SCOPUS:85124982891
SN - 1877-0509
VL - 199
SP - 757
EP - 764
JO - Procedia Computer Science
JF - Procedia Computer Science
T2 - 8th International Conference on Information Technology and Quantitative Management, ITQM 2020 and 2021
Y2 - 9 July 2021 through 11 July 2021
ER -