Abstract
With the development of Internet technology, online loans continue to enter the public eye, individuals and small businesses must access to more loan opportunities, and it is important for online loan platforms to effectively reduce the credit crisis associated with customer loan defaults. This paper uses the loan default dataset from lending club. The ADASYN (Adaptive synthetic sampling approach) method is adopted to cope with the class imbalance problem of the dataset. In order to improve the prediction accuracy, this paper utilizes the Blending method to fuse three models: Logistic Regression, Random Forest, and CatBoost. After experimental comparison, it is found that the performance of the fusion model proposed in this paper is better than the three models of Logistic Regression, Random Forest, and CatBoost, which can effectively predict the probability of customer loan default through the training of the dataset and reduce the external risk brought by the online loan platform facing customer loan default.
Original language | English |
---|---|
Pages (from-to) | 757-764 |
Number of pages | 8 |
Journal | Procedia Computer Science |
Volume | 199 |
DOIs | |
Publication status | Published - 2021 |
Externally published | Yes |
Event | 8th International Conference on Information Technology and Quantitative Management, ITQM 2020 and 2021 - Chengdu, China Duration: 9 Jul 2021 → 11 Jul 2021 |
Keywords
- Credit crisis
- Loan default
- Multi-model fusion