CATE: Contrastive augmentation and tree-enhanced embedding for credit scoring

Ying Gao, Haolang Xiao, Choujun Zhan, Lingrui Liang, Wentian Cai, Xiping Hu*

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

4 Citations (Scopus)

Abstract

Credit transactions are vital financial activities that yield substantial economic benefits. To further improve lending decisions, stakeholders require accurate and interpretable credit scoring methods. While the majority of previous studies have focused on the relationship between individual features and credit risk, only a few have investigated cross-features. Notably, cross-features can not only represent structured data effectively but also provide richer semantic information than individual features. Nevertheless, most previous methods for learning cross-feature effects from credit data have been implicit and unexplainable. This paper proposes a new credit scoring model based on contrastive augmentation and tree-enhanced embedding mechanisms, termed CATE. The proposed model automatically constructs explainable cross-features by using tree-based models to learn decision rules from the data. Moreover, the importance of each local cross-feature is then derived through an attention mechanism. Finally, the credit score of a user is evaluated using embedding vectors. Experimental results on 4 public datasets demonstrated the interpretability of our proposed method and outperformed 13 state-of-the-art benchmark methods in terms of performance.

Original languageEnglish
Article number119447
JournalInformation Sciences
Volume651
DOIs
Publication statusPublished - Dec 2023

Keywords

  • Attention mechanism
  • Credit scoring
  • Feature extraction
  • Supervised contrastive learning
  • Tree-based model

Fingerprint

Dive into the research topics of 'CATE: Contrastive augmentation and tree-enhanced embedding for credit scoring'. Together they form a unique fingerprint.

Cite this