CRLEDD: Regularized Causalities Learning for Early Detection of Diseases Using Electronic Health Record (EHR) Data

  • Jiang Bian*
  • Sijia Yang
  • Haoyi Xiong
  • Licheng Wang
  • Yanjie Fu
  • Zeyi Sun
  • Zhishan Guo
  • *Corresponding author for this work

Research output: Contribution to journal, Article, peer-review

9 Citations (Scopus)

Abstract

The availability of Electronic Health Records (EHR) in health care settings has provided tremendous opportunities for early disease detection. While many supervised learning models have been adopted for EHR-based early disease detection, the ill-posed inverse problem in parameter learning poses a significant challenge to improving the accuracy of these algorithms. In this paper, we propose CRLEDD (Causality-Regularized Learning for Early Detection of Disease), an algorithm that improves the performance of Linear Discriminant Analysis (LDA) on top of a diagnosis-frequency vector data representation. While most existing regularization methods exploit sparsity regularization to improve detection performance, CRLEDD takes a different perspective: it ensures positive semi-definiteness of the sparsified precision matrix used in LDA, which distinguishes it from conventional regularization methods (e.g., L2 regularization). To achieve this goal, CRLEDD employs Graphical Lasso to estimate the precision matrix in ill-posed settings, enhancing the accuracy of LDA classifiers. We perform an extensive evaluation of CRLEDD on a large-scale real-world EHR dataset to predict mental health disorders (e.g., depression and anxiety) among college students from 10 universities in the U.S. We compare CRLEDD with other regularized LDA classifiers and downstream classifiers. The results show that CRLEDD outperforms all baselines in terms of accuracy and F1 score.
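The core idea in the abstract, plugging a Graphical Lasso estimate of the precision matrix into the LDA decision rule, can be illustrated with a minimal sketch. This is not the authors' implementation: the synthetic Poisson counts stand in for diagnosis-frequency vectors, and the function names `fit_glasso_lda` and `predict`, the penalty `alpha=0.1`, and the class rates are all assumptions made for illustration. It relies on scikit-learn's `GraphicalLasso` estimator.

```python
import numpy as np
from sklearn.covariance import GraphicalLasso


def fit_glasso_lda(X, y, alpha=0.1):
    """Two-class LDA whose precision matrix comes from Graphical Lasso.

    Hypothetical helper for illustration; alpha is an assumed penalty value.
    """
    X0, X1 = X[y == 0], X[y == 1]
    mu0, mu1 = X0.mean(axis=0), X1.mean(axis=0)
    # Pool the class-centered samples so one shared covariance is estimated.
    centered = np.vstack([X0 - mu0, X1 - mu1])
    model = GraphicalLasso(alpha=alpha, assume_centered=True, max_iter=200)
    model.fit(centered)
    # Symmetrize to guard against small numerical asymmetry in the estimate.
    theta = (model.precision_ + model.precision_.T) / 2.0
    # Linear discriminant: predict class 1 when x @ w + b > 0.
    w = theta @ (mu1 - mu0)
    b = -0.5 * (mu1 + mu0) @ w + np.log(len(X1) / len(X0))
    return theta, w, b


def predict(X, w, b):
    return (X @ w + b > 0).astype(int)


# Synthetic stand-in for diagnosis-frequency vectors (assumption):
# each row counts how often each of 10 diagnosis codes appears for a patient.
rng = np.random.default_rng(0)
rates0 = np.ones(10)
rates1 = np.ones(10)
rates1[:3] = 2.5  # three codes occur more often in the positive class
X = np.vstack([
    rng.poisson(rates0, size=(100, 10)),
    rng.poisson(rates1, size=(100, 10)),
]).astype(float)
y = np.array([0] * 100 + [1] * 100)

theta, w, b = fit_glasso_lda(X, y)
accuracy = (predict(X, w, b) == y).mean()
print("training accuracy:", accuracy)
print("nonzero off-diagonal precision entries:",
      int((np.abs(theta) > 1e-8).sum() - theta.shape[0]))
```

The L1 penalty in Graphical Lasso yields a sparse precision matrix that remains positive definite, which is the property the abstract contrasts with conventional sparsity or L2 regularization. The accuracy printed here is measured on the training data and says nothing about the results reported in the paper.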

Original language: English
Article number: 9163317
Pages (from-to): 541-553
Number of pages: 13
Journal: IEEE Transactions on Emerging Topics in Computational Intelligence
Volume: 5
Issue number: 4
Publication status: Published - Aug 2021
Externally published: Yes

Keywords

  • Classification algorithms
  • detection algorithms
  • linear discriminant analysis
