跳到主要导航 跳到搜索 跳到主要内容

CRLEDD: Regularized Causalities Learning for Early Detection of Diseases Using Electronic Health Record (EHR) Data

  • Jiang Bian*
  • , Sijia Yang
  • , Haoyi Xiong
  • , Licheng Wang
  • , Yanjie Fu
  • , Zeyi Sun
  • , Zhishan Guo
  • *此作品的通讯作者
  • University of Central Florida
  • Beijing University of Posts and Telecommunications
  • Baidu Inc
  • Missouri University of Science and Technology

科研成果: 期刊稿件文章同行评审

摘要

The availability of Electronic Health Records (EHR) in health care settings has provided tremendous opportunities for early disease detection. While many supervised learning models have been adopted for EHR-based disease early detection, the ill-posed inverse problem in the parameter learning has imposed a significant challenge on improving the accuracy of these algorithms. In this paper, we propose CRLEDD - Causality-Regularized Learning for Early Detection of Disease, an algorithm to improve the performance of Linear Discriminant Analysis (LDA) on top of diagnosis-frequency vector data representation. While most existing regularization methods exploit sparsity regularization to improve detection performance, CRLEDD provides a unique perspective by ensuring positive semi-definiteness of the sparsified precision matrix used in LDA which is different from the regular regularization method (e.g., L2 regularization). To achieve this goal, CRLEDD employs Graphical Lasso to estimate the precision matrix in the ill-posed settings for enhanced accuracy of LDA classifiers. We perform extensive evaluation of CRLEDD using a large-scale real-world EHR dataset to predict mental health disorders (e.g., depression and anxiety) of college students from 10 universities in the U.S. We compare CRLEDD with other regularized LDA and downstream classifiers. The result shows that CRLEDD outperforms all baselines in terms of accuracy and F1 scores.

源语言英语
文章编号9163317
页(从-至)541-553
页数13
期刊IEEE Transactions on Emerging Topics in Computational Intelligence
5
4
DOI
出版状态已出版 - 8月 2021
已对外发布

指纹

探究 'CRLEDD: Regularized Causalities Learning for Early Detection of Diseases Using Electronic Health Record (EHR) Data' 的科研主题。它们共同构成独一无二的指纹。

引用此