Skip to main navigation Skip to search Skip to main content

Medical Knowledge-Driven Contrastive Learning for Similar Patient Retrieval

  • Fanqing Meng
  • , Chong Feng
  • , Ge Shi*
  • , Xia Liu
  • , Bo Wang
  • , Kaiyuan Zhang
  • , Yan Zhuang
  • *Corresponding author for this work
  • Beijing Institute of Technology
  • China-Japan Friendship Hospital
  • General Hospital of People's Liberation Army

Research output: Contribution to journalArticlepeer-review

Abstract

Similar patient retrieval is a fundamental task in medical informatics, aiming to identify patients with similar clinical characteristics to assist in diagnosis and treatment plan recommendation. While traditional methods relying on lexical features or medical ontologies often fail to capture implicit semantic relationships, recent advancements in dense retrieval methods powered by deep learning have shown promise yet face challenges in adapting to specific tasks such as similar patient retrieval. To address these limitations, we propose a medical knowledge-driven contrastive learning approach to enhance the representation capacity of general-purpose embedding models for medical text. Specifically, our approach introduces a novel negative sampling strategy leveraging International Classification of Diseases (ICD) codes to identify hard negatives. However, due to data imbalance issues, this method struggles to adequately mine negative examples. To overcome this limitation, we develop an external knowledge-based negative sampling method that incorporates both statistical and ambiguous knowledge, thereby enhancing the model’s ability to differentiate between fine-grained medical conditions and complex clinical scenarios. We then integrate these methods into a contrastive learning framework to train more robust patient representations. Extensive experiments on real-world medical datasets show that our proposed method achieves significant improvements over existing state-of-the-art baseline models.

Original languageEnglish
JournalIEEE Journal of Biomedical and Health Informatics
DOIs
Publication statusAccepted/In press - 2026

Keywords

  • Similar patient retrieval
  • contrastive learning
  • medical knowledge
  • negative sampling

Fingerprint

Dive into the research topics of 'Medical Knowledge-Driven Contrastive Learning for Similar Patient Retrieval'. Together they form a unique fingerprint.

Cite this