跳到主要导航 跳到搜索 跳到主要内容

Nested Named Entity Recognition in Chinese Electronic Medical Records

  • Beijing Institute of Technology
  • School of Statistics

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Nested named entity recognition (NER) is crucial in processing Chinese electronic medical records (EMRs). Recently, the BERT-based model using CNN and a multi-head Biaffine decoder has shown promising results in nested NER on news datasets. However, this model faces difficulties in dealing with the complex and unevenly distributed entities in Chinese EMRs, resulting in prediction errors. This paper proposes an MC-BERT-CGC model based on MC-BERT semantic features comprising Context-Gated Convolution and multi-head Biaffine decoder. Our model initially incorporates Chinese medical language knowledge by leveraging MC-BERT to represent medical descriptions as sentence vectors. We then use Context-Gated Convolution to accurately define the boundaries of nested entities by learning overlapping relationships between different entities. Finally, we use Focal Loss to classify difficult-to-distinguish entities. Experimental results tested on our Chinese EMRs and the CMeEE-V2 dataset show that our model performs better than existing baseline models in Chinese medical NER tasks. The impacts of this study on the life of patients are significant, as more accurate and detailed medical information can be extracted from EMRs, potentially leading to improved diagnoses, personalized treatment recommendations, and proactive identification of health risks. Our code is available at https://github.com/ymlmorning/MC-BERT-CGC.

源语言英语
主期刊名Computational Intelligence Methods for Bioinformatics and Biostatistics - 18th International Meeting, CIBB 2023, Revised Selected Papers
编辑Martina Vettoretti, Erica Tavazzi, Enrico Longato, Giacomo Baruzzo, Massimo Bellato
出版商Springer Science and Business Media Deutschland GmbH
58-69
页数12
ISBN(印刷版)9783031907135
DOI
出版状态已出版 - 2025
已对外发布
活动18th International Conference on Computational Intelligence Methods for Bioinformatics and Biostatistics, CIBB 2023 - Padova, 意大利
期限: 6 9月 20238 9月 2023

出版系列

姓名Lecture Notes in Computer Science
14513 LNBI
ISSN(印刷版)0302-9743
ISSN(电子版)1611-3349

会议

会议18th International Conference on Computational Intelligence Methods for Bioinformatics and Biostatistics, CIBB 2023
国家/地区意大利
Padova
时期6/09/238/09/23

联合国可持续发展目标

此成果有助于实现下列可持续发展目标:

  1. 可持续发展目标 3 - 良好健康与福祉
    可持续发展目标 3 良好健康与福祉

指纹

探究 'Nested Named Entity Recognition in Chinese Electronic Medical Records' 的科研主题。它们共同构成独一无二的指纹。

引用此