摘要
Nested named entity recognition (NER) is crucial in processing Chinese electronic medical records (EMRs). Recently, the BERT-based model using CNN and a multi-head Biaffine decoder has shown promising results in nested NER on news datasets. However, this model faces difficulties in dealing with the complex and unevenly distributed entities in Chinese EMRs, resulting in prediction errors. This paper proposes an MC-BERT-CGC model based on MC-BERT semantic features comprising Context-Gated Convolution and multi-head Biaffine decoder. Our model initially incorporates Chinese medical language knowledge by leveraging MC-BERT to represent medical descriptions as sentence vectors. We then use Context-Gated Convolution to accurately define the boundaries of nested entities by learning overlapping relationships between different entities. Finally, we use Focal Loss to classify difficult-to-distinguish entities. Experimental results tested on our Chinese EMRs and the CMeEE-V2 dataset show that our model performs better than existing baseline models in Chinese medical NER tasks. The impacts of this study on the life of patients are significant, as more accurate and detailed medical information can be extracted from EMRs, potentially leading to improved diagnoses, personalized treatment recommendations, and proactive identification of health risks. Our code is available at https://github.com/ymlmorning/MC-BERT-CGC.
| 源语言 | 英语 |
|---|---|
| 主期刊名 | Computational Intelligence Methods for Bioinformatics and Biostatistics - 18th International Meeting, CIBB 2023, Revised Selected Papers |
| 编辑 | Martina Vettoretti, Erica Tavazzi, Enrico Longato, Giacomo Baruzzo, Massimo Bellato |
| 出版商 | Springer Science and Business Media Deutschland GmbH |
| 页 | 58-69 |
| 页数 | 12 |
| ISBN(印刷版) | 9783031907135 |
| DOI | |
| 出版状态 | 已出版 - 2025 |
| 已对外发布 | 是 |
| 活动 | 18th International Conference on Computational Intelligence Methods for Bioinformatics and Biostatistics, CIBB 2023 - Padova, 意大利 期限: 6 9月 2023 → 8 9月 2023 |
出版系列
| 姓名 | Lecture Notes in Computer Science |
|---|---|
| 卷 | 14513 LNBI |
| ISSN(印刷版) | 0302-9743 |
| ISSN(电子版) | 1611-3349 |
会议
| 会议 | 18th International Conference on Computational Intelligence Methods for Bioinformatics and Biostatistics, CIBB 2023 |
|---|---|
| 国家/地区 | 意大利 |
| 市 | Padova |
| 时期 | 6/09/23 → 8/09/23 |
联合国可持续发展目标
此成果有助于实现下列可持续发展目标:
-
可持续发展目标 3 良好健康与福祉
指纹
探究 'Nested Named Entity Recognition in Chinese Electronic Medical Records' 的科研主题。它们共同构成独一无二的指纹。引用此
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver