TY - JOUR
T1 - Analyzing Structures of Medical Imaging Diagnosis Reports
AU - Yu, Sheng
AU - Huirong, Hu
AU - Congcong, Wang
AU - Shengyi, Yang
N1 - Publisher Copyright:
© 2022, Chinese Academy of Sciences. All rights reserved.
PY - 2022/10/25
Y1 - 2022/10/25
N2 - [Objective] This paper tries to turn medical imaging diagnosis reports into structured data, aiming to effectively extract information from these free-text-reports. [Methods] First, we analyzed the text characteristics of medical imaging diagnosis reports, and proposed a structuring method based on entity recognition and rule extraction. Then, we annotated 800 reports to construct datasets for model evaluation. [Results] The proposed method had a precision rate of 0.87 for all entities from the medical imaging diagnostic reports, which was 4.03% higher than that of the BERT-BiLSTM-CRF. Its recall rate was also 2.81% higher than that of the BERT-BiLSTM-CRF. Compared with the method of dependency analysis, the proposed model improved the recognition precision of medical exam items and results by 5.62% and 2.31%. [Limitations] We only examined the proposed method with diagnostic PET-CT imaging reports from one hospital. [Conclusions] This study successfully converts the free texts of medical imaging diagnostic reports to structured data. It not only optimizes the classification, storage, and retrieval of medical reports, but also provides supports for future research on medical imaging.
AB - [Objective] This paper tries to turn medical imaging diagnosis reports into structured data, aiming to effectively extract information from these free-text-reports. [Methods] First, we analyzed the text characteristics of medical imaging diagnosis reports, and proposed a structuring method based on entity recognition and rule extraction. Then, we annotated 800 reports to construct datasets for model evaluation. [Results] The proposed method had a precision rate of 0.87 for all entities from the medical imaging diagnostic reports, which was 4.03% higher than that of the BERT-BiLSTM-CRF. Its recall rate was also 2.81% higher than that of the BERT-BiLSTM-CRF. Compared with the method of dependency analysis, the proposed model improved the recognition precision of medical exam items and results by 5.62% and 2.31%. [Limitations] We only examined the proposed method with diagnostic PET-CT imaging reports from one hospital. [Conclusions] This study successfully converts the free texts of medical imaging diagnostic reports to structured data. It not only optimizes the classification, storage, and retrieval of medical reports, but also provides supports for future research on medical imaging.
KW - Entity Recognition
KW - Medical Imaging Diagnosis Report
KW - Rule Extraction
KW - Structure
UR - http://www.scopus.com/inward/record.url?scp=85142093335&partnerID=8YFLogxK
U2 - 10.11925/infotech.2096-3467.2022.0085
DO - 10.11925/infotech.2096-3467.2022.0085
M3 - Article
AN - SCOPUS:85142093335
SN - 2096-3467
VL - 6
SP - 46
EP - 56
JO - Data Analysis and Knowledge Discovery
JF - Data Analysis and Knowledge Discovery
IS - 10
ER -