TY - JOUR
T1 - Chinese-clinical-record Named Entity Recognition using IDCNN-BiLSTM-Highway Network
AU - Tang, Tinglong
AU - Guo, Yunqiao
AU - Li, Qixin
AU - Zhou, Mate
AU - Huang, Wei
AU - Wu, Yirong
N1 - Publisher Copyright:
Copyright © 2023 KSII.
PY - 2023/7/31
Y1 - 2023/7/31
N2 - Chinese named entity recognition (NER) is a challenging work that seeks to find, recognize and classify various types of information elements in unstructured text. Due to the Chinese text has no natural boundary like the spaces in the English text, Chinese named entity identification is much more difficult. At present, most deep learning based NER models are developed using a bidirectional long short-term memory network (BiLSTM), yet the performance still has some space to improve. To further improve their performance in Chinese NER tasks, we propose a new NER model, IDCNN-BiLSTM-Highway, which is a combination of the BiLSTM, the iterated dilated convolutional neural network (IDCNN) and the highway network. In our model, IDCNN is used to achieve multiscale context aggregation from a long sequence of words. Highway network is used to effectively connect different layers of networks, allowing information to pass through network layers smoothly without attenuation. Finally, the global optimum tag result is obtained by introducing conditional random field (CRF). The experimental results show that compared with other popular deep learning-based NER models, our model shows superior performance on two Chinese NER data sets: Resume and Yidu-S4k, The F1-scores are 94.98 and 77.59, respectively.
AB - Chinese named entity recognition (NER) is a challenging work that seeks to find, recognize and classify various types of information elements in unstructured text. Due to the Chinese text has no natural boundary like the spaces in the English text, Chinese named entity identification is much more difficult. At present, most deep learning based NER models are developed using a bidirectional long short-term memory network (BiLSTM), yet the performance still has some space to improve. To further improve their performance in Chinese NER tasks, we propose a new NER model, IDCNN-BiLSTM-Highway, which is a combination of the BiLSTM, the iterated dilated convolutional neural network (IDCNN) and the highway network. In our model, IDCNN is used to achieve multiscale context aggregation from a long sequence of words. Highway network is used to effectively connect different layers of networks, allowing information to pass through network layers smoothly without attenuation. Finally, the global optimum tag result is obtained by introducing conditional random field (CRF). The experimental results show that compared with other popular deep learning-based NER models, our model shows superior performance on two Chinese NER data sets: Resume and Yidu-S4k, The F1-scores are 94.98 and 77.59, respectively.
KW - Bidirectional long short-term memory network
KW - Chinese named entity recognition
KW - Conditional random field
KW - Highway network
KW - Iterated dilated convolutional neural network
UR - http://www.scopus.com/inward/record.url?scp=85166740861&partnerID=8YFLogxK
U2 - 10.3837/tiis.2023.07.001
DO - 10.3837/tiis.2023.07.001
M3 - Article
AN - SCOPUS:85166740861
SN - 1976-7277
VL - 17
SP - 1759
EP - 1772
JO - KSII Transactions on Internet and Information Systems
JF - KSII Transactions on Internet and Information Systems
IS - 7
ER -