TY - JOUR
T1 - VisPhone
T2 - Chinese named entity recognition model enhanced by visual and phonetic features
AU - Zhang, Baohua
AU - Cai, Jiahao
AU - Zhang, Huaping
AU - Shang, Jianyun
N1 - Publisher Copyright:
© 2023 The Authors
PY - 2023/5
Y1 - 2023/5
N2 - Many Chinese NER models only focus on lexical and radical information, ignoring the fact that there are also certain rules for the pronunciation of Chinese entities. In this paper, we propose VisPhone, which incorporates Chinese characters’ Phonetic features into Transformer Encoder along with the Lattice and Visual features. We present the common rules for the pronunciation of Chinese entities and explore the most appropriate method to encode it. VisPhone uses two identical cross transformer encoders to fuse the visual and phonetic features of the input characters with the text embedding. A selective fusion module is used to get the final features. We conducted experiments on four well-known Chinese NER benchmark datasets: OntoNotes4.0, MSRA, Resume, and Weibo, with F1 scores of 82.63%, 96.07%, 96.26%, 70.79% respectively, improving the performance by 0.79%, 0.32%, 0.39%, and 3.47%. Our ablation experiments have also demonstrated the effectiveness of VisPhone.
AB - Many Chinese NER models only focus on lexical and radical information, ignoring the fact that there are also certain rules for the pronunciation of Chinese entities. In this paper, we propose VisPhone, which incorporates Chinese characters’ Phonetic features into Transformer Encoder along with the Lattice and Visual features. We present the common rules for the pronunciation of Chinese entities and explore the most appropriate method to encode it. VisPhone uses two identical cross transformer encoders to fuse the visual and phonetic features of the input characters with the text embedding. A selective fusion module is used to get the final features. We conducted experiments on four well-known Chinese NER benchmark datasets: OntoNotes4.0, MSRA, Resume, and Weibo, with F1 scores of 82.63%, 96.07%, 96.26%, 70.79% respectively, improving the performance by 0.79%, 0.32%, 0.39%, and 3.47%. Our ablation experiments have also demonstrated the effectiveness of VisPhone.
KW - Chinese NER
KW - Cross transformer
KW - Phonetic information
KW - Selective fusion
KW - Visual information
UR - http://www.scopus.com/inward/record.url?scp=85148077878&partnerID=8YFLogxK
U2 - 10.1016/j.ipm.2023.103314
DO - 10.1016/j.ipm.2023.103314
M3 - Article
AN - SCOPUS:85148077878
SN - 0306-4573
VL - 60
JO - Information Processing and Management
JF - Information Processing and Management
IS - 3
M1 - 103314
ER -