VisPhone: Chinese named entity recognition model enhanced by visual and phonetic features

Baohua Zhang, Jiahao Cai, Huaping Zhang*, Jianyun Shang

*此作品的通讯作者

科研成果: 期刊稿件文章同行评审

16 引用 (Scopus)

摘要

Many Chinese NER models only focus on lexical and radical information, ignoring the fact that there are also certain rules for the pronunciation of Chinese entities. In this paper, we propose VisPhone, which incorporates Chinese characters’ Phonetic features into Transformer Encoder along with the Lattice and Visual features. We present the common rules for the pronunciation of Chinese entities and explore the most appropriate method to encode it. VisPhone uses two identical cross transformer encoders to fuse the visual and phonetic features of the input characters with the text embedding. A selective fusion module is used to get the final features. We conducted experiments on four well-known Chinese NER benchmark datasets: OntoNotes4.0, MSRA, Resume, and Weibo, with F1 scores of 82.63%, 96.07%, 96.26%, 70.79% respectively, improving the performance by 0.79%, 0.32%, 0.39%, and 3.47%. Our ablation experiments have also demonstrated the effectiveness of VisPhone.

源语言英语
文章编号103314
期刊Information Processing and Management
60
3
DOI
出版状态已出版 - 5月 2023

指纹

探究 'VisPhone: Chinese named entity recognition model enhanced by visual and phonetic features' 的科研主题。它们共同构成独一无二的指纹。

引用此