Multi-level semantic enhancement based on self-distillation BERT for Chinese named entity recognition

Zepeng Li; Shuo Cao; Minyu Zhai; Nengneng Ding; Zhenwen Zhang; Bin Hu

doi:10.1016/j.neucom.2024.127637

Multi-level semantic enhancement based on self-distillation BERT for Chinese named entity recognition

Zepeng Li, Shuo Cao, Minyu Zhai, Nengneng Ding, Zhenwen Zhang, Bin Hu^*

^*此作品的通讯作者

医学技术学院

科研成果: 期刊稿件 › 文章 › 同行评审

4 引用（Scopus）

摘要

As an important foundational task in the field of natural language processing, the Chinese named entity recognition (NER) task has received widespread attention in recent years. Self-distillation plays a role in exploring the potential of the knowledge carried by internal parameters in the BERT NER model, but few studies have noticed the impact of different granularity semantic information during the distillation process. In this paper, we propose a multi-level semantic enhancement approach based on self-distillation BERT for Chinese named entity recognition. We first design a feasible data augmentation method to improve the training quality for handling complex entity compositions, then construct a boundary smoothing module to achieve the model's moderate learning on entity boundaries. Besides, we utilize the distillation reweighting method to let the model acquire balanced entity and context knowledge. Experimental results on two Chinese named entity recognition benchmark datasets Weibo and Resume have 72.09% and 96.93% F1 scores, respectively. Compared to three different basic distillation BERT models, our model can also produce better results. The source code is available at https://github.com/lookmedandan/MSE.

源语言	英语
文章编号	127637
期刊	Neurocomputing
卷	586
DOI	https://doi.org/10.1016/j.neucom.2024.127637
出版状态	已出版 - 14 6月 2024

访问文件

10.1016/j.neucom.2024.127637

其它文件与链接

链接到 Scopus 的出版物

引用此

@article{81be822990874f29935dc23b9c1e6aa2,

title = "Multi-level semantic enhancement based on self-distillation BERT for Chinese named entity recognition",

abstract = "As an important foundational task in the field of natural language processing, the Chinese named entity recognition (NER) task has received widespread attention in recent years. Self-distillation plays a role in exploring the potential of the knowledge carried by internal parameters in the BERT NER model, but few studies have noticed the impact of different granularity semantic information during the distillation process. In this paper, we propose a multi-level semantic enhancement approach based on self-distillation BERT for Chinese named entity recognition. We first design a feasible data augmentation method to improve the training quality for handling complex entity compositions, then construct a boundary smoothing module to achieve the model's moderate learning on entity boundaries. Besides, we utilize the distillation reweighting method to let the model acquire balanced entity and context knowledge. Experimental results on two Chinese named entity recognition benchmark datasets Weibo and Resume have 72.09% and 96.93% F1 scores, respectively. Compared to three different basic distillation BERT models, our model can also produce better results. The source code is available at https://github.com/lookmedandan/MSE.",

keywords = "Data augmentation, Distillation reweighting, Label smoothing, Named entity recognition, Semantic information",

author = "Zepeng Li and Shuo Cao and Minyu Zhai and Nengneng Ding and Zhenwen Zhang and Bin Hu",

note = "Publisher Copyright: {\textcopyright} 2024",

year = "2024",

month = jun,

day = "14",

doi = "10.1016/j.neucom.2024.127637",

language = "English",

volume = "586",

journal = "Neurocomputing",

issn = "0925-2312",

publisher = "Elsevier B.V.",

}

TY - JOUR

T1 - Multi-level semantic enhancement based on self-distillation BERT for Chinese named entity recognition

AU - Li, Zepeng

AU - Cao, Shuo

AU - Zhai, Minyu

AU - Ding, Nengneng

AU - Zhang, Zhenwen

AU - Hu, Bin

PY - 2024/6/14

Y1 - 2024/6/14

N2 - As an important foundational task in the field of natural language processing, the Chinese named entity recognition (NER) task has received widespread attention in recent years. Self-distillation plays a role in exploring the potential of the knowledge carried by internal parameters in the BERT NER model, but few studies have noticed the impact of different granularity semantic information during the distillation process. In this paper, we propose a multi-level semantic enhancement approach based on self-distillation BERT for Chinese named entity recognition. We first design a feasible data augmentation method to improve the training quality for handling complex entity compositions, then construct a boundary smoothing module to achieve the model's moderate learning on entity boundaries. Besides, we utilize the distillation reweighting method to let the model acquire balanced entity and context knowledge. Experimental results on two Chinese named entity recognition benchmark datasets Weibo and Resume have 72.09% and 96.93% F1 scores, respectively. Compared to three different basic distillation BERT models, our model can also produce better results. The source code is available at https://github.com/lookmedandan/MSE.

AB - As an important foundational task in the field of natural language processing, the Chinese named entity recognition (NER) task has received widespread attention in recent years. Self-distillation plays a role in exploring the potential of the knowledge carried by internal parameters in the BERT NER model, but few studies have noticed the impact of different granularity semantic information during the distillation process. In this paper, we propose a multi-level semantic enhancement approach based on self-distillation BERT for Chinese named entity recognition. We first design a feasible data augmentation method to improve the training quality for handling complex entity compositions, then construct a boundary smoothing module to achieve the model's moderate learning on entity boundaries. Besides, we utilize the distillation reweighting method to let the model acquire balanced entity and context knowledge. Experimental results on two Chinese named entity recognition benchmark datasets Weibo and Resume have 72.09% and 96.93% F1 scores, respectively. Compared to three different basic distillation BERT models, our model can also produce better results. The source code is available at https://github.com/lookmedandan/MSE.

KW - Data augmentation

KW - Distillation reweighting

KW - Label smoothing

KW - Named entity recognition

KW - Semantic information

UR - http://www.scopus.com/inward/record.url?scp=85190069347&partnerID=8YFLogxK

U2 - 10.1016/j.neucom.2024.127637

DO - 10.1016/j.neucom.2024.127637

M3 - Article

AN - SCOPUS:85190069347

SN - 0925-2312

VL - 586

JO - Neurocomputing

JF - Neurocomputing

M1 - 127637

ER -

Multi-level semantic enhancement based on self-distillation BERT for Chinese named entity recognition

摘要

访问文件

其它文件与链接

指纹

引用此