融合预训练模型的端到端语音命名实体识别

Translated title of the contribution: End-to-End Speech Named Entity Recognition with Pretrained Models

Tianwei Lan, Yuhang Guo

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Speech Named Entity Recognition (SNER) aims to recognize the boundary, type and content of named entities in speech from audio, which is one of the important tasks in spoken language understanding. Recognizing named entities directly from speech, that is, the end-to-end method is the current mainstream method of SNER. However, the training corpus for speech named entity recognition is less, and the end-to-end model has the following problems: (1) The recognition effect of the model will be greatly reduced in the case of cross-domain recognition. (2) During the recognition process, the model may miss or mislabel named entities due to phenomena such as homophones, which further affects the accuracy of named entity recognition. Aiming at problem (1), this paper proposes to use a pre-trained entity recognition model to construct a training corpus for speech entity recognition. For problem (2), this paper proposes to use the pre-trained language model to re-score the N-BEST list of speech named entity recognition, and use the external knowledge in the pre-trained model to help the end-to-end model select the best result. In order to verify the domain migration ability of the model, we labeled the MAGICDATA-NER data set with few samples. The experiment on this data shows that the method proposed in this paper has an improvement of 43.29% in F1 value compared with the traditional method.

Translated title of the contributionEnd-to-End Speech Named Entity Recognition with Pretrained Models
Original languageChinese (Traditional)
Title of host publicationProceedings of the 22nd Chinese National Conference on Computational Linguistics, CCL 2023
EditorsMaosong Sun, Bing Qin, Xipeng Qiu, Jing Jiang, Xianpei Han
PublisherAssociation for Computational Linguistics (ACL)
Pages174-185
Number of pages12
ISBN (Electronic)9781713876229
Publication statusPublished - 2023
Event22nd Chinese National Conference on Computational Linguistics, CCL 2023 - Harbin, China
Duration: 3 Aug 20235 Aug 2023

Publication series

NameProceedings of the 22nd Chinese National Conference on Computational Linguistics, CCL 2023
Volume1

Conference

Conference22nd Chinese National Conference on Computational Linguistics, CCL 2023
Country/TerritoryChina
CityHarbin
Period3/08/235/08/23

Fingerprint

Dive into the research topics of 'End-to-End Speech Named Entity Recognition with Pretrained Models'. Together they form a unique fingerprint.

Cite this