Named entity recognition based on bilingual co-training

Yegang Li, Heyan Huang*, Xingjian Zhao, Shumin Shi

*此作品的通讯作者

科研成果: 书/报告/会议事项章节会议稿件同行评审

3 引用 (Scopus)

摘要

Named entity recognition (NER) is a very important task in natural language processing (NLP). In this paper we present a semi-supervised approach to extract bilingual named entity, starting from a bilingual corpus where the named entities are extracted independently for each language. Then a bilingual co-training algorithm is used to improve the named entity annotation quality, and iterative process is applied to extract named entity pairs with higher bilingual conformity ratio. This leads to a significant improvement of the monolingual named entity annotation quality for both languages. Experimental result shows that the annotation quality of Chinese NE is improved from 87.17 to 88.28, and improved 80.37 to 81.76 of English NE in F-measure.

源语言英语
主期刊名Chinese Lexical Semantics - 14th Workshop, CLSW 2013, Revised Selected Papers
480-489
页数10
DOI
出版状态已出版 - 2013
活动14th Workshop on Chinese Lexical Semantics, CLSW 2013 - Zhengzhou, 中国
期限: 10 5月 201312 5月 2013

出版系列

姓名Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
8229 LNAI
ISSN(印刷版)0302-9743
ISSN(电子版)1611-3349

会议

会议14th Workshop on Chinese Lexical Semantics, CLSW 2013
国家/地区中国
Zhengzhou
时期10/05/1312/05/13

指纹

探究 'Named entity recognition based on bilingual co-training' 的科研主题。它们共同构成独一无二的指纹。

引用此