Title recognition of maximal-length noun phrase based on bilingual co-training

Ye Gang Li, He Yan Huang*, Shu Min Shi, Ping Jian, Chao Su

*此作品的通讯作者

科研成果: 期刊稿件文章同行评审

1 引用 (Scopus)
Plum Print visual indicator of research metrics
  • Citations
    • Citation Indexes: 2
  • Captures
    • Readers: 3
see details

摘要

This article focuses on the problem of weak cross-domain ability on bilingual maximal-length noun phrase recognition. A bilingual noun phrase recognition algorithm based on semi-supervised learning is proposed. The approach can make full use of both the English features and the Chinese features in a unified framework, and it regards the two language corpus as different view of one dataset. Instances with the highest confidence score are selected and merged, and then added to the labeled data set to train the classifier. Experimental results on test sets show the effectiveness of the proposed approach which outperforms 4.52% over the baseline in cross-domain, and 3.08% over the baseline in similar domain.

源语言英语
页(从-至)1615-1625
页数11
期刊Ruan Jian Xue Bao/Journal of Software
26
7
DOI
出版状态已出版 - 1 7月 2015

指纹

探究 'Title recognition of maximal-length noun phrase based on bilingual co-training' 的科研主题。它们共同构成独一无二的指纹。

引用此

Li, Y. G., Huang, H. Y., Shi, S. M., Jian, P., & Su, C. (2015). Title recognition of maximal-length noun phrase based on bilingual co-training. Ruan Jian Xue Bao/Journal of Software, 26(7), 1615-1625. https://doi.org/10.13328/j.cnki.jos.004630