Automatic identifying of maximal length noun phrase

Yegang Li, Heyan Huang

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

The automatic recognition of the maximal-length noun phrase (MNP) helps to the shallow parsing. In this paper, automatic labeling of Chinese MNP is regarded as a sequential labeling task and Support Vector Machine model (SVM) is employed in the model. We propose a method which takes 2-phase hybrid approach which first identifies base chunk and then identifies MNP. Furthermore, the base chunk features can be exploited to improve performance of MNP recognition. In addition, both left-right and right-left sequential labeling were employed to identify Chinese MNP by bidirectional sequence labeling merging. The data set in the experiments is selected from Penn Chinese Treebank 5.0 Corpus, and split into train set, development set and test set according to the proportion of 4:4:1. Experimental result shows a high quality performance of 90.13% in F1-measure.

源语言英语
主期刊名Proceedings - 2012 IEEE 2nd International Conference on Cloud Computing and Intelligence Systems, IEEE CCIS 2012
1445-1448
页数4
DOI
出版状态已出版 - 13 11月 2013
活动2012 2nd IEEE International Conference on Cloud Computing and Intelligence Systems, IEEE CCIS 2012 - Hangzhou, 中国
期限: 30 10月 20121 11月 2012

出版系列

姓名Proceedings - 2012 IEEE 2nd International Conference on Cloud Computing and Intelligence Systems, IEEE CCIS 2012
3

会议

会议2012 2nd IEEE International Conference on Cloud Computing and Intelligence Systems, IEEE CCIS 2012
国家/地区中国
Hangzhou
时期30/10/121/11/12

指纹

探究 'Automatic identifying of maximal length noun phrase' 的科研主题。它们共同构成独一无二的指纹。

引用此