A CRF method of identifying prepositional phrases in Chinese patent texts

Hongzheng Li, Yaohong Jin

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

This paper presents a Conditional Random Field (CRF) method of identifying prepositional phrases (PP) in Chinese patent documents. By using the CRF model, the identification process can be recognized as sequence labelling issue. After analyzing the characteristics of PP chunks in large scale corpus, we design several essential and helpful features and feature templates for recognizing PP chunks, and then use a CRF toolkit to train the model to identify PPs. At last, some experiments are conducted to justify the effects of the model, both the precision and recall rates are over 92%, higher than the baseline, indicating the method is reasonable and effective.

源语言英语
主期刊名Proceedings of the 8th SIGHAN Workshop on Chinese Language Processing, SIGHAN 2015 - co-located with 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing, ACL IJCNLP 2015
编辑Liang-Chih Yu, Zhifang Sui, Yue Zhang, Vincent Ng
出版商Association for Computational Linguistics (ACL)
86-90
页数5
ISBN(电子版)9781941643570
出版状态已出版 - 2015
已对外发布
活动8th SIGHAN Workshop on Chinese Language Processing, SIGHAN 2015 - Beijing, 中国
期限: 30 7月 201531 7月 2015

出版系列

姓名Proceedings of the 8th SIGHAN Workshop on Chinese Language Processing, SIGHAN 2015 - co-located with 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing, ACL IJCNLP 2015

会议

会议8th SIGHAN Workshop on Chinese Language Processing, SIGHAN 2015
国家/地区中国
Beijing
时期30/07/1531/07/15

指纹

探究 'A CRF method of identifying prepositional phrases in Chinese patent texts' 的科研主题。它们共同构成独一无二的指纹。

引用此