Identifying prepositional phrases in Chinese patent texts with rule-based and CRF methods

Hongzheng Li, Yaohong Jin

Research output: Contribution to conferencePaperpeer-review

Abstract

Identification of prepositional phrases (PP) has been an issue in the field of Natural Language Processing (NLP). In this paper, towards Chinese patent texts, we present a rule-based method and a CRF-based method to identify the PPs. In the rule-based method, according to the special features and expressions of PPs, we manually write targeted formal identification rules; in the CRF approach, after labelling the sentences with features, a typical CRF toolkit is exploited to train the model for identifying PPs. We then conduct some experiments to test the performance of the two methods, and final precision rates are over 90%, indicating the proposed methods are effective and feasible.

Original languageEnglish
Pages143-149
Number of pages7
Publication statusPublished - 2015
Externally publishedYes
Event29th Pacific Asia Conference on Language, Information and Computation, PACLIC 2015 - Shanghai, China
Duration: 30 Oct 20151 Nov 2015

Conference

Conference29th Pacific Asia Conference on Language, Information and Computation, PACLIC 2015
Country/TerritoryChina
CityShanghai
Period30/10/151/11/15

Fingerprint

Dive into the research topics of 'Identifying prepositional phrases in Chinese patent texts with rule-based and CRF methods'. Together they form a unique fingerprint.

Cite this