Abstract
Identification of prepositional phrases (PP) has been an issue in the field of Natural Language Processing (NLP). In this paper, towards Chinese patent texts, we present a rule-based method and a CRF-based method to identify the PPs. In the rule-based method, according to the special features and expressions of PPs, we manually write targeted formal identification rules; in the CRF approach, after labelling the sentences with features, a typical CRF toolkit is exploited to train the model for identifying PPs. We then conduct some experiments to test the performance of the two methods, and final precision rates are over 90%, indicating the proposed methods are effective and feasible.
Original language | English |
---|---|
Pages | 143-149 |
Number of pages | 7 |
Publication status | Published - 2015 |
Externally published | Yes |
Event | 29th Pacific Asia Conference on Language, Information and Computation, PACLIC 2015 - Shanghai, China Duration: 30 Oct 2015 → 1 Nov 2015 |
Conference
Conference | 29th Pacific Asia Conference on Language, Information and Computation, PACLIC 2015 |
---|---|
Country/Territory | China |
City | Shanghai |
Period | 30/10/15 → 1/11/15 |