Identifying prepositional phrases in Chinese patent texts with rule-based and CRF methods

Hongzheng Li; Yaohong Jin

Beijing Normal University

Research output: Contribution to conference › Paper › peer-review

Abstract

Identification of prepositional phrases (PPs) has been an issue in the field of Natural Language Processing (NLP). This paper presents two methods for identifying PPs in Chinese patent texts: a rule-based method and a method based on conditional random fields (CRF). In the rule-based method, we manually write targeted formal identification rules based on the characteristic features and expressions of PPs. In the CRF-based method, we label the sentences with features and then use a typical CRF toolkit to train a model for identifying PPs. Experiments testing both methods yield final precision rates above 90%, indicating that the proposed methods are effective and feasible.
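
As a rough illustration of the "labelling the sentences with features" step, the sketch below turns a tokenized sentence into the one-token-per-line column format that many CRF toolkits accept. The abstract does not name the toolkit, the feature set, or the tag scheme, so the B-PP/I-PP/O tags, the part-of-speech column, the example sentence, and the function names `bio_tags` and `to_crf_columns` are all assumptions made for illustration, not the authors' setup.

```python
from typing import List, Tuple


def bio_tags(n_tokens: int, pp_spans: List[Tuple[int, int]]) -> List[str]:
    """Return B-PP / I-PP / O tags; each span is a [start, end) token range."""
    tags = ["O"] * n_tokens
    for start, end in pp_spans:
        if not 0 <= start < end <= n_tokens:
            raise ValueError(f"invalid span: {(start, end)}")
        tags[start] = "B-PP"          # first token of the phrase
        for i in range(start + 1, end):
            tags[i] = "I-PP"          # remaining tokens inside the phrase
    return tags


def to_crf_columns(tokens: List[str], pos: List[str],
                   pp_spans: List[Tuple[int, int]]) -> str:
    """Format one sentence as tab-separated columns: token, POS, label."""
    if len(tokens) != len(pos):
        raise ValueError("tokens and pos must have the same length")
    tags = bio_tags(len(tokens), pp_spans)
    return "\n".join(f"{tok}\t{p}\t{tag}" for tok, p, tag in zip(tokens, pos, tags))


# Hypothetical example sentence (not from the paper); POS tags are illustrative.
tokens = ["在", "本", "发明", "中", "，", "装置", "包括", "外壳"]
pos = ["P", "DT", "NN", "LC", "PU", "NN", "VV", "NN"]

# Assume the annotated PP covers the first four tokens.
print(to_crf_columns(tokens, pos, [(0, 4)]))
```

A rule-based identifier could emit spans in the same `(start, end)` form, which would let both methods be scored against the same gold annotations.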

Original language	English
Pages	143-149
Number of pages	7
Publication status	Published - 2015
Externally published	Yes
Event	29th Pacific Asia Conference on Language, Information and Computation, PACLIC 2015 - Shanghai, China
Duration	30 Oct 2015 → 1 Nov 2015

Conference

Conference	29th Pacific Asia Conference on Language, Information and Computation, PACLIC 2015
Country/Territory	China
City	Shanghai
Period	30/10/15 → 1/11/15

Cite this

@conference{198df2cc4cc2470ca06407443e3609a3,

title = "Identifying prepositional phrases in Chinese patent texts with rule-based and CRF methods",

author = "Hongzheng Li and Yaohong Jin",

year = "2015",

language = "English",

pages = "143--149",

note = "29th Pacific Asia Conference on Language, Information and Computation, PACLIC 2015 ; Conference date: 30-10-2015 Through 01-11-2015",

}

TY - CONF

T1 - Identifying prepositional phrases in Chinese patent texts with rule-based and CRF methods

AU - Li, Hongzheng

AU - Jin, Yaohong

PY - 2015

Y1 - 2015

UR - http://www.scopus.com/inward/record.url?scp=84967211868&partnerID=8YFLogxK

M3 - Paper

AN - SCOPUS:84967211868

SP - 143

EP - 149

T2 - 29th Pacific Asia Conference on Language, Information and Computation, PACLIC 2015

Y2 - 30 October 2015 through 1 November 2015

ER -
