TY - GEN
T1 - A domain-specific chinese term extraction method based on prefix and suffix
AU - Li, Dongmei
AU - Wang, Qinglin
AU - Li, Yuan
AU - Peng, Qian
PY - 2012
Y1 - 2012
N2 - The term recognition and extraction is the foundation of text information processing. This paper presents a domain-specific Chinese term extraction method based on prefix and suffix. Firstly, the commonly used prefix and suffix are extracted from a given set of seed terms. Secondly, we segment the testing corpus to obtain statistics of words which are next to the prefixes and suffixes. And then, we judge whether a word and a prefix/suffix is a candidate term according to frequency information of the word. Thirdly, we enlarge initial candidate term set by frequency judgment. Finally we filter candidate terms by co-occurrence analysis. Experiment shows that terms with common prefixes and suffixes can be well extracted.
AB - The term recognition and extraction is the foundation of text information processing. This paper presents a domain-specific Chinese term extraction method based on prefix and suffix. Firstly, the commonly used prefix and suffix are extracted from a given set of seed terms. Secondly, we segment the testing corpus to obtain statistics of words which are next to the prefixes and suffixes. And then, we judge whether a word and a prefix/suffix is a candidate term according to frequency information of the word. Thirdly, we enlarge initial candidate term set by frequency judgment. Finally we filter candidate terms by co-occurrence analysis. Experiment shows that terms with common prefixes and suffixes can be well extracted.
KW - co-occurrence analysis
KW - domain-specific term
KW - term extraction
KW - term recognition
UR - http://www.scopus.com/inward/record.url?scp=84873849541&partnerID=8YFLogxK
U2 - 10.1109/CSSS.2012.342
DO - 10.1109/CSSS.2012.342
M3 - Conference contribution
AN - SCOPUS:84873849541
SN - 9780769547190
T3 - Proceedings - 2012 International Conference on Computer Science and Service System, CSSS 2012
SP - 1356
EP - 1359
BT - Proceedings - 2012 International Conference on Computer Science and Service System, CSSS 2012
T2 - 2012 International Conference on Computer Science and Service System, CSSS 2012
Y2 - 11 August 2012 through 13 August 2012
ER -