TY - JOUR
T1 - Feature words selection for knowledge-based word sense disambiguation with syntactic parsing
AU - Lu, Wenpeng
AU - Huang, Heyan
AU - Zhu, Chaoyong
PY - 2012
Y1 - 2012
N2 - Feature words are crucial clues for word sense disambiguation. There are two methods to select feature words: window-based and dependency-based methods. Both of them have some shortcomings, such as irrelevant noise words or paucity of feature words. In order to solve the problems of the existing methods, this paper proposes two methods to select feature words with syntactic parsing, which are based on phrase structure parsing tree (PTree) and dependency parsing tree (DTree). With the help of syntactic parsing, the proposed methods can select feature words more accurately, which can alleviate the effect of noise words of window-based method and can avoid the paucity of feature words of dependency-based method. Evaluation is performed on a knowledge-based WSD system with a publicly available lexical sample dataset. The results show that both of the proposed methods are superior to window-based and dependency-based methods, and the method based on PTree is better than the method based on DTree. Both of them are preferred strategies to select feature words to disambiguate ambiguous words.
AB - Feature words are crucial clues for word sense disambiguation. There are two methods to select feature words: window-based and dependency-based methods. Both of them have some shortcomings, such as irrelevant noise words or paucity of feature words. In order to solve the problems of the existing methods, this paper proposes two methods to select feature words with syntactic parsing, which are based on phrase structure parsing tree (PTree) and dependency parsing tree (DTree). With the help of syntactic parsing, the proposed methods can select feature words more accurately, which can alleviate the effect of noise words of window-based method and can avoid the paucity of feature words of dependency-based method. Evaluation is performed on a knowledge-based WSD system with a publicly available lexical sample dataset. The results show that both of the proposed methods are superior to window-based and dependency-based methods, and the method based on PTree is better than the method based on DTree. Both of them are preferred strategies to select feature words to disambiguate ambiguous words.
KW - Dependency parsing
KW - Parsing tree
KW - Phrase structure parsing
KW - Word sense disambiguation
UR - http://www.scopus.com/inward/record.url?scp=84855260614&partnerID=8YFLogxK
M3 - Article
AN - SCOPUS:84855260614
SN - 0033-2097
VL - 88
SP - 82
EP - 87
JO - Przeglad Elektrotechniczny
JF - Przeglad Elektrotechniczny
IS - 1 B
ER -