TY - GEN
T1 - An English-Chinese cross-lingual word semantic similarity measure exploring attributes and relations
AU - Dai, Lin
AU - Huang, Heyan
PY - 2011
Y1 - 2011
N2 - Word semantic similarity measuring is a fundamental issue to many NLP applications and the globalization has made an urgent request for cross-lingual word similarity measure. This paper proposed a word semantic similarity measure which is able to work in cross-lingual scenarios. Basically, a concept can be defined by a set of attributes. The basic idea of this work is to compute the similarity between words by exploring their attributes and relations. For a given word pair, we first compute similarities between their attributes by combining distance, depth and relation information. Then word similarity are computed through a combination scheme. The algorithm is implemented based on an English-Chinese bilingual ontology HowNet. Experiments show that the proposed algorithm results in high correlation against human judgments, which encourages its broad application in cross-lingual applications.
AB - Word semantic similarity measuring is a fundamental issue to many NLP applications and the globalization has made an urgent request for cross-lingual word similarity measure. This paper proposed a word semantic similarity measure which is able to work in cross-lingual scenarios. Basically, a concept can be defined by a set of attributes. The basic idea of this work is to compute the similarity between words by exploring their attributes and relations. For a given word pair, we first compute similarities between their attributes by combining distance, depth and relation information. Then word similarity are computed through a combination scheme. The algorithm is implemented based on an English-Chinese bilingual ontology HowNet. Experiments show that the proposed algorithm results in high correlation against human judgments, which encourages its broad application in cross-lingual applications.
KW - Computing linguistics
KW - Cross-lingual
KW - Natural language processing
KW - Word semantic similarity
UR - http://www.scopus.com/inward/record.url?scp=84863870551&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:84863870551
SN - 9784905166023
T3 - PACLIC 25 - Proceedings of the 25th Pacific Asia Conference on Language, Information and Computation
SP - 467
EP - 476
BT - PACLIC 25 - Proceedings of the 25th Pacific Asia Conference on Language, Information and Computation
T2 - 25th Pacific Asia Conference on Language, Information and Computation, PACLIC 25
Y2 - 16 December 2011 through 18 December 2011
ER -