TY - GEN
T1 - Extracting Chinese multi-word terms from small corpus
AU - Zhou, Lang
AU - Zhang, Liang
AU - Feng, Chong
AU - Huang, Heyan
PY - 2008
Y1 - 2008
N2 - In this paper, we present an automatic terminology extraction approach for Chinese multi-word terms. In this term extraction system, besides five linguistic rides acqidred from an available term list by some machine learning methods, two statistical strategies are involved: a termhood measure based on the term distribution variation, and a unithood measure adopting the left and right entropy method to estimate the collocation variation degree. The candidates are ranked according to the values of the former. The latter is used to filter the preposition phrases and some verb-object phrases that rarely appear as terms. By validating on a small scale corpus in the computer domain, the precision reaches 91.5% of the top 2000 outputs.
AB - In this paper, we present an automatic terminology extraction approach for Chinese multi-word terms. In this term extraction system, besides five linguistic rides acqidred from an available term list by some machine learning methods, two statistical strategies are involved: a termhood measure based on the term distribution variation, and a unithood measure adopting the left and right entropy method to estimate the collocation variation degree. The candidates are ranked according to the values of the former. The latter is used to filter the preposition phrases and some verb-object phrases that rarely appear as terms. By validating on a small scale corpus in the computer domain, the precision reaches 91.5% of the top 2000 outputs.
UR - http://www.scopus.com/inward/record.url?scp=60349113773&partnerID=8YFLogxK
U2 - 10.1109/ISKE.2008.4731041
DO - 10.1109/ISKE.2008.4731041
M3 - Conference contribution
AN - SCOPUS:60349113773
SN - 9781424421978
T3 - Proceedings of 2008 3rd International Conference on Intelligent System and Knowledge Engineering, ISKE 2008
SP - 813
EP - 818
BT - Proceedings of 2008 3rd International Conference on Intelligent System and Knowledge Engineering, ISKE 2008
T2 - Proceedings of 2008 3rd International Conference on Intelligent System and Knowledge Engineering, ISKE 2008
Y2 - 17 November 2008 through 19 November 2008
ER -