
Chinese term extraction based on PAT tree

  • Feng Zhang*
  • Xiao Zhong Fan
  • Yun Xu
  • *Corresponding author for this work
  • Beijing Institute of Technology

Research output: Contribution to journal › Article › peer-review

Abstract

A method for automatic Chinese term extraction based on the Patricia (PAT) tree is proposed. Mutual information is calculated by prefix searching in a PAT tree built from a domain corpus, and is used to estimate the internal associative strength between the Chinese characters in a string. Compared with methods that work on the domain corpus directly, this greatly speeds up the extraction of term candidates. To improve the precision of term extraction, banks of common collocation suffixes and prefixes are constructed and the part-of-speech (POS) composition rules of terms are summarized. Experimental results show an F-measure of 74.97%.
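The abstract does not give the mutual information formula or the tree layout, so the following is only a minimal sketch under stated assumptions. It assumes pointwise mutual information between two adjacent substrings, log2(P(xy) / (P(x) P(y))), with probabilities estimated as counts divided by corpus length. It also replaces the PAT tree with a plain character trie over truncated corpus suffixes, which supports the same prefix-count lookup but is not compressed. The names `Node`, `build_suffix_trie`, `prefix_count`, `mutual_information`, and the `max_len` parameter are hypothetical and do not come from the paper.

```python
import math


class Node:
    """Trie node: how many corpus suffixes pass through it, plus child links."""
    __slots__ = ("count", "children")

    def __init__(self):
        self.count = 0
        self.children = {}


def build_suffix_trie(text, max_len=4):
    """Insert every suffix of `text`, truncated to `max_len` characters.

    Simplified stand-in for a PAT tree (assumption): a node's count equals
    the number of occurrences in `text` of the string spelled by its path.
    """
    root = Node()
    for start in range(len(text)):
        node = root
        for ch in text[start:start + max_len]:
            node = node.children.setdefault(ch, Node())
            node.count += 1
    return root


def prefix_count(root, s):
    """Frequency of `s` in the corpus, found by one prefix walk in the trie."""
    node = root
    for ch in s:
        node = node.children.get(ch)
        if node is None:
            return 0
    return node.count


def mutual_information(root, total, left, right):
    """Pointwise mutual information of `left` followed by `right` (assumed formula)."""
    joint = prefix_count(root, left + right)
    c_left = prefix_count(root, left)
    c_right = prefix_count(root, right)
    if joint == 0 or c_left == 0 or c_right == 0:
        return float("-inf")  # the pair never occurs in the corpus
    return math.log2(joint * total / (c_left * c_right))


# Toy corpus, invented for illustration only.
text = "信息检索信息抽取信息"
root = build_suffix_trie(text)
score = mutual_information(root, len(text), "信", "息")
```

Each frequency lookup costs time proportional to the length of the query string rather than to the size of the corpus, which is the kind of saving the abstract attributes to prefix searching in the tree.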

Original language: English
Pages (from-to): 162-166
Number of pages: 5
Journal: Journal of Beijing Institute of Technology (English Edition)
Volume: 15
Issue number: 2
Publication status: Published - Jun 2006
