跳到主要导航 跳到搜索 跳到主要内容

A domain-specific chinese term extraction method based on prefix and suffix

  • Dongmei Li*
  • , Qinglin Wang
  • , Yuan Li
  • , Qian Peng
  • *此作品的通讯作者
  • Beijing Institute of Technology

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

The term recognition and extraction is the foundation of text information processing. This paper presents a domain-specific Chinese term extraction method based on prefix and suffix. Firstly, the commonly used prefix and suffix are extracted from a given set of seed terms. Secondly, we segment the testing corpus to obtain statistics of words which are next to the prefixes and suffixes. And then, we judge whether a word and a prefix/suffix is a candidate term according to frequency information of the word. Thirdly, we enlarge initial candidate term set by frequency judgment. Finally we filter candidate terms by co-occurrence analysis. Experiment shows that terms with common prefixes and suffixes can be well extracted.

源语言英语
主期刊名Proceedings - 2012 International Conference on Computer Science and Service System, CSSS 2012
1356-1359
页数4
DOI
出版状态已出版 - 2012
活动2012 International Conference on Computer Science and Service System, CSSS 2012 - Nanjing, 中国
期限: 11 8月 201213 8月 2012

出版系列

姓名Proceedings - 2012 International Conference on Computer Science and Service System, CSSS 2012

会议

会议2012 International Conference on Computer Science and Service System, CSSS 2012
国家/地区中国
Nanjing
时期11/08/1213/08/12

指纹

探究 'A domain-specific chinese term extraction method based on prefix and suffix' 的科研主题。它们共同构成独一无二的指纹。

引用此