Hierarchical domain assignment based on word-gloss

Chaoyong Zhu*, Heyan Huang, Shumin Shi

*Corresponding author for this work

Research output: Contribution to journal › Article › peer-review

1 Citation (Scopus)

Abstract

This paper proposes a hierarchical word domain assignment algorithm that automatically builds domain dictionaries from a Machine-Readable Dictionary (MRD). Word domain assignment proceeds in three steps: 1) constructing the hierarchical structure; 2) training a classifier; 3) assigning domains to words. Compared with traditional methods, the hierarchical algorithm improves the accuracy of word domain assignment while reducing the human effort spent on collecting corpora. Experiments on WordNet 2.0 show that, with gloss-based word domain assignment, 62.53% of the first domain labels match those in WordNet Domains 3.0, and that performance improves further when the hierarchical relationships among the domain sets are exploited.
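The three steps named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not specify the classifier, the features, or the domain hierarchy. So the toy hierarchy, the example glosses, the add-one-smoothed word-count scorer (naive Bayes style, uniform priors), and all function names below are assumptions made purely for illustration.

```python
import math
from collections import Counter

# Step 1 (assumed toy data): a domain hierarchy as parent -> children.
HIERARCHY = {
    "root": ["science", "sport"],
    "science": ["medicine", "physics"],
    "sport": ["football", "tennis"],
}

# Hypothetical labelled glosses: (gloss text, leaf domain).
TRAIN = [
    ("a doctor treats disease in a hospital", "medicine"),
    ("drug used to treat illness of a patient", "medicine"),
    ("energy and force acting on a particle", "physics"),
    ("the mass and velocity of a particle", "physics"),
    ("a game where players kick a ball into a goal", "football"),
    ("a player who defends the goal in a match", "football"),
    ("a game played with a racket and a ball over a net", "tennis"),
    ("a serve that the opponent's racket cannot reach", "tennis"),
]


def tokenize(text):
    """Lowercase whitespace tokenizer (a deliberate simplification)."""
    return text.lower().split()


def leaves_under(node):
    """Return the set of leaf domains below (or equal to) a node."""
    children = HIERARCHY.get(node)
    if not children:
        return {node}
    leaves = set()
    for child in children:
        leaves |= leaves_under(child)
    return leaves


def train(examples):
    """Step 2: build one word-count model per child of each internal node."""
    vocab = {tok for gloss, _ in examples for tok in tokenize(gloss)}
    models = {}
    for parent, children in HIERARCHY.items():
        counts = {child: Counter() for child in children}
        for gloss, leaf in examples:
            for child in children:
                if leaf in leaves_under(child):
                    counts[child].update(tokenize(gloss))
        models[parent] = counts
    return models, len(vocab)


def score(tokens, counts, vocab_size):
    """Log-likelihood of the tokens under add-one-smoothed word counts."""
    total = sum(counts.values())
    return sum(math.log((counts[t] + 1) / (total + vocab_size)) for t in tokens)


def assign(gloss, models, vocab_size):
    """Step 3: walk down from the root, picking the best child per level."""
    tokens = tokenize(gloss)
    node, path = "root", []
    while node in HIERARCHY:
        child_models = models[node]
        node = max(child_models,
                   key=lambda c: score(tokens, child_models[c], vocab_size))
        path.append(node)
    return path


models, vocab_size = train(TRAIN)
print(assign("doctor treats a patient in a hospital", models, vocab_size))
```

The top-down walk means a gloss is only compared against sibling domains at each level, which is one simple way of using the hierarchical relationships among domains; whether the paper uses this particular strategy is not stated in the abstract.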

Original language: English
Pages (from-to): 19-27
Number of pages: 9
Journal: China Communications
Volume: 9
Issue number: 3
Publication status: Published - 15 Mar 2012

Keywords

  • Domain assignment
  • Domain dictionary
  • Hierarchical classification
  • MRD
  • Natural language processing
  • WordNet
