A cross language text categorization algorithm from the perspective of information retrieval

Yue Liu*, Ming Tian, Weitao Zhou, Lin Dai

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

4 Citations (Scopus)

Abstract

In this paper, we propose a novel method that performs Cross Language Text Categorization (CLTC) from the perspective of Information Retrieval. We present an input document in target language in the form of a query in source language. Then we retrieve the training documents in source language and find K most relevant results. At last, we use the class labels of the K results to predict the class of the input document. The only external resource required by our method is a bilingual dictionary. Experimental results show that our method gives promising performance, which is better than translation-based method.

Original languageEnglish
Title of host publicationProceedings of the 2012 International Conference on Industrial Control and Electronics Engineering, ICICEE 2012
Pages254-257
Number of pages4
DOIs
Publication statusPublished - 2012
Event2012 International Conference on Industrial Control and Electronics Engineering, ICICEE 2012 - Xi'an, China
Duration: 23 Aug 201225 Aug 2012

Publication series

NameProceedings of the 2012 International Conference on Industrial Control and Electronics Engineering, ICICEE 2012

Conference

Conference2012 International Conference on Industrial Control and Electronics Engineering, ICICEE 2012
Country/TerritoryChina
CityXi'an
Period23/08/1225/08/12

Keywords

  • Cross Language Text Categorization
  • Information Retrieval
  • Text Categorization

Fingerprint

Dive into the research topics of 'A cross language text categorization algorithm from the perspective of information retrieval'. Together they form a unique fingerprint.

Cite this