Cross-language information retrieval based on weight computation of query keywords translation

Xiao Fei Zhang*, He Yan Huang, Ke Liang Zhang

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

In cross-language information retrieval (CLIR), the query sentence is often combined with a series of query keywords, rather than a complete natural sentence. Lack of necessary contextual syntactic information in such a query sentence makes it impossible to achieve a unique translation of the query sentence with acceptable precision. In this paper, we convert the translation of query sentence to the weight computation of the translations of the query keyword based on large-scale bilingual parallel corpora, and thereafter reconstruct the query sentence in target language. The experimental results show that the approach achieves an average retrieval accuracy of 93.4% in the front 10 retrieval results and 89.1% in the front 100 retrieval results, while the retrieval error rate is reduced by 63.62% over the purely dictionary-based baseline.

Original languageEnglish
Title of host publicationProceedings - 2009 IEEE International Conference on Intelligent Computing and Intelligent Systems, ICIS 2009
Pages253-256
Number of pages4
DOIs
Publication statusPublished - 2009
Externally publishedYes
Event2009 IEEE International Conference on Intelligent Computing and Intelligent Systems, ICIS 2009 - Shanghai, China
Duration: 20 Nov 200922 Nov 2009

Publication series

NameProceedings - 2009 IEEE International Conference on Intelligent Computing and Intelligent Systems, ICIS 2009
Volume3

Conference

Conference2009 IEEE International Conference on Intelligent Computing and Intelligent Systems, ICIS 2009
Country/TerritoryChina
CityShanghai
Period20/11/0922/11/09

Keywords

  • CLIR
  • Query sentencee
  • Translation of query keyword
  • Weight computation

Fingerprint

Dive into the research topics of 'Cross-language information retrieval based on weight computation of query keywords translation'. Together they form a unique fingerprint.

Cite this