Abstract
Drawing on large-scale bilingual corpora, the vector space model, and lexical mutual information, this paper applies traditional monolingual IR technology to cross-language query translation. Rather than translating the query sentence directly, it computes a boost value for each translation of a query keyword found in the bilingual dictionary, and uses these boost values to reconstruct the query sentence in the target language. In the experiment, precision is 92.8% over the first 10 retrieved documents and 88.9% over the first 100 retrieved documents.
Original language | English |
---|---|
Pages | 467-470 |
Number of pages | 4 |
Publication status | Published - 2006 |
Externally published | Yes |
Event | 20th Pacific Asia Conference on Language, Information and Computation, PACLIC 20 - Wuhan, China; Duration: 1 Nov 2006 → 3 Nov 2006 |
Conference
Conference | 20th Pacific Asia Conference on Language, Information and Computation, PACLIC 20 |
---|---|
Country/Territory | China |
City | Wuhan |
Period | 1 Nov 2006 → 3 Nov 2006 |
Keywords
- Cross-language information retrieval
- Query sentence
- Translation & transform algorithm