Skip to main navigation Skip to search Skip to main content

Text retrieval algorithm that decreases confusion

  • Yun Chen Jiang
  • , Sen Lin Luo
  • , Lei Han
  • , Li Min Pan*
  • *Corresponding author for this work
  • Beijing Institute of Technology

Research output: Contribution to journalArticlepeer-review

Abstract

To overcome the problem that the confusion between texts limits the precision in text retrieval, a new text retrieval algorithm that decrease confusion (DCTR) is proposed. The algorithm constructs the searching template to represent the user's searching intention through positive and negative training. By using the prior probabilities in the template, the supported probability and anti-supported probability of each text in the text library can be estimated for discrimination. The searching result can be ranked according to similarities between retrieved texts and the template. The complexity of DCTR is close to term frequency and mversed document frequency (TF-IDF). Its distinguishing ability to confusable texts could be advanced and the performance of the result would be improved with increasing of training times.

Original languageEnglish
Pages (from-to)108-116
Number of pages9
JournalJournal of Beijing Institute of Technology (English Edition)
Volume23
Issue number1
Publication statusPublished - Mar 2014

Keywords

  • Confusable text
  • Positive and negative training
  • Supported probability
  • Text retrieval

Fingerprint

Dive into the research topics of 'Text retrieval algorithm that decreases confusion'. Together they form a unique fingerprint.

Cite this