摘要
To overcome the problem that the confusion between texts limits the precision in text retrieval, a new text retrieval algorithm that decrease confusion (DCTR) is proposed. The algorithm constructs the searching template to represent the user's searching intention through positive and negative training. By using the prior probabilities in the template, the supported probability and anti-supported probability of each text in the text library can be estimated for discrimination. The searching result can be ranked according to similarities between retrieved texts and the template. The complexity of DCTR is close to term frequency and mversed document frequency (TF-IDF). Its distinguishing ability to confusable texts could be advanced and the performance of the result would be improved with increasing of training times.
| 源语言 | 英语 |
|---|---|
| 页(从-至) | 108-116 |
| 页数 | 9 |
| 期刊 | Journal of Beijing Institute of Technology (English Edition) |
| 卷 | 23 |
| 期 | 1 |
| 出版状态 | 已出版 - 3月 2014 |
指纹
探究 'Text retrieval algorithm that decreases confusion' 的科研主题。它们共同构成独一无二的指纹。引用此
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver