A Tree-Based Indexing Approach for Diverse Textual Similarity Search

Minghe Yu*, Chengliang Chai, Ge Yu

*此作品的通讯作者

科研成果: 期刊稿件文章同行评审

4 引用 (Scopus)
Plum Print visual indicator of research metrics
  • Citations
    • Citation Indexes: 4
  • Captures
    • Readers: 7
see details

摘要

Textual information is ubiquitous in our lives and is becoming an important component of our cognitive society. In the age of big data, we consistently need to traverse substantial amounts of data even to find a little information. To quickly acquire effective information, it is necessary to implement a textual similarity search based on an appropriate index structure to efficiently find results. In this article, we study top-k textual similarity search and develop a tree-based indexing approach that can construct indices to support various similarity functions. Our indexing approach clusters similar records in the same branch offline to improve the performance of online search. Based on the index tree, we present a top-k search algorithm with efficient pruning techniques. The experimental results demonstrate that our algorithm can achieve higher performance and better scalability than the baseline method.

源语言英语
文章编号9187345
页(从-至)8866-8876
页数11
期刊IEEE Access
9
DOI
出版状态已出版 - 2021
已对外发布

指纹

探究 'A Tree-Based Indexing Approach for Diverse Textual Similarity Search' 的科研主题。它们共同构成独一无二的指纹。

引用此

Yu, M., Chai, C., & Yu, G. (2021). A Tree-Based Indexing Approach for Diverse Textual Similarity Search. IEEE Access, 9, 8866-8876. 文章 9187345. https://doi.org/10.1109/ACCESS.2020.3022057