Query Expansion with Local Conceptual Word Embeddings in Microblog Retrieval

Yashen Wang, Heyan Huang, Chong Feng*

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

10 Citations (Scopus)

Abstract

Since the length of microblog texts, such as tweets, is strictly limited to 140 characters, traditional Information Retrieval techniques suffer from the vocabulary mismatch problem severely and cannot yield good performance in the context of microblogosphere. To address this critical challenge, in this paper, we focus on the use of local conceptual word embeddings for enhance microblog retrieval effectiveness. In particular, we propose a novel $k$k-Nearest Neighbor ($k$kNN) based Query Expansion (QE) algorithm to generate words from local word embeddings to expand the original query, which leads to better understanding of the information need. Besides, in order to further satisfy users' real-time information need, we incorporate temporal evidences into the expansion algorithm, which can boost recent tweets in the retrieval results with respect to a given topic. Experimental results on the official TREC Twitter corpora demonstrate the significant superiority of our approach over baseline methods.

Original languageEnglish
Article number8861105
Pages (from-to)1737-1749
Number of pages13
JournalIEEE Transactions on Knowledge and Data Engineering
Volume33
Issue number4
DOIs
Publication statusPublished - 1 Apr 2021

Keywords

  • Microblog retrieval
  • pseudo-relevance feedback
  • query expansion
  • word embeddings

Fingerprint

Dive into the research topics of 'Query Expansion with Local Conceptual Word Embeddings in Microblog Retrieval'. Together they form a unique fingerprint.

Cite this