Learning to rank microblog posts for real-time AD-HOC search

Jing Li*, Zhongyu Wei, Hao Wei, Kangfei Zhao, Junwen Chen, Kam Fai Wong

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

3 Citations (Scopus)

Abstract

Microblogging websites have emerged to the center of information production and diffusion, on which people can get useful information from other users’ microblog posts. In the era of Big Data, we are overwhelmed by the large amount of microblog posts. To make good use of these informative data, an effective search tool is required specialized for microblog posts. However, it is not trivial to do microblog search due to the following reasons: 1) microblog posts are noisy and time-sensitive rendering general information retrieval models ineffective. 2) Conventional IR models are not designed to consider microblog-specific features. In this paper, we propose to utilize learning to rank model for microblog search. We combine content-based, microblog-specific and temporal features into learning to rank models, which are found to model microblog posts effectively. To study the performance of learning to rank models, we evaluate our models using tweet data set provided by TERC 2011 and TREC 2012 microblogs track with the comparison of three stateof-the-art information retrieval baselines, vector space model, language model, BM25 model. Extensive experimental studies demonstrate the effectiveness of learning to rank models and the usefulness to integrate microblog-specific and temporal information for microblog search task.

Original languageEnglish
Title of host publicationNatural Language Processing and Chinese Computing - 4th CCF Conference, NLPCC 2015, Proceedings
EditorsHeng Ji, Dongyan Zhao, Yansong Feng, Juanzi Li
PublisherSpringer Verlag
Pages436-443
Number of pages8
ISBN (Print)9783319252063
DOIs
Publication statusPublished - 2015
Externally publishedYes
Event4th CCF Conference on Natural Language Processing and Chinese Computing, NLPCC 2015 - Nanchang, China
Duration: 9 Oct 201513 Oct 2015

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume9362
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference4th CCF Conference on Natural Language Processing and Chinese Computing, NLPCC 2015
Country/TerritoryChina
CityNanchang
Period9/10/1513/10/15

Keywords

  • Experimental study
  • Information retrieval
  • Microblog search
  • Microblogging analysis
  • Online social network

Fingerprint

Dive into the research topics of 'Learning to rank microblog posts for real-time AD-HOC search'. Together they form a unique fingerprint.

Cite this