Leveraging Passage-level Cumulative Gain for Document Ranking

Zhijing Wu, Jiaxin Mao, Yiqun Liu, Jingtao Zhan, Yukun Zheng, Min Zhang, Shaoping Ma

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

37 Citations (Scopus)

Abstract

Document ranking is one of the most studied but challenging problems in information retrieval (IR) research. A number of existing document ranking models capture relevance signals at the whole document level. Recently, more and more research has begun to address this problem from fine-grained document modeling. Several works leveraged fine-grained passage-level relevance signals in ranking models. However, most of these works focus on context-independent passage-level relevance signals and ignore the context information, which may lead to inaccurate estimation of passage-level relevance. In this paper, we investigate how information gain accumulates with passages when users sequentially read a document. We propose the context-aware Passage-level Cumulative Gain (PCG), which aggregates relevance scores of passages and avoids the need to formally split a document into independent passages. Next, we incorporate the patterns of PCG into a BERT-based sequential model called Passage-level Cumulative Gain Model (PCGM) to predict the PCG sequence. Finally, we apply PCGM to the document ranking task. Experimental results on two public ad hoc retrieval benchmark datasets show that PCGM outperforms most existing ranking models and also indicates the effectiveness of PCG signals. We believe that this work contributes to improving ranking performance and providing more explainability for document ranking.

Original languageEnglish
Title of host publicationThe Web Conference 2020 - Proceedings of the World Wide Web Conference, WWW 2020
PublisherAssociation for Computing Machinery, Inc
Pages2421-2431
Number of pages11
ISBN (Electronic)9781450370233
DOIs
Publication statusPublished - 20 Apr 2020
Externally publishedYes
Event29th International World Wide Web Conference, WWW 2020 - Taipei, Taiwan, Province of China
Duration: 20 Apr 202024 Apr 2020

Publication series

NameThe Web Conference 2020 - Proceedings of the World Wide Web Conference, WWW 2020

Conference

Conference29th International World Wide Web Conference, WWW 2020
Country/TerritoryTaiwan, Province of China
CityTaipei
Period20/04/2024/04/20

Keywords

  • Passage-level cumulative gain
  • document ranking
  • neural network

Fingerprint

Dive into the research topics of 'Leveraging Passage-level Cumulative Gain for Document Ranking'. Together they form a unique fingerprint.

Cite this