Document boltzmann machines for information retrieval

  • Qian Yu
  • , Peng Zhang*
  • , Yuexian Hou
  • , Dawei Song
  • , Jun Wang
  • *Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Probabilistic language modelling has been widely used in information retrieval. It estimates document models under the multinomial distribution assumption, and uses query likelihood to rank documents. In this paper, we aim to generalize this distribution assumption by exploring the use of fully-observable Boltzmann Machines (BMs) for document modelling. BM is a stochastic recurrent network and is able to model the distribution of multi-dimensional variables. It yields a kind of Boltzmann distribution which is more general than multinomial distribution. We propose a Document Boltzmann Machine (DBM) that can naturally capture the intrinsic connections among terms and estimate query likelihood efficiently. We formally prove that under certain conditions (with 1-order parameters learnt only), DBM subsumes the traditional document language model. Its relations to other graphical models in IR, e.g., MRF model, are also discussed. Our experiments on the document reranking demonstrate the potential of the proposed DBM.

Original languageEnglish
Title of host publicationAdvances in Information Retrieval - 37th European Conference on IR Research, ECIR 2015, Proceedings
EditorsAllan Hanbury, Andreas Rauber, Gabriella Kazai, Norbert Fuhr
PublisherSpringer Verlag
Pages666-671
Number of pages6
ISBN (Electronic)9783319163536
DOIs
Publication statusPublished - 2015
Externally publishedYes
Event37th European Conference on Information Retrieval Research, ECIR 2015 - Vienna, Austria
Duration: 29 Mar 20152 Apr 2015

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume9022
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference37th European Conference on Information Retrieval Research, ECIR 2015
Country/TerritoryAustria
CityVienna,
Period29/03/152/04/15

Fingerprint

Dive into the research topics of 'Document boltzmann machines for information retrieval'. Together they form a unique fingerprint.

Cite this