TY - GEN
T1 - Document Boltzmann Machines for Information Retrieval
AU - Yu, Qian
AU - Zhang, Peng
AU - Hou, Yuexian
AU - Song, Dawei
AU - Wang, Jun
N1 - Publisher Copyright:
© Springer International Publishing Switzerland 2015.
PY - 2015
Y1 - 2015
N2 - Probabilistic language modelling has been widely used in information retrieval. It estimates document models under the multinomial distribution assumption, and uses query likelihood to rank documents. In this paper, we aim to generalize this distribution assumption by exploring the use of fully-observable Boltzmann Machines (BMs) for document modelling. A BM is a stochastic recurrent network that can model the joint distribution of multi-dimensional variables. It yields a Boltzmann distribution, which is more general than the multinomial distribution. We propose a Document Boltzmann Machine (DBM) that can naturally capture the intrinsic connections among terms and estimate query likelihood efficiently. We formally prove that under certain conditions (when only 1-order parameters are learnt), the DBM subsumes the traditional document language model. Its relations to other graphical models in IR, e.g., the MRF model, are also discussed. Our experiments on document reranking demonstrate the potential of the proposed DBM.
AB - Probabilistic language modelling has been widely used in information retrieval. It estimates document models under the multinomial distribution assumption, and uses query likelihood to rank documents. In this paper, we aim to generalize this distribution assumption by exploring the use of fully-observable Boltzmann Machines (BMs) for document modelling. A BM is a stochastic recurrent network that can model the joint distribution of multi-dimensional variables. It yields a Boltzmann distribution, which is more general than the multinomial distribution. We propose a Document Boltzmann Machine (DBM) that can naturally capture the intrinsic connections among terms and estimate query likelihood efficiently. We formally prove that under certain conditions (when only 1-order parameters are learnt), the DBM subsumes the traditional document language model. Its relations to other graphical models in IR, e.g., the MRF model, are also discussed. Our experiments on document reranking demonstrate the potential of the proposed DBM.
UR - http://www.scopus.com/inward/record.url?scp=84925435410&partnerID=8YFLogxK
U2 - 10.1007/978-3-319-16354-3_73
DO - 10.1007/978-3-319-16354-3_73
M3 - Conference contribution
AN - SCOPUS:84925435410
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 666
EP - 671
BT - Advances in Information Retrieval - 37th European Conference on IR Research, ECIR 2015, Proceedings
A2 - Hanbury, Allan
A2 - Rauber, Andreas
A2 - Kazai, Gabriella
A2 - Fuhr, Norbert
PB - Springer Verlag
T2 - 37th European Conference on Information Retrieval Research, ECIR 2015
Y2 - 29 March 2015 through 2 April 2015
ER -