TY - GEN
T1 - Neural variational correlated topic modeling
AU - Liu, Luyang
AU - Huang, Heyan
AU - Gao, Yang
AU - Wei, Xiaochi
AU - Zhang, Yongfeng
N1 - Publisher Copyright:
© 2019 IW3C2 (International World Wide Web Conference Committee), published under Creative Commons CC-BY 4.0 License.
PY - 2019/5/13
Y1 - 2019/5/13
N2 - With the rapid development of the Internet, millions of documents, such as news articles and web pages, are generated every day. Mining the topics and knowledge in them has attracted considerable interest in both academia and industry. As one of the prevalent unsupervised data mining tools, topic models are usually formulated as probabilistic generative models for large collections of texts. Traditional probabilistic topic models seek closed-form solutions for model parameters and approach the intractable posteriors via approximation methods, which usually leads to inaccurate parameter inference and low efficiency on very large volumes of data. Recently, the emerging trend of neural variational inference can overcome these issues, offering a scalable and powerful deep generative framework for modeling latent topics via neural networks. Interestingly, a common assumption of most neural variational topic models is that topics are independent of and irrelevant to one another. However, this assumption is unreasonable in many practical scenarios. In this paper, we propose a novel Centralized Transformation Flow to capture the correlations among topics by reshaping topic distributions. Furthermore, we present the Transformation Flow Lower Bound to improve the performance of the proposed model. Extensive experiments on two standard benchmark datasets validate the effectiveness of the proposed approach.
AB - With the rapid development of the Internet, millions of documents, such as news articles and web pages, are generated every day. Mining the topics and knowledge in them has attracted considerable interest in both academia and industry. As one of the prevalent unsupervised data mining tools, topic models are usually formulated as probabilistic generative models for large collections of texts. Traditional probabilistic topic models seek closed-form solutions for model parameters and approach the intractable posteriors via approximation methods, which usually leads to inaccurate parameter inference and low efficiency on very large volumes of data. Recently, the emerging trend of neural variational inference can overcome these issues, offering a scalable and powerful deep generative framework for modeling latent topics via neural networks. Interestingly, a common assumption of most neural variational topic models is that topics are independent of and irrelevant to one another. However, this assumption is unreasonable in many practical scenarios. In this paper, we propose a novel Centralized Transformation Flow to capture the correlations among topics by reshaping topic distributions. Furthermore, we present the Transformation Flow Lower Bound to improve the performance of the proposed model. Extensive experiments on two standard benchmark datasets validate the effectiveness of the proposed approach.
KW - Natural language processing
KW - Neural variational inference
KW - Topic model
UR - http://www.scopus.com/inward/record.url?scp=85066890881&partnerID=8YFLogxK
U2 - 10.1145/3308558.3313561
DO - 10.1145/3308558.3313561
M3 - Conference contribution
AN - SCOPUS:85066890881
T3 - The Web Conference 2019 - Proceedings of the World Wide Web Conference, WWW 2019
SP - 1142
EP - 1152
BT - The Web Conference 2019 - Proceedings of the World Wide Web Conference, WWW 2019
PB - Association for Computing Machinery, Inc
T2 - 2019 World Wide Web Conference, WWW 2019
Y2 - 13 May 2019 through 17 May 2019
ER -