EHLLDA: A supervised hierarchical topic model

Xian Ling Mao*, Yixuan Xiao, Qiang Zhou, Jun Wang, Heyan Huang

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

2 Citations (Scopus)

Abstract

In this paper, we consider the problem of modeling hierarchical labeled data – such as Web pages and their placement in hierarchical directories. The state-of-the-art model, hierarchical Labeled LDA (hLLDA), assumes that each child of a non-leaf label has equal importance, and that a document in the corpus cannot locate in a non-leaf node. However, in most cases, these assumptions do not meet the actual situation. Thus, in this paper, we introduce a supervised hierarchical topic models: Extended Hierarchical Labeled Latent Dirichlet Allocation (EHLLDA), which aim to relax the assumptions of hLLDA by incorporating prior information of labels into hLLDA. The experimental results show that the perplexity performance of EHLLDA is always better than that of LLDA and hLLDA on all four datasets; and our proposed model is also superior to hLLDA in terms of p@n.

Original languageEnglish
Title of host publicationChinese Computational Linguistics and Natural Language Processing Based on Naturally Annotated Big Data - 14th China National Conference, CCL 2015 and 3rd International Symposium, NLP-NABD 2015, Proceedings
EditorsMaosong Sun, Zhiyuan Liu, Yang Liu, Min Zhang
PublisherSpringer Verlag
Pages215-226
Number of pages12
ISBN (Print)9783319258157
DOIs
Publication statusPublished - 2015
Event14th China National Conference on Chinese Computational Linguistics, CCL 2015 and 3rd International Symposium on Natural Language Processing Based on Naturally Annotated Big Data, NLP-NABD 2015 - Guangzhou, China
Duration: 13 Nov 201514 Nov 2015

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume9427
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference14th China National Conference on Chinese Computational Linguistics, CCL 2015 and 3rd International Symposium on Natural Language Processing Based on Naturally Annotated Big Data, NLP-NABD 2015
Country/TerritoryChina
CityGuangzhou
Period13/11/1514/11/15

Keywords

  • Hierarchical topic modeling
  • Supervised learning
  • Topic modeling

Fingerprint

Dive into the research topics of 'EHLLDA: A supervised hierarchical topic model'. Together they form a unique fingerprint.

Cite this