An online inference algorithm for Labeled Latent Dirichlet allocation

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

3 Citations (Scopus)

Abstract

Using topic models to analyze documents is a popular method in text mining. Labeled Latent Dirichlet Allocation(Labeled LDA) is one of them that is widely used to model tagged documents and to solve relevant problems, such as tagged document visualization, snippet extraction and so on. However, traditional batch inference for Labeled LDA, which runs over entire document collection, is computationally expensive and not suitable for large scale corpora and text streams. In this paper, we develop an efficient online algorithm for Labeled LDA, called online Labeled LDA(online-LLDA). It is based on particle filter, a Sequential Monte Carlo approximation technique. Our experiments show that online-LLDA significantly outperforms batch algorithm(batch- LLDA) in time, while preserving equivalent quality.

Original languageEnglish
Title of host publicationWeb Technologies and Applications - 17th Asia-PacificWeb Conference,APWeb 2015, Proceedings
EditorsReynold Cheng, Bin Cui, Zhenjie Zhang, Ruichu Cai, Jia Xu
PublisherSpringer Verlag
Pages17-28
Number of pages12
ISBN (Print)9783319252544
DOIs
Publication statusPublished - 2015
Event17th Asia-PacificWeb Conference, APWeb 2015 - Guangzhou, China
Duration: 18 Sept 201520 Sept 2015

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume9313
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference17th Asia-PacificWeb Conference, APWeb 2015
Country/TerritoryChina
CityGuangzhou
Period18/09/1520/09/15

Keywords

  • Online inference
  • Online labeled lda
  • Particle filter

Fingerprint

Dive into the research topics of 'An online inference algorithm for Labeled Latent Dirichlet allocation'. Together they form a unique fingerprint.

Cite this