On semi-supervised learning genetic-based and deterministic annealing EM algorithm for Dirichlet mixture models

Jing Hua Bai, Kan Li, Xiao Xian Zhang

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution > peer-review

Abstract

We propose a genetic-based and deterministic annealing expectation-maximization (GA&DA-EM) algorithm for learning Dirichlet mixture models from multivariate data. The algorithm selects the number of mixture components using the minimum description length (MDL) criterion. It combines the properties of genetic algorithms and deterministic annealing in a single procedure. The population-based stochastic search of GA&DA explores the search space more thoroughly than the EM method, so the algorithm is less sensitive to its initialization and can escape locally optimal solutions. GA&DA-EM is elitist, which maintains the monotonic convergence property of the EM algorithm. We conducted experiments on the WebKB and 20NEWSGROUPS data sets. The results show that 1) GA&DA-EM outperforms the EM method, since it identifies the number of components used to generate the underlying data more often than the EM algorithm does; and 2) the algorithm is an alternative to EM that overcomes the challenge of local maxima.
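
As a rough illustration of two ingredients named in the abstract, the sketch below shows a temperature-scaled (deterministic annealing) E-step for a Dirichlet mixture and an MDL-style score for comparing models with different numbers of components. It is not the authors' implementation. The function names, the tempering of the full posterior by an inverse temperature `beta`, and the specific MDL form (negative log-likelihood plus half the free-parameter count times log n) are assumptions made for illustration, and the genetic search itself is not shown.

```python
import math

def dirichlet_logpdf(x, alpha):
    """Log-density of Dirichlet(alpha) at a point x on the probability simplex."""
    log_norm = math.lgamma(sum(alpha)) - sum(math.lgamma(a) for a in alpha)
    return log_norm + sum((a - 1.0) * math.log(xi) for a, xi in zip(alpha, x))

def tempered_responsibilities(x, weights, alphas, beta):
    """Annealed E-step (assumed form): posterior proportional to (w_k * p_k(x)) ** beta.

    beta = 1 gives the ordinary EM responsibilities; smaller beta flattens them.
    """
    logs = [beta * (math.log(w) + dirichlet_logpdf(x, a))
            for w, a in zip(weights, alphas)]
    m = max(logs)  # subtract the max for numerical stability
    exps = [math.exp(v - m) for v in logs]
    total = sum(exps)
    return [e / total for e in exps]

def mdl_score(data, weights, alphas):
    """Assumed MDL form: -log-likelihood + (free parameters / 2) * log(n)."""
    n = len(data)
    loglik = 0.0
    for x in data:
        terms = [math.log(w) + dirichlet_logpdf(x, a)
                 for w, a in zip(weights, alphas)]
        m = max(terms)
        loglik += m + math.log(sum(math.exp(t - m) for t in terms))
    k, d = len(weights), len(alphas[0])
    n_params = k * d + (k - 1)  # Dirichlet parameters plus free mixing weights
    return -loglik + 0.5 * n_params * math.log(n)

# Hypothetical two-component mixture over 3-dimensional proportions.
x = [0.7, 0.2, 0.1]
weights = [0.5, 0.5]
alphas = [[5.0, 1.0, 1.0], [1.0, 1.0, 5.0]]
sharp = tempered_responsibilities(x, weights, alphas, beta=1.0)
soft = tempered_responsibilities(x, weights, alphas, beta=0.1)
```

Under these assumptions, `soft` is closer to uniform than `sharp`, which is the mechanism by which annealing reduces dependence on initialization; among candidate models, the one with the lowest `mdl_score` would be preferred.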

Original language: English
Title of host publication: Quantum, Nano, Micro and Information Technologies
Pages: 151-156
Number of pages: 6
Publication status: Published - 2011
Event: 2010 International Symposium on Quantum, Nano and Micro Technologies, ISQNM 2010 - Chengdu, China
Duration: 27 Oct 2010 to 28 Oct 2010

Publication series

Name: Applied Mechanics and Materials
Volume: 39
ISSN (Print): 1660-9336
ISSN (Electronic): 1662-7482

Conference

Conference: 2010 International Symposium on Quantum, Nano and Micro Technologies, ISQNM 2010
Country/Territory: China
City: Chengdu
Period: 27/10/10 to 28/10/10

Keywords

  • Deterministic annealing
  • Dirichlet mixture models
  • Genetic-based
  • Model selection
  • Semi-supervised learning
