Gaussian mixture modeling of neighbor characters for multilingual text extraction in images

Hui Fu*, Xiabi Liu, Yunde Jia, Hongbin Deng

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

5 Citations (Scopus)

Abstract

This paper proposes a new method to extract multilingual text in images through discriminating characters from non-characters based on the Gaussian Mixture modeling of neighbor characters. The image is binarized and the morphological closing operation is performed on the binary image, in order that each character in it can be treated as a connected component; The neighborhood of connected components are computed based on the Voronoi partition of the image, and each connected component is labeled as character or non-character according to its neighbors. We applied the proposed text extraction method to Chinese and English text extraction, the effectiveness of which is confirmed by the experimental results.

Original languageEnglish
Title of host publication2006 IEEE International Conference on Image Processing, ICIP 2006 - Proceedings
Pages3321-3324
Number of pages4
DOIs
Publication statusPublished - 2006
Event2006 IEEE International Conference on Image Processing, ICIP 2006 - Atlanta, GA, United States
Duration: 8 Oct 200611 Oct 2006

Publication series

NameProceedings - International Conference on Image Processing, ICIP
ISSN (Print)1522-4880

Conference

Conference2006 IEEE International Conference on Image Processing, ICIP 2006
Country/TerritoryUnited States
CityAtlanta, GA
Period8/10/0611/10/06

Keywords

  • Document image processing
  • Gaussian distributions
  • Text recognition

Fingerprint

Dive into the research topics of 'Gaussian mixture modeling of neighbor characters for multilingual text extraction in images'. Together they form a unique fingerprint.

Cite this