Gaussian mixture modeling and learning of neighboring characters for multilingual text extraction in images

Xiabi Liu; Hui Fu; Yunde Jia

doi:10.1016/j.patcog.2007.06.004

Corresponding author: Xiabi Liu

School of Computer Science and Technology

Research output: Contribution to journal › Article › peer-review

40 Citations (Scopus)

Abstract

This paper proposes an approach to extracting multilingual text from images based on the statistical modeling and learning of neighboring characters. The case of three neighboring characters is represented by a Gaussian mixture model and discriminated from other cases by a corresponding 'pseudo-probability' defined under the Bayesian framework. Based on this model, text extraction is performed by labeling each connected component in the binary image as character or non-character according to its neighbors. A method based on mathematical morphology detects and connects the separated parts of each character, and a method based on Voronoi partition establishes the neighborhoods of connected components. We further present a discriminative training algorithm based on the maximum-minimum similarity (MMS) criterion to estimate the parameters of the proposed approach. Experimental results on Chinese and English text extraction demonstrate the effectiveness of the approach trained with the MMS algorithm, which achieved a precision of 93.56% and a recall of 98.55% on the test data set. The experiments also show that MMS significantly improves overall performance compared with two influential training criteria, maximum likelihood (ML) and minimum classification error (MCE).
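
As an illustration of the Gaussian mixture modeling mentioned in the abstract, the following minimal sketch evaluates the log density of a diagonal-covariance Gaussian mixture for a feature vector. It is not the authors' implementation: the two-dimensional feature (a height ratio and a gap ratio for a triple of neighboring components), the mixture parameters, and the function names are all assumptions made for this example, and the paper's 'pseudo-probability' is not reproduced here.

```python
import math


def gaussian_diag_logpdf(x, mean, var):
    """Log density of a Gaussian with diagonal covariance at point x."""
    return sum(
        -0.5 * (math.log(2.0 * math.pi * v) + (xi - m) ** 2 / v)
        for xi, m, v in zip(x, mean, var)
    )


def gmm_logpdf(x, weights, means, variances):
    """Log density of a Gaussian mixture, computed with log-sum-exp."""
    logs = [
        math.log(w) + gaussian_diag_logpdf(x, m, v)
        for w, m, v in zip(weights, means, variances)
    ]
    top = max(logs)
    return top + math.log(sum(math.exp(value - top) for value in logs))


# Hypothetical mixture over (height ratio, gap ratio) of three neighbors.
weights = [0.6, 0.4]
means = [[1.0, 0.5], [1.2, 0.8]]
variances = [[0.04, 0.02], [0.09, 0.05]]

# A feature vector close to the mixture means and one far from them.
text_like = gmm_logpdf([1.05, 0.55], weights, means, variances)
noise_like = gmm_logpdf([3.0, 4.0], weights, means, variances)
```

In this sketch a triple whose features lie near the mixture components receives a higher log density than one far away. Turning that density into a character or non-character decision would require the decision rule and trained parameters described in the paper itself.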

Original language	English
Pages (from-to)	484-493
Number of pages	10
Journal	Pattern Recognition
Volume	41
Issue number	2
DOIs	https://doi.org/10.1016/j.patcog.2007.06.004
Publication status	Published - Feb 2008

Keywords

Character recognition
Discriminative training
Document analysis
EM algorithm
Gaussian mixture models
Image retrieval
Text extraction

Cite this

@article{18f08843bde8476da83a5c2b5cda099d,

title = "Gaussian mixture modeling and learning of neighboring characters for multilingual text extraction in images",

abstract = "This paper proposes an approach to extracting multilingual text from images based on the statistical modeling and learning of neighboring characters. The case of three neighboring characters is represented by a Gaussian mixture model and discriminated from other cases by a corresponding 'pseudo-probability' defined under the Bayesian framework. Based on this model, text extraction is performed by labeling each connected component in the binary image as character or non-character according to its neighbors. A method based on mathematical morphology detects and connects the separated parts of each character, and a method based on Voronoi partition establishes the neighborhoods of connected components. We further present a discriminative training algorithm based on the maximum-minimum similarity (MMS) criterion to estimate the parameters of the proposed approach. Experimental results on Chinese and English text extraction demonstrate the effectiveness of the approach trained with the MMS algorithm, which achieved a precision of 93.56\% and a recall of 98.55\% on the test data set. The experiments also show that MMS significantly improves overall performance compared with two influential training criteria, maximum likelihood (ML) and minimum classification error (MCE).",

keywords = "Character recognition, Discriminative training, Document analysis, EM algorithm, Gaussian mixture models, Image retrieval, Text extraction",

author = "Xiabi Liu and Hui Fu and Yunde Jia",

year = "2008",

month = feb,

doi = "10.1016/j.patcog.2007.06.004",

language = "English",

volume = "41",

pages = "484--493",

journal = "Pattern Recognition",

issn = "0031-3203",

publisher = "Elsevier Ltd.",

number = "2",

}

TY - JOUR

T1 - Gaussian mixture modeling and learning of neighboring characters for multilingual text extraction in images

AU - Liu, Xiabi

AU - Fu, Hui

AU - Jia, Yunde

PY - 2008/2

Y1 - 2008/2

AB - This paper proposes an approach to extracting multilingual text from images based on the statistical modeling and learning of neighboring characters. The case of three neighboring characters is represented by a Gaussian mixture model and discriminated from other cases by a corresponding 'pseudo-probability' defined under the Bayesian framework. Based on this model, text extraction is performed by labeling each connected component in the binary image as character or non-character according to its neighbors. A method based on mathematical morphology detects and connects the separated parts of each character, and a method based on Voronoi partition establishes the neighborhoods of connected components. We further present a discriminative training algorithm based on the maximum-minimum similarity (MMS) criterion to estimate the parameters of the proposed approach. Experimental results on Chinese and English text extraction demonstrate the effectiveness of the approach trained with the MMS algorithm, which achieved a precision of 93.56% and a recall of 98.55% on the test data set. The experiments also show that MMS significantly improves overall performance compared with two influential training criteria, maximum likelihood (ML) and minimum classification error (MCE).

KW - Character recognition

KW - Discriminative training

KW - Document analysis

KW - EM algorithm

KW - Gaussian mixture models

KW - Image retrieval

KW - Text extraction

UR - http://www.scopus.com/inward/record.url?scp=34848906224&partnerID=8YFLogxK

U2 - 10.1016/j.patcog.2007.06.004

DO - 10.1016/j.patcog.2007.06.004

M3 - Article

AN - SCOPUS:34848906224

SN - 0031-3203

VL - 41

SP - 484

EP - 493

JO - Pattern Recognition

JF - Pattern Recognition

IS - 2

ER -
