A robust video text extraction and recognition approach using OCR feedback information

Guangyu Gao*, He Zhang, Hongting Chen

*此作品的通讯作者

科研成果: 期刊稿件会议文章同行评审

2 引用 (Scopus)

摘要

Video text is very important semantic information, which brings precise and meaningful clues for video indexing and retrieval. However, most previous approaches did video text extraction and recognition separately, while the main difficulty of extraction and recognition with complex background wasn’t handled very well. In this paper, these difficulty is investigated by combining text extraction and recognition together as well as using OCR feedback information. The following features are highlighted in our approach: (i) an efficient character image segmentation method is proposed in consideration of most prior knowledge. (ii) text extraction are implemented both on text-row and segmented single character images, since text-row based extraction maintains the color consistency of characters and backgrounds while single character has simpler background. After that, the best binary image is chosen for recognition with OCR feedback. (iii) The K-means algorithm is used for extraction which ensures that the best extraction result is involved, which is the binary image with clear classification of text strokes and background. Finally, extensive experiments and empirical evaluations on several video text images are conducted to demonstrate the satisfying performance of the proposed approach.

源语言英语
页(从-至)507-517
页数11
期刊Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
9314
DOI
出版状态已出版 - 2015
活动16th Pacific-Rim Conference on Multimedia, PCM 2015 - Gwangju, 韩国
期限: 16 9月 201518 9月 2015

指纹

探究 'A robust video text extraction and recognition approach using OCR feedback information' 的科研主题。它们共同构成独一无二的指纹。

引用此