TY - GEN
T1 - An edge-based approach for video text extraction
AU - Shi, Jianyong
AU - Luo, Xiling
AU - Zhang, Jun
PY - 2009
Y1 - 2009
N2 - Text in video is a compact but effective clue for video indexing and summarization. In this paper, we propose an edge-based video text extraction approach with low computation, which can automatically detect and extract text from complex video frames. We first detect the edge maps of both an intensity image and its binarized image, and merge the two into one edge map, which contains less edge pixels of background but enriched edge pixels of text. Then, the projection profile method is used to evaluate the distribution of the resulting edge map in both horizontal and vertical directions. In both directions, an adaptive thresholding method is applied to identify adjacent pixel rows and columns which contain text. The intersections of these rows and columns are extracted as text regions. Finally, a novel extraction method based on monochromatism of text is applied to the regions. The output of the extraction method can be directly fed to OCR. The performance of our approach is demonstrated by presenting experimental results for a set of video clips and static images.
AB - Text in video is a compact but effective clue for video indexing and summarization. In this paper, we propose an edge-based video text extraction approach with low computation, which can automatically detect and extract text from complex video frames. We first detect the edge maps of both an intensity image and its binarized image, and merge the two into one edge map, which contains less edge pixels of background but enriched edge pixels of text. Then, the projection profile method is used to evaluate the distribution of the resulting edge map in both horizontal and vertical directions. In both directions, an adaptive thresholding method is applied to identify adjacent pixel rows and columns which contain text. The intersections of these rows and columns are extracted as text regions. Finally, a novel extraction method based on monochromatism of text is applied to the regions. The output of the extraction method can be directly fed to OCR. The performance of our approach is demonstrated by presenting experimental results for a set of video clips and static images.
KW - Character recognition
KW - Image segmentation
KW - Text extraction
KW - Video processing
UR - http://www.scopus.com/inward/record.url?scp=77951132889&partnerID=8YFLogxK
U2 - 10.1109/ICCTD.2009.177
DO - 10.1109/ICCTD.2009.177
M3 - Conference contribution
AN - SCOPUS:77951132889
SN - 9780769538921
T3 - ICCTD 2009 - 2009 International Conference on Computer Technology and Development
SP - 331
EP - 335
BT - ICCTD 2009 - 2009 International Conference on Computer Technology and Development
T2 - 2009 International Conference on Computer Technology and Development, ICCTD 2009
Y2 - 13 November 2009 through 15 November 2009
ER -