Deep Contextual Stroke Pooling for Scene Character Recognition

Zhong Zhang*, Hong Wang, Shuang Liu, Baihua Xiao

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

11 Citations (Scopus)

Abstract

Characters, as a kind of symbols carrying rich semantic information, are composed of strokes arranged in a certain structure and are of great significance in our daily life. In this paper, we are concerned with the problem of scene character recognition, and study the problem from the perspective of feature representation. We propose a novel pooling method termed deep contextual stroke pooling (DCSP) for scene character recognition. The proposed DCSP discovers the most prominent stroke information by using stroke detectors and captures the spatial context of discriminative strokes by learning contextual factor. Specifically, we first utilize the convolutional summing map in one convolutional layer to select discriminative strokes and use the convolutional activation features of discriminative strokes to train stroke detectors. Then, we propose the contextual factor to represent the co-occurrence probability of the stroke and its location. Finally, in the response regions, we incorporate the contextual factor into the detector scores and obtain the deep contextual confidence vectors of scene characters. Extensive experiments are conducted on three databases, i.e., ICDAR2003, Chars74k, and SVHN, and the experimental results demonstrate that our method achieves higher accuracies than the state-of-the-art methods.

Original languageEnglish
Pages (from-to)16454-16463
Number of pages10
JournalIEEE Access
Volume6
DOIs
Publication statusPublished - 17 Mar 2018
Externally publishedYes

Keywords

  • Scene character recognition
  • contextual factor
  • deep contextual stroke pooling

Fingerprint

Dive into the research topics of 'Deep Contextual Stroke Pooling for Scene Character Recognition'. Together they form a unique fingerprint.

Cite this