Skip to main navigation Skip to search Skip to main content

Bilateral convolutional activations encoded with fisher vectors for scene character recognition

  • Zhong Zhang
  • , Hong Wang
  • , Shuang Liu
  • , Tariq S. Durrani
  • Tianjin Normal University
  • University of Strathclyde

Research output: Contribution to journalArticlepeer-review

Abstract

A rich and robust representation for scene characters plays a significant role in automatically understanding the text in images. In this letter, we focus on the issue of feature representation, and propose a novel encoding method named bilateral convolutional activations encoded with Fisher vectors (BCA-FV) for scene character recognition. Concretely, we first extract convolutional activation descriptors from convolutional maps and then build a bilateral convolutional activation map (BCAM) to capture the relationship between the convolutional activation response and the spatial structure information. Finally, in order to obtain the global feature representation, the BCAM is injected into FV to encode convolutional activation descriptors. Hence, the BCA-FV can effectively integrate the prominent features and spatial structure information for character representation. We verify our method on two widely used databases (ICDAR2003 and Chars74K), and the experimental results demonstrate that our method achieves better results than the state-of-the-art methods. In addition, we further validate the proposed BCA-FV on the "Pan+ChiPhoto" database for Chinese scene character recognition, and the experimental results show the good generalization ability of the proposed BCA-FV.

Original languageEnglish
Pages (from-to)1453-1456
Number of pages4
JournalIEICE Transactions on Information and Systems
VolumeE101D
Issue number5
DOIs
Publication statusPublished - May 2018

Keywords

  • Bilateral convolutional activations
  • Fisher vectors
  • Scene character recognition

Fingerprint

Dive into the research topics of 'Bilateral convolutional activations encoded with fisher vectors for scene character recognition'. Together they form a unique fingerprint.

Cite this