Scene text recognition using residual convolutional recurrent neural network

Zhengchao Lei, Sanyuan Zhao*, Hongmei Song, Jianbing Shen

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

33 Citations (Scopus)

Abstract

Text is a significant tool for human communication, and text recognition in scene images becomes more and more important. In this paper, we propose a residual convolutional recurrent neural network for solving the task of scene text recognition. The general convolutional recurrent neural network (CRNN) is realized by combining convolutional neural network (CNN) with recurrent neural network (RNN). The CNN part extracts features and the RNN part encodes and decodes feature sequences. In order to improve the accuracy rate of scene text recognition based on CRNN, we explore different deeper CNN architectures to get feature descriptors and analyze the corresponding text recognition results. Specifically, VGG and ResNet are introduced to train these different deep models and obtain the encoding information of images. The experimental results on public datasets demonstrate the effectiveness of our method.

Original languageEnglish
Pages (from-to)861-871
Number of pages11
JournalMachine Vision and Applications
Volume29
Issue number5
DOIs
Publication statusPublished - 1 Jul 2018

Keywords

  • Convolutional neural network
  • Recurrent neural network
  • Residual convolutional recurrent neural network
  • Residual network
  • Scene text recognition

Fingerprint

Dive into the research topics of 'Scene text recognition using residual convolutional recurrent neural network'. Together they form a unique fingerprint.

Cite this