TY - JOUR
T1 - Scene text recognition using residual convolutional recurrent neural network
AU - Lei, Zhengchao
AU - Zhao, Sanyuan
AU - Song, Hongmei
AU - Shen, Jianbing
N1 - Publisher Copyright:
© 2018, Springer-Verlag GmbH Germany, part of Springer Nature.
PY - 2018/7/1
Y1 - 2018/7/1
N2 - Text is a significant tool for human communication, and text recognition in scene images becomes more and more important. In this paper, we propose a residual convolutional recurrent neural network for solving the task of scene text recognition. The general convolutional recurrent neural network (CRNN) is realized by combining convolutional neural network (CNN) with recurrent neural network (RNN). The CNN part extracts features and the RNN part encodes and decodes feature sequences. In order to improve the accuracy rate of scene text recognition based on CRNN, we explore different deeper CNN architectures to get feature descriptors and analyze the corresponding text recognition results. Specifically, VGG and ResNet are introduced to train these different deep models and obtain the encoding information of images. The experimental results on public datasets demonstrate the effectiveness of our method.
AB - Text is a significant tool for human communication, and text recognition in scene images becomes more and more important. In this paper, we propose a residual convolutional recurrent neural network for solving the task of scene text recognition. The general convolutional recurrent neural network (CRNN) is realized by combining convolutional neural network (CNN) with recurrent neural network (RNN). The CNN part extracts features and the RNN part encodes and decodes feature sequences. In order to improve the accuracy rate of scene text recognition based on CRNN, we explore different deeper CNN architectures to get feature descriptors and analyze the corresponding text recognition results. Specifically, VGG and ResNet are introduced to train these different deep models and obtain the encoding information of images. The experimental results on public datasets demonstrate the effectiveness of our method.
KW - Convolutional neural network
KW - Recurrent neural network
KW - Residual convolutional recurrent neural network
KW - Residual network
KW - Scene text recognition
UR - http://www.scopus.com/inward/record.url?scp=85048570009&partnerID=8YFLogxK
U2 - 10.1007/s00138-018-0942-y
DO - 10.1007/s00138-018-0942-y
M3 - Article
AN - SCOPUS:85048570009
SN - 0932-8092
VL - 29
SP - 861
EP - 871
JO - Machine Vision and Applications
JF - Machine Vision and Applications
IS - 5
ER -