Scene Text Recognition with Linear Constrained Rectification

Gang Wang, Hua Ping Zhang, Jian Yun Shang

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Scene Text Recognition remains a challenging problem because of various styles and image distortions. This paper proposed an end-to-end trainable model with a rectification module network.The rectification module adopts a polynomial based spatial transform network to rectify the distorted input image, the feature representation between the rectification and encoding step is shared. The model can be trained with the scene images and the corresponding word labels. With the flexible rectifying and feature sharing, this model outperforms previous works through the extensive evaluation results on the standard benchmarks, especially on irregular datasets, 80.2% on IC15 and 85.4% on CUTE, more specifically.

Original languageEnglish
Title of host publicationProceedings - 2020 International Conference on Computational Science and Computational Intelligence, CSCI 2020
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages1568-1574
Number of pages7
ISBN (Electronic)9781728176246
DOIs
Publication statusPublished - Dec 2020
Event2020 International Conference on Computational Science and Computational Intelligence, CSCI 2020 - Las Vegas, United States
Duration: 16 Dec 202018 Dec 2020

Publication series

NameProceedings - 2020 International Conference on Computational Science and Computational Intelligence, CSCI 2020

Conference

Conference2020 International Conference on Computational Science and Computational Intelligence, CSCI 2020
Country/TerritoryUnited States
CityLas Vegas
Period16/12/2018/12/20

Keywords

  • Constrained Rectification
  • Scene Text Recognition
  • Spatial Transform Network

Fingerprint

Dive into the research topics of 'Scene Text Recognition with Linear Constrained Rectification'. Together they form a unique fingerprint.

Cite this