Image-similarity-based convolutional neural network for robot visual relocalization

Li Wang; Ruifeng Li; Jingwen Sun; Hock Soon Seah; Chee Kwang Quah; Lijun Zhao; Budianto Tandianus

doi:10.18494/SAM.2020.2549

Li Wang, Ruifeng Li, Jingwen Sun^*, Hock Soon Seah, Chee Kwang Quah, Lijun Zhao, Budianto Tandianus

^*Corresponding author for this work

Research output: Contribution to journal › Article › peer-review

4 Citations (Scopus)

Abstract

Convolutional neural network (CNN)-based methods, which train an end-to-end model to regress the six-degree-of-freedom (DoF) pose of a robot from a single red–green–blue (RGB) image, have recently been developed to overcome the poor robustness of robot visual relocalization. However, pose precision drops when the test image is dissimilar to the training images. In this paper, we propose a novel method, named image-similarity-based CNN, which takes the image similarity of the input image into account during CNN training. The higher the similarity of the input image, the higher the precision that can be achieved. Therefore, we crop the input image into several small image blocks and measure the similarity between each cropped image block and the training dataset images using a feature vector from a fully connected CNN layer. Finally, the most similar image is selected to regress the pose. A genetic algorithm is used to determine the crop position. Experiments are conducted on the open-source 7-Scenes dataset and in two real indoor environments. The results show that the proposed algorithm yields better results and effectively reduces large regression errors compared with existing solutions.
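The crop-and-select step described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes cosine similarity as the similarity measure (the abstract does not name one), replaces the fully connected CNN layer with a toy feature extractor that simply flattens the block, and uses a fixed list of candidate crop positions where the paper uses a genetic algorithm. The names `cosine_similarity`, `crop`, `flatten`, and `best_crop` are hypothetical.

```python
import math
from typing import Callable, List, Sequence, Tuple

Image = List[List[float]]  # grayscale image stored as rows of pixel values


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine similarity of two feature vectors (assumed metric, not from the paper)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def crop(image: Image, top: int, left: int, height: int, width: int) -> Image:
    """Cut a height x width block out of the image, starting at (top, left)."""
    return [row[left:left + width] for row in image[top:top + height]]


def flatten(block: Image) -> List[float]:
    """Toy stand-in for the CNN feature vector: the raw pixels of the block."""
    return [pixel for row in block for pixel in row]


def best_crop(
    image: Image,
    positions: Sequence[Tuple[int, int]],
    size: Tuple[int, int],
    extract_features: Callable[[Image], Sequence[float]],
    train_features: Sequence[Sequence[float]],
) -> Tuple[float, Tuple[int, int], Image]:
    """Return (score, position, block) for the crop most similar to any training feature."""
    if not positions or not train_features:
        raise ValueError("need at least one crop position and one training feature")
    best = None
    for top, left in positions:
        block = crop(image, top, left, size[0], size[1])
        features = extract_features(block)
        # A block's score is its similarity to the closest training image.
        score = max(cosine_similarity(features, t) for t in train_features)
        if best is None or score > best[0]:
            best = (score, (top, left), block)
    return best


# Usage: a 4x4 image, four 2x2 candidate crops, one training feature vector.
image = [
    [1.0, 0.0, 5.0, 5.0],
    [0.0, 1.0, 5.0, 5.0],
    [2.0, 2.0, 0.0, 3.0],
    [2.0, 2.0, 3.0, 0.0],
]
positions = [(0, 0), (0, 2), (2, 0), (2, 2)]
train_features = [[1.0, 0.0, 0.0, 1.0]]
score, position, block = best_crop(image, positions, (2, 2), flatten, train_features)
```

In the method the abstract describes, the selected block would then be passed to the pose-regression network. That step is omitted here because the abstract gives no details of the network.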

Original language	English
Pages (from-to)	1245-1259
Number of pages	15
Journal	Sensors and Materials
Volume	32
Issue number	4
DOIs	https://doi.org/10.18494/SAM.2020.2549
Publication status	Published - 10 Apr 2020
Externally published	Yes

Keywords

CNN
Image similarity
Visual relocalization

Cite this

Wang, L., Li, R., Sun, J., Seah, H. S., Quah, C. K., Zhao, L., & Tandianus, B. (2020). Image-similarity-based convolutional neural network for robot visual relocalization. Sensors and Materials, 32(4), 1245-1259. https://doi.org/10.18494/SAM.2020.2549

@article{23af54c669344fdd8b7b9fa2409a7de1,

title = "Image-similarity-based convolutional neural network for robot visual relocalization",

abstract = "Convolutional neural network (CNN)-based methods, which train an end-to-end model to regress a six degree of freedom (DoF) pose of a robot from a single red–green–blue (RGB) image, have been developed to overcome the poor robustness of robot visual relocalization recently. However, the pose precision becomes low when the test image is dissimilar to training images. In this paper, we propose a novel method, named image-similarity-based CNN, which considers the image similarity of an input image during the CNN training. The higher the similarity of the input image, the higher precision we can achieve. Therefore, we crop the input image into several small image blocks, and the similarity between each cropped image block and training dataset images is measured by employing a feature vector in a fully connected CNN layer. Finally, the most similar image is selected to regress the pose. A genetic algorithm is utilized to determine the cropped position. Experiments on both open-source dataset 7-Scenes and two actual indoor environments are conducted. The results show that the proposed algorithm leads to better results and reduces large regression errors effectively compared with existing solutions.",

keywords = "CNN, Image similarity, Visual relocalization",

author = "Li Wang and Ruifeng Li and Jingwen Sun and Seah, {Hock Soon} and Quah, {Chee Kwang} and Lijun Zhao and Budianto Tandianus",

note = "Publisher Copyright: {\textcopyright} MYU K.K.",

year = "2020",

month = apr,

day = "10",

doi = "10.18494/SAM.2020.2549",

language = "English",

volume = "32",

pages = "1245--1259",

journal = "Sensors and Materials",

issn = "0914-4935",

publisher = "M Y U Scientific Publishing Division",

number = "4",

}

TY - JOUR

T1 - Image-similarity-based convolutional neural network for robot visual relocalization

AU - Wang, Li

AU - Li, Ruifeng

AU - Sun, Jingwen

AU - Seah, Hock Soon

AU - Quah, Chee Kwang

AU - Zhao, Lijun

AU - Tandianus, Budianto

PY - 2020/4/10

Y1 - 2020/4/10

N2 - Convolutional neural network (CNN)-based methods, which train an end-to-end model to regress a six degree of freedom (DoF) pose of a robot from a single red–green–blue (RGB) image, have been developed to overcome the poor robustness of robot visual relocalization recently. However, the pose precision becomes low when the test image is dissimilar to training images. In this paper, we propose a novel method, named image-similarity-based CNN, which considers the image similarity of an input image during the CNN training. The higher the similarity of the input image, the higher precision we can achieve. Therefore, we crop the input image into several small image blocks, and the similarity between each cropped image block and training dataset images is measured by employing a feature vector in a fully connected CNN layer. Finally, the most similar image is selected to regress the pose. A genetic algorithm is utilized to determine the cropped position. Experiments on both open-source dataset 7-Scenes and two actual indoor environments are conducted. The results show that the proposed algorithm leads to better results and reduces large regression errors effectively compared with existing solutions.

AB - Convolutional neural network (CNN)-based methods, which train an end-to-end model to regress a six degree of freedom (DoF) pose of a robot from a single red–green–blue (RGB) image, have been developed to overcome the poor robustness of robot visual relocalization recently. However, the pose precision becomes low when the test image is dissimilar to training images. In this paper, we propose a novel method, named image-similarity-based CNN, which considers the image similarity of an input image during the CNN training. The higher the similarity of the input image, the higher precision we can achieve. Therefore, we crop the input image into several small image blocks, and the similarity between each cropped image block and training dataset images is measured by employing a feature vector in a fully connected CNN layer. Finally, the most similar image is selected to regress the pose. A genetic algorithm is utilized to determine the cropped position. Experiments on both open-source dataset 7-Scenes and two actual indoor environments are conducted. The results show that the proposed algorithm leads to better results and reduces large regression errors effectively compared with existing solutions.

KW - CNN

KW - Image similarity

KW - Visual relocalization

UR - http://www.scopus.com/inward/record.url?scp=85084051492&partnerID=8YFLogxK

U2 - 10.18494/SAM.2020.2549

DO - 10.18494/SAM.2020.2549

M3 - Article

AN - SCOPUS:85084051492

SN - 0914-4935

VL - 32

SP - 1245

EP - 1259

JO - Sensors and Materials

JF - Sensors and Materials

IS - 4

ER -
