CNN- Transformer-Based Modeling and Visual Measurement of Compound Eye Vision System

Shangwu Feng, Li Yang, Yuan Li

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

In some special environments, vision measurement systems need to be miniaturized, lightweight and have a large field of view. The compound eye vision system satisfies these features in small-range close-up measurements. In this paper, an 8 ∗ 8 compound eye array is built so that a compound eye vision measurement system can be established to obtain the world coordinates of the image and the target. The traditional approach is model-based vision system modeling, which is very dependent on the accuracy of the model. This is very difficult in compound-eye vision systems with a large degree of nonlinearity. In this paper, a SubHarris-based feature point extraction method is designed. A new data structure based on the extracted feature points is constructed for the feature that not all subeyes are imaged, and a neural network calibration method based on CNN - Transformer is designed to make the model more focused on the regions with images. The results show that the MAE of the method using deep learning improves by 19.7% relative to the basic neural network. The length measurement error is improved by 15.7% at 30-80 mm. It is also found in the experiments that the designed or improved modules SPP, RoIPool, VggBlock, CBAM and Transfomer encoding block in the convolutional structure all have a boosting effect on the final error results.

源语言英语
主期刊名2023 42nd Chinese Control Conference, CCC 2023
出版商IEEE Computer Society
7572-7577
页数6
ISBN(电子版)9789887581543
DOI
出版状态已出版 - 2023
活动42nd Chinese Control Conference, CCC 2023 - Tianjin, 中国
期限: 24 7月 202326 7月 2023

出版系列

姓名Chinese Control Conference, CCC
2023-July
ISSN(印刷版)1934-1768
ISSN(电子版)2161-2927

会议

会议42nd Chinese Control Conference, CCC 2023
国家/地区中国
Tianjin
时期24/07/2326/07/23

指纹

探究 'CNN- Transformer-Based Modeling and Visual Measurement of Compound Eye Vision System' 的科研主题。它们共同构成独一无二的指纹。

引用此