CapsNet based on Encoder and Decoder for Object Detection

Man Luo, Xin Wang, Hongbin Ma

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

The recently proposed capsule network (CapsNet) can learn the hierarchy relationships of entity features and realize the equivariance to affine transformations, which makes the capsule architecture more promising for object detection. In this paper, based on capsule architecture, we create the CapsNet-V1 models for object detection. The proposed CapsNetV1 mainly consists of the classification net as encoder to extract multi-class information and the reconstruction net as decoder to obtain masks with multi-object position information. In the experiments, based on the randomly expanded MNIST dataset, we simultaneously evaluate the multi-object classification and reconstruction abilities of the proposed CapsNet. The results indicate that our capsule models can reconstruct the object masks with accurate location information at correct labels, which exactly demonstrates the feasibility of using capsule networks for object detection. Further, our CapsNet can be widely applied to the multi-object detection with simple backgrounds in the industrial production lines.

源语言英语
主期刊名2020 IEEE International Conference on Mechatronics and Automation, ICMA 2020
出版商Institute of Electrical and Electronics Engineers Inc.
1112-1117
页数6
ISBN(电子版)9781728164151
DOI
出版状态已出版 - 13 10月 2020
活动17th IEEE International Conference on Mechatronics and Automation, ICMA 2020 - Beijing, 中国
期限: 13 10月 202016 10月 2020

出版系列

姓名2020 IEEE International Conference on Mechatronics and Automation, ICMA 2020

会议

会议17th IEEE International Conference on Mechatronics and Automation, ICMA 2020
国家/地区中国
Beijing
时期13/10/2016/10/20

指纹

探究 'CapsNet based on Encoder and Decoder for Object Detection' 的科研主题。它们共同构成独一无二的指纹。

引用此