CapsNet based on Encoder and Decoder for Object Detection

Man Luo; Xin Wang; Hongbin Ma

doi:10.1109/ICMA49215.2020.9233658

CapsNet based on Encoder and Decoder for Object Detection

Man Luo, Xin Wang, Hongbin Ma

自动化学院

Beijing Institute of Technology

科研成果: 书/报告/会议事项章节 › 会议稿件 › 同行评审

摘要

The recently proposed capsule network (CapsNet) can learn the hierarchy relationships of entity features and realize the equivariance to affine transformations, which makes the capsule architecture more promising for object detection. In this paper, based on capsule architecture, we create the CapsNet-V1 models for object detection. The proposed CapsNetV1 mainly consists of the classification net as encoder to extract multi-class information and the reconstruction net as decoder to obtain masks with multi-object position information. In the experiments, based on the randomly expanded MNIST dataset, we simultaneously evaluate the multi-object classification and reconstruction abilities of the proposed CapsNet. The results indicate that our capsule models can reconstruct the object masks with accurate location information at correct labels, which exactly demonstrates the feasibility of using capsule networks for object detection. Further, our CapsNet can be widely applied to the multi-object detection with simple backgrounds in the industrial production lines.

源语言	英语
主期刊名	2020 IEEE International Conference on Mechatronics and Automation, ICMA 2020
出版商	Institute of Electrical and Electronics Engineers Inc.
页	1112-1117
页数	6
ISBN（电子版）	9781728164151
DOI	https://doi.org/10.1109/ICMA49215.2020.9233658
出版状态	已出版 - 13 10月 2020
活动	17th IEEE International Conference on Mechatronics and Automation, ICMA 2020 - Beijing, 中国期限: 13 10月 2020 → 16 10月 2020

出版系列

姓名	2020 IEEE International Conference on Mechatronics and Automation, ICMA 2020

会议

会议	17th IEEE International Conference on Mechatronics and Automation, ICMA 2020
国家/地区	中国
市	Beijing
时期	13/10/20 → 16/10/20

访问文件

10.1109/ICMA49215.2020.9233658

其它文件与链接

链接到 Scopus 的出版物

引用此

Luo, M., Wang, X., & Ma, H. (2020). CapsNet based on Encoder and Decoder for Object Detection. 在 2020 IEEE International Conference on Mechatronics and Automation, ICMA 2020 (页码 1112-1117). 文章 9233658 (2020 IEEE International Conference on Mechatronics and Automation, ICMA 2020). Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/ICMA49215.2020.9233658

@inproceedings{74627d700c534cbfa73eb7eb106de913,

title = "CapsNet based on Encoder and Decoder for Object Detection",

abstract = "The recently proposed capsule network (CapsNet) can learn the hierarchy relationships of entity features and realize the equivariance to affine transformations, which makes the capsule architecture more promising for object detection. In this paper, based on capsule architecture, we create the CapsNet-V1 models for object detection. The proposed CapsNetV1 mainly consists of the classification net as encoder to extract multi-class information and the reconstruction net as decoder to obtain masks with multi-object position information. In the experiments, based on the randomly expanded MNIST dataset, we simultaneously evaluate the multi-object classification and reconstruction abilities of the proposed CapsNet. The results indicate that our capsule models can reconstruct the object masks with accurate location information at correct labels, which exactly demonstrates the feasibility of using capsule networks for object detection. Further, our CapsNet can be widely applied to the multi-object detection with simple backgrounds in the industrial production lines.",

keywords = "capsule networks, classification encoder, dynamic routing algorithm, expanded MNIST dataset, reconstruction decoder",

author = "Man Luo and Xin Wang and Hongbin Ma",

note = "Publisher Copyright: {\textcopyright} 2020 IEEE.; 17th IEEE International Conference on Mechatronics and Automation, ICMA 2020 ; Conference date: 13-10-2020 Through 16-10-2020",

year = "2020",

month = oct,

day = "13",

doi = "10.1109/ICMA49215.2020.9233658",

language = "English",

series = "2020 IEEE International Conference on Mechatronics and Automation, ICMA 2020",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

pages = "1112--1117",

booktitle = "2020 IEEE International Conference on Mechatronics and Automation, ICMA 2020",

address = "United States",

}

Luo, M, Wang, X & Ma, H 2020, CapsNet based on Encoder and Decoder for Object Detection. 在 2020 IEEE International Conference on Mechatronics and Automation, ICMA 2020., 9233658, 2020 IEEE International Conference on Mechatronics and Automation, ICMA 2020, Institute of Electrical and Electronics Engineers Inc., 页码 1112-1117, 17th IEEE International Conference on Mechatronics and Automation, ICMA 2020, Beijing, 中国, 13/10/20. https://doi.org/10.1109/ICMA49215.2020.9233658

CapsNet based on Encoder and Decoder for Object Detection. / Luo, Man; Wang, Xin; Ma, Hongbin.
2020 IEEE International Conference on Mechatronics and Automation, ICMA 2020. Institute of Electrical and Electronics Engineers Inc., 2020. 页码 1112-1117 9233658 (2020 IEEE International Conference on Mechatronics and Automation, ICMA 2020).

科研成果: 书/报告/会议事项章节 › 会议稿件 › 同行评审

TY - GEN

T1 - CapsNet based on Encoder and Decoder for Object Detection

AU - Luo, Man

AU - Wang, Xin

AU - Ma, Hongbin

PY - 2020/10/13

Y1 - 2020/10/13

N2 - The recently proposed capsule network (CapsNet) can learn the hierarchy relationships of entity features and realize the equivariance to affine transformations, which makes the capsule architecture more promising for object detection. In this paper, based on capsule architecture, we create the CapsNet-V1 models for object detection. The proposed CapsNetV1 mainly consists of the classification net as encoder to extract multi-class information and the reconstruction net as decoder to obtain masks with multi-object position information. In the experiments, based on the randomly expanded MNIST dataset, we simultaneously evaluate the multi-object classification and reconstruction abilities of the proposed CapsNet. The results indicate that our capsule models can reconstruct the object masks with accurate location information at correct labels, which exactly demonstrates the feasibility of using capsule networks for object detection. Further, our CapsNet can be widely applied to the multi-object detection with simple backgrounds in the industrial production lines.

AB - The recently proposed capsule network (CapsNet) can learn the hierarchy relationships of entity features and realize the equivariance to affine transformations, which makes the capsule architecture more promising for object detection. In this paper, based on capsule architecture, we create the CapsNet-V1 models for object detection. The proposed CapsNetV1 mainly consists of the classification net as encoder to extract multi-class information and the reconstruction net as decoder to obtain masks with multi-object position information. In the experiments, based on the randomly expanded MNIST dataset, we simultaneously evaluate the multi-object classification and reconstruction abilities of the proposed CapsNet. The results indicate that our capsule models can reconstruct the object masks with accurate location information at correct labels, which exactly demonstrates the feasibility of using capsule networks for object detection. Further, our CapsNet can be widely applied to the multi-object detection with simple backgrounds in the industrial production lines.

KW - capsule networks

KW - classification encoder

KW - dynamic routing algorithm

KW - expanded MNIST dataset

KW - reconstruction decoder

UR - http://www.scopus.com/inward/record.url?scp=85096584767&partnerID=8YFLogxK

U2 - 10.1109/ICMA49215.2020.9233658

DO - 10.1109/ICMA49215.2020.9233658

M3 - Conference contribution

AN - SCOPUS:85096584767

T3 - 2020 IEEE International Conference on Mechatronics and Automation, ICMA 2020

SP - 1112

EP - 1117

BT - 2020 IEEE International Conference on Mechatronics and Automation, ICMA 2020

PB - Institute of Electrical and Electronics Engineers Inc.

T2 - 17th IEEE International Conference on Mechatronics and Automation, ICMA 2020

Y2 - 13 October 2020 through 16 October 2020

ER -

CapsNet based on Encoder and Decoder for Object Detection

摘要

出版系列

会议

访问文件

其它文件与链接

指纹

引用此