Multi-level Proposal Relations Aggregation for Video Object Detection

Chongkai Yu, Wenjie Chen*, Bing Wu

*此作品的通讯作者

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Video information often deteriorates in certain frames, which is a great challenge for object detection. It is difficult to identify the object in this frame by just utilizing the information of one frame. Recently, plenty of studies have shown that context aggregating information through the self-attention mechanism can enhance the features in key frames. However, these methods only exploit some of inter-video and intra-video global-local information, not all of it. Global semantic and local localization information in the same video can assist object classification and regression. The intra-proposal relation among different videos can provide important cues to distinguish confusing objects. All of this information is able to enhance the performance of video object detection. In this paper, we design a Multi-Level Proposal Relations Aggregation network to mine inter-video and intra-video global-local pro-posal relations. For intra-video, we effectively aggregate global and local information to augments the proposal features of key frames. For inter-video, we aggregate the inter-video key frame features to the target video under the constraint of relation regularization. We flexibly utilize the relation module to aggregate the proposals from different frames. Experiments on ImageNet VID dataset demonstrate the effectiveness of our method.

源语言英语
主期刊名Artificial Neural Networks and Machine Learning - ICANN 2022 - 31st International Conference on Artificial Neural Networks, Proceedings
编辑Elias Pimenidis, Mehmet Aydin, Plamen Angelov, Chrisina Jayne, Antonios Papaleonidas
出版商Springer Science and Business Media Deutschland GmbH
734-745
页数12
ISBN(印刷版)9783031159183
DOI
出版状态已出版 - 2022
活动31st International Conference on Artificial Neural Networks, ICANN 2022 - Bristol, 英国
期限: 6 9月 20229 9月 2022

出版系列

姓名Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
13529 LNCS
ISSN(印刷版)0302-9743
ISSN(电子版)1611-3349

会议

会议31st International Conference on Artificial Neural Networks, ICANN 2022
国家/地区英国
Bristol
时期6/09/229/09/22

指纹

探究 'Multi-level Proposal Relations Aggregation for Video Object Detection' 的科研主题。它们共同构成独一无二的指纹。

引用此