LCNet: Location Combination for Object Detection

Xin Yi, Bo Ma

科研成果: 书/报告/会议事项章节会议稿件同行评审

1 引用 (Scopus)

摘要

Object detection is a widely studied task in the computer vision field. In recent years, some milestone approaches and solid benchmarks have been proposed, which significantly boosts the development of related researches. The previous object detection methods follow a paradigm: the classification head and the regression head share the same feature extracted by the backbone network. In this paper, we revisit this paradigm for two-stage detectors and prove that the regression head can achieve better results by using the local features. In our proposed Location Combination Networks (LCNet), we extract the effective region of the feature in a Laplace way, and we introduce auxiliary confidence gain loss, Intersection over Union (IoU) gain loss, and distribution loss to guide its convergence. In the classification head, we combine these local features into the global feature for better classification. In the regression head, by ranking these effective regions in the spatial dimension, we can select the local features closest to each foreground boundary and use the selected features to predict the offset of each foreground boundary. Finally, we combine the locations of the four boundaries to obtain the final bounding box prediction. Extensive experimental results on the MS COCO benchmark validate the effectiveness of our proposed method.

源语言英语
主期刊名ICDSP 2022 - 2022 6th International Conference on Digital Signal Processing
出版商Association for Computing Machinery
152-158
页数7
ISBN(电子版)9781450395809
DOI
出版状态已出版 - 25 2月 2022
活动6th International Conference on Digital Signal Processing, ICDSP 2022 - Virtual, Online, 中国
期限: 25 2月 202227 2月 2022

出版系列

姓名ACM International Conference Proceeding Series
Par F180471

会议

会议6th International Conference on Digital Signal Processing, ICDSP 2022
国家/地区中国
Virtual, Online
时期25/02/2227/02/22

指纹

探究 'LCNet: Location Combination for Object Detection' 的科研主题。它们共同构成独一无二的指纹。

引用此