Multi-scale object detection by top-down and bottom-up feature pyramid network

Zhao Baojun*, Zhao Boya, Tang Linbo, Wang Wenzheng, Wu Chen

*此作品的通讯作者

科研成果: 期刊稿件文章同行评审

25 引用 (Scopus)

摘要

While moving ahead with the object detection technology, especially deep neural networks, many related tasks, such as medical application and industrial automation, have achieved great success. However, the detection of objects with multiple aspect ratios and scales is still a key problem. This paper proposes a top-down and bottom-up feature pyramid network (TDBU-FPN), which combines multi-scale feature representation and anchor generation at multiple aspect ratios. First, in order to build the multi-scale feature map, this paper puts a number of fully convolutional layers after the backbone. Second, to link neighboring feature maps, top-down and bottom-up flows are adopted to introduce context information via top-down flow and supplement sub-original information via bottom-up flow. The top-down flow refers to the deconvolution procedure, and the bottom-up flow refers to the pooling procedure. Third, the problem of adapting different object aspect ratios is tackled via many anchor shapes with different aspect ratios on each multi-scale feature map. The proposed method is evaluated on the pattern analysis, statistical modeling and computational learning visual object classes (PASCAL VOC) dataset and reaches an accuracy of 79%, which exhibits a 1.8% improvement with a detection speed of 23 fps.

源语言英语
页(从-至)1-12
页数12
期刊Journal of Systems Engineering and Electronics
30
1
DOI
出版状态已出版 - 2月 2019

指纹

探究 'Multi-scale object detection by top-down and bottom-up feature pyramid network' 的科研主题。它们共同构成独一无二的指纹。

引用此