TY - JOUR
T1 - Dual attention based feature pyramid network
AU - Xing, Huijun
AU - Wang, Shuai
AU - Zheng, Dezhi
AU - Zhao, Xiaotong
N1 - Publisher Copyright:
© 2013 China Institute of Communications.
PY - 2020/8
Y1 - 2020/8
N2 - Object detection could be recognized as an essential part of the research to scenarios such as automatic driving and pedestrian detection, etc. Among multiple types of target objects, the identification of small-scale objects faces significant challenges. We would introduce a new feature pyramid framework called Dual Attention based Feature Pyramid Network (DAFPN), which is designed to avoid predicament about multi-scale object recognition. In DAFPN, the attention mechanism is introduced by calculating the top-down pathway and lateral pathway, where the spatial attention, as well as channel attention, would participate, respectively, such that the pyramidal feature maps can be generated with enhanced spatial and channel interdependencies, which bring more semantical information for the feature pyramid. Using the COCO data set, which consists of a considerable quantity of small-scale objects, the experiments are implemented. The analysis results verify the optimized performance of DAFPN compared with the original Feature Pyramid Network (FPN) specifically for the identification on a small scale. The proposed DAFPN is promising for object detection in an era full of intelligent machines that need to detect multi-scale objects.
AB - Object detection could be recognized as an essential part of the research to scenarios such as automatic driving and pedestrian detection, etc. Among multiple types of target objects, the identification of small-scale objects faces significant challenges. We would introduce a new feature pyramid framework called Dual Attention based Feature Pyramid Network (DAFPN), which is designed to avoid predicament about multi-scale object recognition. In DAFPN, the attention mechanism is introduced by calculating the top-down pathway and lateral pathway, where the spatial attention, as well as channel attention, would participate, respectively, such that the pyramidal feature maps can be generated with enhanced spatial and channel interdependencies, which bring more semantical information for the feature pyramid. Using the COCO data set, which consists of a considerable quantity of small-scale objects, the experiments are implemented. The analysis results verify the optimized performance of DAFPN compared with the original Feature Pyramid Network (FPN) specifically for the identification on a small scale. The proposed DAFPN is promising for object detection in an era full of intelligent machines that need to detect multi-scale objects.
KW - convolutional neural networks
KW - feature pyramid
KW - object detection
UR - http://www.scopus.com/inward/record.url?scp=85094196507&partnerID=8YFLogxK
U2 - 10.23919/JCC.2020.08.020
DO - 10.23919/JCC.2020.08.020
M3 - Article
AN - SCOPUS:85094196507
SN - 1673-5447
VL - 17
SP - 242
EP - 252
JO - China Communications
JF - China Communications
IS - 8
M1 - 9190145
ER -