TY - GEN
T1 - Multi-scale Vertical Cross-layer Feature Aggregation and Attention Fusion Network for Object Detection
AU - Gao, Wenting
AU - Li, Xiaojuan
AU - Han, Yu
AU - Liu, Yue
N1 - Publisher Copyright:
© 2022, The Author(s), under exclusive license to Springer Nature Switzerland AG.
PY - 2022
Y1 - 2022
N2 - Scale imbalance is one of the primary limitations for object detection. To tackle such a problem, existing methods such as FPN usually integrate the features at different scales, which suffers from the inconsistence of different high-level and low-level features due to the straightforward combination. In this paper, we propose a multi-scale vertical cross-layer feature aggregation and attention fusion network which not only has bottom-up and top-down pathways with lateral connections, but also adds cross-layer paths in the vertical direction. The proposed method can boost information flow and shorten the information path between high-level and low-level features. An attention fusion module is also introduced to obtain the internal correlation between local, global and contextual information of other feature layers. In order to optimize the anchor configurations, a differential evolution algorithm is employed to reconfigure the ratios and scales of anchors. Experimental results show that the proposed method achieves superior detection performance on the public dataset PASCAL VOC.
AB - Scale imbalance is one of the primary limitations for object detection. To tackle such a problem, existing methods such as FPN usually integrate the features at different scales, which suffers from the inconsistence of different high-level and low-level features due to the straightforward combination. In this paper, we propose a multi-scale vertical cross-layer feature aggregation and attention fusion network which not only has bottom-up and top-down pathways with lateral connections, but also adds cross-layer paths in the vertical direction. The proposed method can boost information flow and shorten the information path between high-level and low-level features. An attention fusion module is also introduced to obtain the internal correlation between local, global and contextual information of other feature layers. In order to optimize the anchor configurations, a differential evolution algorithm is employed to reconfigure the ratios and scales of anchors. Experimental results show that the proposed method achieves superior detection performance on the public dataset PASCAL VOC.
KW - Attention mechanism
KW - Deep learning
KW - Object detection
UR - http://www.scopus.com/inward/record.url?scp=85138694564&partnerID=8YFLogxK
U2 - 10.1007/978-3-031-15937-4_12
DO - 10.1007/978-3-031-15937-4_12
M3 - Conference contribution
AN - SCOPUS:85138694564
SN - 9783031159367
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 139
EP - 150
BT - Artificial Neural Networks and Machine Learning - ICANN 2022 - 31st International Conference on Artificial Neural Networks, Proceedings
A2 - Pimenidis, Elias
A2 - Aydin, Mehmet
A2 - Angelov, Plamen
A2 - Jayne, Chrisina
A2 - Papaleonidas, Antonios
PB - Springer Science and Business Media Deutschland GmbH
T2 - 31st International Conference on Artificial Neural Networks, ICANN 2022
Y2 - 6 September 2022 through 9 September 2022
ER -