TY - JOUR
T1 - Camouflage soldier object detection network based on the attention mechanism and pyramidal feature shrinking
AU - Peng, Yiguo
AU - Wang, Jianzhong
AU - Yu, Zibo
AU - You, Yu
AU - Sun, Yong
N1 - Publisher Copyright:
© The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2024.
PY - 2024
Y1 - 2024
N2 - Due to the high level of information similarity between camouflage soldier objects and their background, traditional deep learning-based object detection networks encounter distinct error detection rates and miss detection rates when attempting to detect camouflage soldiers. To address these challenges, we proposed a camouflage soldier object detection network (AFSNet) based on attention mechanism and multi-scale feature fusion strategy. We employed an attention module to enhance the network’s capability for feature extraction. Furthermore, we proposed a novel strategy for multi-scale feature fusion based on pyramidal feature shrinking, aiming to mitigate interference caused by interpolation and prevent information loss resulting from pooling during the process of feature fusion. Moreover, we introduced a novel information handle module that enhances the network’s capability for feature fusion by regulating the information transmission pathway. Experiments demonstrated that our network exhibits a better camouflage object detection performance than state-of-arts networks. Compared to YOLOv7, our network can achieve 93% AP, which is increased by 6.7% with almost no computation overhead.
AB - Due to the high level of information similarity between camouflage soldier objects and their background, traditional deep learning-based object detection networks encounter distinct error detection rates and miss detection rates when attempting to detect camouflage soldiers. To address these challenges, we proposed a camouflage soldier object detection network (AFSNet) based on attention mechanism and multi-scale feature fusion strategy. We employed an attention module to enhance the network’s capability for feature extraction. Furthermore, we proposed a novel strategy for multi-scale feature fusion based on pyramidal feature shrinking, aiming to mitigate interference caused by interpolation and prevent information loss resulting from pooling during the process of feature fusion. Moreover, we introduced a novel information handle module that enhances the network’s capability for feature fusion by regulating the information transmission pathway. Experiments demonstrated that our network exhibits a better camouflage object detection performance than state-of-arts networks. Compared to YOLOv7, our network can achieve 93% AP, which is increased by 6.7% with almost no computation overhead.
KW - Attention mechanism
KW - Camouflage object detection
KW - Pyramidal feature shrinking
UR - http://www.scopus.com/inward/record.url?scp=85186417316&partnerID=8YFLogxK
U2 - 10.1007/s11042-024-18618-w
DO - 10.1007/s11042-024-18618-w
M3 - Article
AN - SCOPUS:85186417316
SN - 1380-7501
JO - Multimedia Tools and Applications
JF - Multimedia Tools and Applications
ER -