Target detection based on multi-scale feature fusion and cross-channel interactive attention mechanism

Chenyang Zhao, Yong Song*, Xin Yang, Ya Zhou*, Jinqi Yang

*此作品的通讯作者

科研成果: 期刊稿件会议文章同行评审

3 引用 (Scopus)
Plum Print visual indicator of research metrics
  • Citations
    • Citation Indexes: 3
  • Captures
    • Readers: 1
see details

摘要

Aiming at the problems of complex background, target scale change and small target in aerial image detection, we propose a YOLOv5 target detection algorithm based on multi-scale feature fusion and cross-channel interactive attention mechanism. Including: M-PPM (Multi-scale pyramid pooling module) is designed as a replacement for the SPP (Spatial Pyramid Pooling) structure in YOLOv5, so as to make full use of different scale features to fuse global feature information; CCA (Cross-channel interactive attention mechanism) is designed to realize cross-channel information interaction and utilization, and enhance the network's capability to generalize and fusion efficiency of small target features. Bi-directional Feature Pyramid Network (BiFPN) is utilized to solve scale difference problem in multi-target detection. The proposed algorithm's experimental results is 2.3 % and 1.8 % higher than YOLOv5 on the VisDrone and UAVDT aerial data sets, respectively.

源语言英语
文章编号012046
期刊Journal of Physics: Conference Series
2562
1
DOI
出版状态已出版 - 2023
活动2023 3rd International Conference on Artificial Intelligence and Industrial Technology Applications, AIITA 2023 - Suzhou, 中国
期限: 24 3月 202326 3月 2023

指纹

探究 'Target detection based on multi-scale feature fusion and cross-channel interactive attention mechanism' 的科研主题。它们共同构成独一无二的指纹。

引用此

Zhao, C., Song, Y., Yang, X., Zhou, Y., & Yang, J. (2023). Target detection based on multi-scale feature fusion and cross-channel interactive attention mechanism. Journal of Physics: Conference Series, 2562(1), 文章 012046. https://doi.org/10.1088/1742-6596/2562/1/012046