Multi-Scale Object Detection Using Feature Fusion Recalibration Network

Ziyuan Guo; Weimin Zhang; Zhenshuo Liang; Yongliang Shi; Qiang Huang

doi:10.1109/ACCESS.2020.2980737

Multi-Scale Object Detection Using Feature Fusion Recalibration Network

Ziyuan Guo, Weimin Zhang^*, Zhenshuo Liang, Yongliang Shi, Qiang Huang

^*Corresponding author for this work

School of Mechatronical Engineering

Research output: Contribution to journal › Article › peer-review

4 Citations (Scopus)

Abstract

In this paper, the object detection algorithm based on deep learning running on the robot platform is studied and optimized. The p has high requirements for the detection efficiency and scale invariance of the algorithm. In order to improve the detection accuracy on all scales and keep the balance between speed and accuracy, we propose the following methods: Aiming at the problem of low detection accuracy of object detection algorithm for scale changing objects, the traditional image pyramid technology of computer vision is used to verify its effectiveness in improving the detection accuracy of the algorithm for scale changing objects. Then, by embedding the image pyramid into the network, the memory consumption caused by the traditional pyramid is reduced, and the detection accuracy of the algorithm for different scale objects is improved. A new feature fusion recalibration structure is designed. Feature fusion can fuse the low-level location information and high-level semantic information. The recalibration assigns the importance weight of the channel of the feature maps. This structure can effectively improve the detection accuracy of the algorithm at all scales without losing too much speed. We apply these two structures to YOLO. The accuracy of the improved algorithm has a significant improvement and the algorithm can run at 16 FPS on a TITAN Xp GPU.

Original language	English
Article number	9035489
Pages (from-to)	51664-51673
Number of pages	10
Journal	IEEE Access
Volume	8
DOIs	https://doi.org/10.1109/ACCESS.2020.2980737
Publication status	Published - 2020

Keywords

Multi-scale object detection
convolutional neural network
feature fusion
feature recalibration

Access to Document

10.1109/ACCESS.2020.2980737

Cite this

@article{4d28737b72354e65a4c0b157a52fa165,

title = "Multi-Scale Object Detection Using Feature Fusion Recalibration Network",

abstract = "In this paper, the object detection algorithm based on deep learning running on the robot platform is studied and optimized. The p has high requirements for the detection efficiency and scale invariance of the algorithm. In order to improve the detection accuracy on all scales and keep the balance between speed and accuracy, we propose the following methods: Aiming at the problem of low detection accuracy of object detection algorithm for scale changing objects, the traditional image pyramid technology of computer vision is used to verify its effectiveness in improving the detection accuracy of the algorithm for scale changing objects. Then, by embedding the image pyramid into the network, the memory consumption caused by the traditional pyramid is reduced, and the detection accuracy of the algorithm for different scale objects is improved. A new feature fusion recalibration structure is designed. Feature fusion can fuse the low-level location information and high-level semantic information. The recalibration assigns the importance weight of the channel of the feature maps. This structure can effectively improve the detection accuracy of the algorithm at all scales without losing too much speed. We apply these two structures to YOLO. The accuracy of the improved algorithm has a significant improvement and the algorithm can run at 16 FPS on a TITAN Xp GPU.",

keywords = "Multi-scale object detection, convolutional neural network, feature fusion, feature recalibration",

author = "Ziyuan Guo and Weimin Zhang and Zhenshuo Liang and Yongliang Shi and Qiang Huang",

note = "Publisher Copyright: {\textcopyright} 2013 IEEE.",

year = "2020",

doi = "10.1109/ACCESS.2020.2980737",

language = "English",

volume = "8",

pages = "51664--51673",

journal = "IEEE Access",

issn = "2169-3536",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

}

TY - JOUR

T1 - Multi-Scale Object Detection Using Feature Fusion Recalibration Network

AU - Guo, Ziyuan

AU - Zhang, Weimin

AU - Liang, Zhenshuo

AU - Shi, Yongliang

AU - Huang, Qiang

PY - 2020

Y1 - 2020

N2 - In this paper, the object detection algorithm based on deep learning running on the robot platform is studied and optimized. The p has high requirements for the detection efficiency and scale invariance of the algorithm. In order to improve the detection accuracy on all scales and keep the balance between speed and accuracy, we propose the following methods: Aiming at the problem of low detection accuracy of object detection algorithm for scale changing objects, the traditional image pyramid technology of computer vision is used to verify its effectiveness in improving the detection accuracy of the algorithm for scale changing objects. Then, by embedding the image pyramid into the network, the memory consumption caused by the traditional pyramid is reduced, and the detection accuracy of the algorithm for different scale objects is improved. A new feature fusion recalibration structure is designed. Feature fusion can fuse the low-level location information and high-level semantic information. The recalibration assigns the importance weight of the channel of the feature maps. This structure can effectively improve the detection accuracy of the algorithm at all scales without losing too much speed. We apply these two structures to YOLO. The accuracy of the improved algorithm has a significant improvement and the algorithm can run at 16 FPS on a TITAN Xp GPU.

AB - In this paper, the object detection algorithm based on deep learning running on the robot platform is studied and optimized. The p has high requirements for the detection efficiency and scale invariance of the algorithm. In order to improve the detection accuracy on all scales and keep the balance between speed and accuracy, we propose the following methods: Aiming at the problem of low detection accuracy of object detection algorithm for scale changing objects, the traditional image pyramid technology of computer vision is used to verify its effectiveness in improving the detection accuracy of the algorithm for scale changing objects. Then, by embedding the image pyramid into the network, the memory consumption caused by the traditional pyramid is reduced, and the detection accuracy of the algorithm for different scale objects is improved. A new feature fusion recalibration structure is designed. Feature fusion can fuse the low-level location information and high-level semantic information. The recalibration assigns the importance weight of the channel of the feature maps. This structure can effectively improve the detection accuracy of the algorithm at all scales without losing too much speed. We apply these two structures to YOLO. The accuracy of the improved algorithm has a significant improvement and the algorithm can run at 16 FPS on a TITAN Xp GPU.

KW - Multi-scale object detection

KW - convolutional neural network

KW - feature fusion

KW - feature recalibration

UR - http://www.scopus.com/inward/record.url?scp=85082533525&partnerID=8YFLogxK

U2 - 10.1109/ACCESS.2020.2980737

DO - 10.1109/ACCESS.2020.2980737

M3 - Article

AN - SCOPUS:85082533525

SN - 2169-3536

VL - 8

SP - 51664

EP - 51673

JO - IEEE Access

JF - IEEE Access

M1 - 9035489

ER -

Multi-Scale Object Detection Using Feature Fusion Recalibration Network

Abstract

Keywords

Access to Document

Other files and links

Fingerprint

Cite this