Dual-YOLO Architecture from Infrared and Visible Images for Object Detection

Chun Bao; Jie Cao; Qun Hao; Yang Cheng; Yaqian Ning; Tianhua Zhao

doi:10.3390/s23062934

School of Optics and Photonics

Research output: Contribution to journal › Article › peer-review

26 Citations (Scopus)

Abstract

With the development of infrared detection technology and growing military remote sensing needs, infrared object detection networks with low false-alarm rates and high detection accuracy have become a research focus. However, because infrared images lack texture information, infrared object detection suffers from a high false detection rate, which reduces detection accuracy. To address these problems, we propose Dual-YOLO, an infrared object detection network that integrates visible image features. To preserve detection speed, we choose You Only Look Once v7 (YOLOv7) as the base framework and design dual feature extraction channels for infrared and visible images. In addition, we develop attention fusion and fusion shuffle modules to reduce the detection error caused by redundant fused feature information. Moreover, we introduce Inception and SE modules to enhance the complementary characteristics of infrared and visible images. Furthermore, we design a fusion loss function to make the network converge quickly during training. Experimental results show that the proposed Dual-YOLO network reaches 71.8% mean Average Precision (mAP) on the DroneVehicle remote sensing dataset and 73.2% mAP on the KAIST pedestrian dataset. The detection accuracy reaches 84.5% on the FLIR dataset. The proposed architecture is expected to be applied in military reconnaissance, unmanned driving, and public safety.

Original language	English
Journal	Sensors
Volume	23
Issue number	6
DOIs	https://doi.org/10.3390/s23062934
Publication status	Published - 8 Mar 2023

Keywords

attention fusion
dual-YOLO
fusion loss
fusion shuffle
infrared object detection

Cite this

@article{06745cccab784cb68325293b9cfd57b1,

title = "Dual-YOLO Architecture from Infrared and Visible Images for Object Detection",

abstract = "With the development of infrared detection technology and the improvement of military remote sensing needs, infrared object detection networks with low false alarms and high detection accuracy have been a research focus. However, due to the lack of texture information, the false detection rate of infrared object detection is high, resulting in reduced object detection accuracy. To solve these problems, we propose an infrared object detection network named Dual-YOLO, which integrates visible image features. To ensure the speed of model detection, we choose the You Only Look Once v7 (YOLOv7) as the basic framework and design the infrared and visible images dual feature extraction channels. In addition, we develop attention fusion and fusion shuffle modules to reduce the detection error caused by redundant fusion feature information. Moreover, we introduce the Inception and SE modules to enhance the complementary characteristics of infrared and visible images. Furthermore, we design the fusion loss function to make the network converge fast during training. The experimental results show that the proposed Dual-YOLO network reaches 71.8% mean Average Precision (mAP) in the DroneVehicle remote sensing dataset and 73.2% mAP in the KAIST pedestrian dataset. The detection accuracy reaches 84.5% in the FLIR dataset. The proposed architecture is expected to be applied in the fields of military reconnaissance, unmanned driving, and public safety.",

keywords = "attention fusion, dual-YOLO, fusion loss, fusion shuffle, infrared object detection",

author = "Chun Bao and Jie Cao and Qun Hao and Yang Cheng and Yaqian Ning and Tianhua Zhao",

year = "2023",

month = mar,

day = "8",

doi = "10.3390/s23062934",

language = "English",

volume = "23",

journal = "Sensors",

issn = "1424-8220",

publisher = "Multidisciplinary Digital Publishing Institute (MDPI)",

number = "6",

}

TY - JOUR

T1 - Dual-YOLO Architecture from Infrared and Visible Images for Object Detection

AU - Bao, Chun

AU - Cao, Jie

AU - Hao, Qun

AU - Cheng, Yang

AU - Ning, Yaqian

AU - Zhao, Tianhua

PY - 2023/3/8

Y1 - 2023/3/8

N2 - With the development of infrared detection technology and the improvement of military remote sensing needs, infrared object detection networks with low false alarms and high detection accuracy have been a research focus. However, due to the lack of texture information, the false detection rate of infrared object detection is high, resulting in reduced object detection accuracy. To solve these problems, we propose an infrared object detection network named Dual-YOLO, which integrates visible image features. To ensure the speed of model detection, we choose the You Only Look Once v7 (YOLOv7) as the basic framework and design the infrared and visible images dual feature extraction channels. In addition, we develop attention fusion and fusion shuffle modules to reduce the detection error caused by redundant fusion feature information. Moreover, we introduce the Inception and SE modules to enhance the complementary characteristics of infrared and visible images. Furthermore, we design the fusion loss function to make the network converge fast during training. The experimental results show that the proposed Dual-YOLO network reaches 71.8% mean Average Precision (mAP) in the DroneVehicle remote sensing dataset and 73.2% mAP in the KAIST pedestrian dataset. The detection accuracy reaches 84.5% in the FLIR dataset. The proposed architecture is expected to be applied in the fields of military reconnaissance, unmanned driving, and public safety.

AB - With the development of infrared detection technology and the improvement of military remote sensing needs, infrared object detection networks with low false alarms and high detection accuracy have been a research focus. However, due to the lack of texture information, the false detection rate of infrared object detection is high, resulting in reduced object detection accuracy. To solve these problems, we propose an infrared object detection network named Dual-YOLO, which integrates visible image features. To ensure the speed of model detection, we choose the You Only Look Once v7 (YOLOv7) as the basic framework and design the infrared and visible images dual feature extraction channels. In addition, we develop attention fusion and fusion shuffle modules to reduce the detection error caused by redundant fusion feature information. Moreover, we introduce the Inception and SE modules to enhance the complementary characteristics of infrared and visible images. Furthermore, we design the fusion loss function to make the network converge fast during training. The experimental results show that the proposed Dual-YOLO network reaches 71.8% mean Average Precision (mAP) in the DroneVehicle remote sensing dataset and 73.2% mAP in the KAIST pedestrian dataset. The detection accuracy reaches 84.5% in the FLIR dataset. The proposed architecture is expected to be applied in the fields of military reconnaissance, unmanned driving, and public safety.

KW - attention fusion

KW - dual-YOLO

KW - fusion loss

KW - fusion shuffle

KW - infrared object detection

UR - http://www.scopus.com/inward/record.url?scp=85151199429&partnerID=8YFLogxK

U2 - 10.3390/s23062934

DO - 10.3390/s23062934

M3 - Article

C2 - 36991645

AN - SCOPUS:85151199429

SN - 1424-8220

VL - 23

JO - Sensors

JF - Sensors

IS - 6

ER -
