YOLOv8-FA: Multimodal Fusion Image Object Detection Algorithm

Zhitao Hong, Jing Li*, Wenyu Hu, Junzheng Wang

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

To address the issues of weak information expression capability of single-modal images and the limited receptive field of traditional object detection algorithms, we propose the YOLOv8-FA algorithm based on the YOLOv8 algorithm. In the backbone feature extraction network, the FasterNetBlock module is introduced to reduce the redundancy of model feature channels. Additionally, a depthwise separable convolution module is incorporated to enhance the receptive field of the network model's feature extraction. The loss function is improved to WIou to enhance the network's adaptive adjustment capability to the quality of input images. The training dataset is obtained by fusing visible light and infrared images based on the LLVIP dataset. Subjective and objective experimental results indicate that the proposed algorithm effectively improves the model's detection performance, reduces false detections and missed detections, and achieves object detection tasks in multimodal fused images.

Original languageEnglish
Title of host publication2024 IEEE 3rd Industrial Electronics Society Annual On-Line Conference, ONCON 2024
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9798331540319
DOIs
Publication statusPublished - 2024
Externally publishedYes
Event3rd IEEE Industrial Electronics Society Annual On-Line Conference, ONCON 2024 - Beijing, China
Duration: 8 Dec 202410 Dec 2024

Publication series

Name2024 IEEE 3rd Industrial Electronics Society Annual On-Line Conference, ONCON 2024

Conference

Conference3rd IEEE Industrial Electronics Society Annual On-Line Conference, ONCON 2024
Country/TerritoryChina
CityBeijing
Period8/12/2410/12/24

Keywords

  • Depthwise separable convolution
  • Multimodal images
  • Object detection

Fingerprint

Dive into the research topics of 'YOLOv8-FA: Multimodal Fusion Image Object Detection Algorithm'. Together they form a unique fingerprint.

Cite this