Collaborative detection of infrared targets in the overlapping region of four-aperture fields of view based on the residual local pyramid attention network

  • Siyuan Zhao*
  • , Lin Luo
  • , Weiqi Jin
  • *Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

In this paper, aiming at the requirement of model lightweighting due to the excessively large input size of four-aperture infrared images, a Residual Local Pyramid Attention Network (RLPANet) for infrared small target detection is proposed based on the Dense Nested Attention Network. The number of parameters and computational complexity of model inference are reduced through the multiplexing and fusion of global-local attention and the segmentation of feature maps. Meanwhile, the detection rate (Pd) and intersection over union (IoU) levels are maintained through the optimization of residual connections and the redesign of the Weighted Dice-BCE (WDB) loss function that refers to the human eye imaging mode. During the detection stage, the internal and external parameter matrices after the calibration of the four-aperture camera are used to establish the connection of overlapping field-of-view images. After obtaining the target center coordinates of the sub-field of view, the target center positions of the remaining fields of view are calculated through reprojection. The full-pixel detection of the sub-field of view is transformed into a 50×50 detection window, thus greatly reducing the image size required for target detection. This method solves the problem of slow model inference speed caused by the excessively large size of four-aperture infrared images, and verifies the time efficiency and stability of the lightweight module on multiple infrared small target datasets and image sequences collected from actual outdoor scenes.

Original languageEnglish
Title of host publicationAOPC 2025
Subtitle of host publicationInfrared and Terahertz Technology and Applications
EditorsXue Li, Xin Tang
PublisherSPIE
ISBN (Electronic)9781510698581
DOIs
Publication statusPublished - 28 Oct 2025
Externally publishedYes
EventAOPC 2025: Infrared and Terahertz Technology and Applications - Beijing, China
Duration: 24 Jun 202527 Jun 2025

Publication series

NameProceedings of SPIE - The International Society for Optical Engineering
Volume13957
ISSN (Print)0277-786X
ISSN (Electronic)1996-756X

Conference

ConferenceAOPC 2025: Infrared and Terahertz Technology and Applications
Country/TerritoryChina
CityBeijing
Period24/06/2527/06/25

Keywords

  • Attention Network Structure
  • Four-aperture Bionic Compound Eye
  • Infrared Small Target Detection
  • Residual Local Pyramid Module

Fingerprint

Dive into the research topics of 'Collaborative detection of infrared targets in the overlapping region of four-aperture fields of view based on the residual local pyramid attention network'. Together they form a unique fingerprint.

Cite this