Single-Frame Infrared Small Target Detection With Dynamic Multidimensional Convolution

Shichao Zhou, Zekai Zhang, Yingrui Zhao, Wenzheng Wang*, Zhuowei Wang

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract

Mainly resulting from remote imaging, the target of interest in infrared imagery tends to occupy very few pixels with faint radiation value. The absence of discriminative spatial features of infrared small targets challenges traditional single-frame detectors that rely on handcrafted filter engineering to amplify local contrast. Recently, emerging deep convolutional network (DCN)-based detectors use elaborate multiscale spatial contexts representation to “semantically reason” the small and dim infrared target in pixel level. However, the multiple spatial convolution-downsampling operation adopted by such leading methods could cause the loss of target appearance information during the initial feature encoding stage. To further enhance the low-level feature representation capacity, we advocate the insight of traditional matching filter and propose a novel pixel-adaptive convolution kernel modulated by multidimensional contexts (i.e., dynamic multidimensional convolution, DMConv). Precisely, the DMConv is refined by three collaborative and indispensable attention functions that focus on spatial layout, channel, and kernel number of convolution kernel, respectively, so as to effectively mine, highlight, and enrich fine-grained spatial features with moderate computational burden. Extensive experiments conducted on two real-world infrared single-frame image datasets, i.e., SIRST and Infrared Small Target Detection (IRSTD)-1k, favorably demonstrate the effectiveness of the proposed method and obtain consistent performance improvements over other state-of-the-art (SOTA) detectors.

Original languageEnglish
JournalIEEE Geoscience and Remote Sensing Letters
Volume22
DOIs
Publication statusPublished - 2025
Externally publishedYes

Keywords

  • Adaptive filters
  • convolutional neural network
  • infrared image
  • multidimensional information fusion

Fingerprint

Dive into the research topics of 'Single-Frame Infrared Small Target Detection With Dynamic Multidimensional Convolution'. Together they form a unique fingerprint.

Cite this