A Benchmark and Frequency Compression Method for Infrared Few-Shot Object Detection

Ruiheng Zhang, Biwen Yang, Lixin Xu, Yan Huang, Xiaofeng Xu, Qi Zhang*, Zhizhuo Jiang*, Yu Liu

*Corresponding author for this work

Research output: Contribution to journal, article, peer-reviewed

3 citations (Scopus)

Abstract

Infrared few-shot object detection (IFSOD) aims to detect infrared objects with limited labeled examples. Current infrared datasets, however, suffer from limited diversity in object types and classes, hindering robust evaluation of model generalization on novel classes. To systematically assess dataset quality, we propose metrics for class diversity, instance variability, and object density. By integrating three widely used infrared datasets, we construct the first dataset specifically tailored for IFSOD, increasing instance density to 4.8 (a 1.1 improvement) and expanding the number of classes to 18 (a 5-class increase) compared to the source datasets. Furthermore, frequency analysis of spatial features reveals that sparse annotations introduce spectral bias in the frequency domain. Directly transforming spatial features to the frequency domain, however, mixes background noise with object features, causing spectral leakage and impairing the learning of discriminative features for novel classes. To address these issues, we propose the frequency compression few-shot detection (FC-fsd) method, which incorporates a frequency compression (FC) module. The FC module leverages Discrete Cosine Transform (DCT) within localized windows to reduce spectral leakage and enhance feature clarity. With minimal additional computational overhead, FC-fsd significantly outperforms state-of-the-art methods, achieving nAP50 scores of 28.57 (+13.37) and 35.63 (+2.59) in 1-shot and 2-shot settings, respectively. Our dataset is published at https://github.com/RuihengZhang/IFSOD-dataset.
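The idea of applying the DCT within localized windows can be illustrated with a minimal sketch. This is not the authors' FC module: the function names `dct_matrix` and `windowed_dct_compress`, the non-overlapping 8x8 windows, and the choice to keep only the low-frequency corner of each window are all assumptions made for illustration. It shows only the general technique of transforming each window separately, so that content in one window cannot spread into the coefficients of another.

```python
import numpy as np


def dct_matrix(n: int) -> np.ndarray:
    """Return the orthonormal DCT-II basis matrix of size n x n."""
    k = np.arange(n)[:, None]  # frequency index (rows)
    i = np.arange(n)[None, :]  # spatial index (columns)
    mat = np.sqrt(2.0 / n) * np.cos(np.pi * (2 * i + 1) * k / (2 * n))
    mat[0, :] = np.sqrt(1.0 / n)  # DC row needs its own scale to be orthonormal
    return mat


def windowed_dct_compress(feat: np.ndarray, window: int = 8, keep: int = 4) -> np.ndarray:
    """Hypothetical helper: blockwise DCT, low-frequency truncation, inverse DCT.

    feat   : 2D feature map whose sides are multiples of `window` (assumption)
    window : side length of each non-overlapping local window (assumption)
    keep   : number of low-frequency rows/columns retained per window (assumption)
    """
    h, w = feat.shape
    if h % window or w % window:
        raise ValueError("feature map sides must be multiples of the window size")

    d = dct_matrix(window)

    # Split the map into non-overlapping windows: (h/window, w/window, window, window).
    blocks = feat.reshape(h // window, window, w // window, window).transpose(0, 2, 1, 3)

    # 2D DCT of every window at once (matmul broadcasts over the leading axes).
    coeffs = d @ blocks @ d.T

    # Zero out everything except the low-frequency corner of each window.
    mask = np.zeros((window, window))
    mask[:keep, :keep] = 1.0
    coeffs = coeffs * mask

    # Inverse DCT per window, then stitch the windows back together.
    recon = d.T @ coeffs @ d
    return recon.transpose(0, 2, 1, 3).reshape(h, w)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    feature_map = rng.standard_normal((32, 32))
    compressed = windowed_dct_compress(feature_map, window=8, keep=4)
    print(compressed.shape)
```

Because the basis is orthonormal, keeping all coefficients (`keep == window`) reproduces the input, and truncation can only reduce the energy of the map. How the published method selects or weights coefficients is not stated in the abstract.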

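Original language: English
Article number: 5001711
Journal: IEEE Transactions on Geoscience and Remote Sensing
Volume: 63
DOI: https://doi.org/10.1109/TGRS.2025.3540945
Publication status: Published - 2025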

Cite this

Zhang, R., Yang, B., Xu, L., Huang, Y., Xu, X., Zhang, Q., Jiang, Z., & Liu, Y. (2025). A Benchmark and Frequency Compression Method for Infrared Few-Shot Object Detection. IEEE Transactions on Geoscience and Remote Sensing, 63, Article 5001711. https://doi.org/10.1109/TGRS.2025.3540945