Enhancing few-shot object detection through mixing and separating tuning strategies

Zhengquan Piao, Fuyong Feng, Ruina Dang, Wenzheng Wang, Shichao Zhou, Yuqi Han*

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract

Abstract: Few-shot object detection (FSOD) aims to detect novel objects using only a limited number of labelled examples. Existing fine-tuning-based FSOD methods typically face challenges in effectively transferring knowledge from base to novel categories, often leading to confusion between them. To address this issue, we propose a novel mixing and separating tuning (MST) framework. In the mixing tuning stage, we pretune the model using transitional samples between base and novel categories to reduce bias towards the base category. Subsequently, in the separating tuning stage, we further fine-tune the model on novel category samples with an auxiliary discrimination network and an energy-based separation strategy. Extensive experimental results on PASCAL VOC and Microsoft COCO benchmarks demonstrate that our MST framework significantly outperforms existing state-of-the-art methods, achieving better discrimination and separation between base and novel categories. The proposed approach not only improves detection performance on novel categories but also maintains high accuracy on base categories. The code is available at: https://github.com/zqpiao/MS_FSOD.

Original languageEnglish
JournalVisual Computer
DOIs
Publication statusAccepted/In press - 2025
Externally publishedYes

Keywords

  • Energy-based model
  • Few-shot learning
  • Fine-tuning
  • Object detection

Cite this