Skip to main navigation Skip to search Skip to main content

Leveraging GAN Priors for Few-Shot Part Segmentation

  • Mengya Han
  • , Heliang Zheng
  • , Chaoyue Wang
  • , Yong Luo*
  • , Han Hu
  • , Bo Du*
  • *Corresponding author for this work
  • Wuhan University
  • JD Explore Academy

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Few-shot part segmentation aims to separate different parts of an object given only a few annotated samples. Due to the challenge of limited data, existing works mainly focus on learning classifiers over pre-Trained features, failing to learn task-specific features for part segmentation. In this paper, we propose to learn task-specific features in a "pre-Training"-"fine-Tuning"paradigm. We conduct prompt designing to reduce the gap between the pre-Train task (i.e., image generation) and the downstream task (i.e., part segmentation), so that the GAN priors for generation can be leveraged for segmentation. This is achieved by projecting part segmentation maps into the RGB space and conducting interpolation between RGB segmentation maps and original images. Specifically, we design a fine-Tuning strategy to progressively tune an image generator into a segmentation generator, where the supervision of the generator varying from images to segmentation maps by interpolation. Moreover, we propose a two-stream architecture, i.e., a segmentation stream to generate task-specific features, and an image stream to provide spatial constraints. The image stream can be regarded as a self-supervised auto-encoder, and this enables our model to benefit from large-scale support images. Overall, this work is an attempt to explore the internal relevance between generation tasks and perception tasks by prompt designing. Extensive experiments show that our model can achieve state-of-The-Art performance on several part segmentation datasets.

Original languageEnglish
Title of host publicationMM 2022 - Proceedings of the 30th ACM International Conference on Multimedia
PublisherAssociation for Computing Machinery, Inc
Pages1339-1347
Number of pages9
ISBN (Electronic)9781450392037
DOIs
Publication statusPublished - 10 Oct 2022
Event30th ACM International Conference on Multimedia, MM 2022 - Lisboa, Portugal
Duration: 10 Oct 202214 Oct 2022

Publication series

NameMM 2022 - Proceedings of the 30th ACM International Conference on Multimedia

Conference

Conference30th ACM International Conference on Multimedia, MM 2022
Country/TerritoryPortugal
CityLisboa
Period10/10/2214/10/22

Keywords

  • few-shot part segmentation
  • fine-Tuning
  • gan
  • multi-Task learning

Fingerprint

Dive into the research topics of 'Leveraging GAN Priors for Few-Shot Part Segmentation'. Together they form a unique fingerprint.

Cite this