TY - GEN
T1 - Leveraging GAN Priors for Few-Shot Part Segmentation
AU - Han, Mengya
AU - Zheng, Heliang
AU - Wang, Chaoyue
AU - Luo, Yong
AU - Hu, Han
AU - Du, Bo
N1 - Publisher Copyright:
© 2022 ACM.
PY - 2022/10/10
Y1 - 2022/10/10
N2 - Few-shot part segmentation aims to separate different parts of an object given only a few annotated samples. Due to the challenge of limited data, existing works mainly focus on learning classifiers over pre-Trained features, failing to learn task-specific features for part segmentation. In this paper, we propose to learn task-specific features in a "pre-Training"-"fine-Tuning"paradigm. We conduct prompt designing to reduce the gap between the pre-Train task (i.e., image generation) and the downstream task (i.e., part segmentation), so that the GAN priors for generation can be leveraged for segmentation. This is achieved by projecting part segmentation maps into the RGB space and conducting interpolation between RGB segmentation maps and original images. Specifically, we design a fine-Tuning strategy to progressively tune an image generator into a segmentation generator, where the supervision of the generator varying from images to segmentation maps by interpolation. Moreover, we propose a two-stream architecture, i.e., a segmentation stream to generate task-specific features, and an image stream to provide spatial constraints. The image stream can be regarded as a self-supervised auto-encoder, and this enables our model to benefit from large-scale support images. Overall, this work is an attempt to explore the internal relevance between generation tasks and perception tasks by prompt designing. Extensive experiments show that our model can achieve state-of-The-Art performance on several part segmentation datasets.
AB - Few-shot part segmentation aims to separate different parts of an object given only a few annotated samples. Due to the challenge of limited data, existing works mainly focus on learning classifiers over pre-Trained features, failing to learn task-specific features for part segmentation. In this paper, we propose to learn task-specific features in a "pre-Training"-"fine-Tuning"paradigm. We conduct prompt designing to reduce the gap between the pre-Train task (i.e., image generation) and the downstream task (i.e., part segmentation), so that the GAN priors for generation can be leveraged for segmentation. This is achieved by projecting part segmentation maps into the RGB space and conducting interpolation between RGB segmentation maps and original images. Specifically, we design a fine-Tuning strategy to progressively tune an image generator into a segmentation generator, where the supervision of the generator varying from images to segmentation maps by interpolation. Moreover, we propose a two-stream architecture, i.e., a segmentation stream to generate task-specific features, and an image stream to provide spatial constraints. The image stream can be regarded as a self-supervised auto-encoder, and this enables our model to benefit from large-scale support images. Overall, this work is an attempt to explore the internal relevance between generation tasks and perception tasks by prompt designing. Extensive experiments show that our model can achieve state-of-The-Art performance on several part segmentation datasets.
KW - few-shot part segmentation
KW - fine-Tuning
KW - gan
KW - multi-Task learning
UR - https://www.scopus.com/pages/publications/85151128504
U2 - 10.1145/3503161.3548398
DO - 10.1145/3503161.3548398
M3 - Conference contribution
AN - SCOPUS:85151128504
T3 - MM 2022 - Proceedings of the 30th ACM International Conference on Multimedia
SP - 1339
EP - 1347
BT - MM 2022 - Proceedings of the 30th ACM International Conference on Multimedia
PB - Association for Computing Machinery, Inc
T2 - 30th ACM International Conference on Multimedia, MM 2022
Y2 - 10 October 2022 through 14 October 2022
ER -