TY - JOUR
T1 - Hierarchical context-agnostic network with contrastive feature diversity for one-shot semantic segmentation
AU - Fang, Zhiyuan
AU - Gao, Guangyu
AU - Zhang, Zekang
AU - Zhang, Anqi
N1 - Publisher Copyright:
© 2023
PY - 2023/2
Y1 - 2023/2
N2 - One-shot semantic segmentation aims at distinguishing pixels of an unseen category from the background, using merely one annotated image from the same category. However, most previous works neglect the feature diversity of foreground and the context information of background by using the masked average pooling. To solve these issues, we propose the Hierarchical Context-Agnostic Network (HCNet). It mainly includes two modules: (1) a Hierarchical Pyramid Supportive (HPS) module that generate the hierarchical supportive prototypes from coarse to fine to ensure feature diversity, and (2) a Background Exclusion Supportive (BES) module that explicitly introduces the contrastive information from the background for more precise category features. We conduct extensive experiments on Pascal-5i and COCO-20i to evaluate the performance of HCNet. HCNet achieves the mIoU score of 62.1% on Pascal-5i and 40.7% on COCO-20i and outperforms other works for the challenging one-shot segmentation, which has proved the efficiency of the whole network. Code is available at https://github.com/fangzy97/hcnet.
AB - One-shot semantic segmentation aims at distinguishing pixels of an unseen category from the background, using merely one annotated image from the same category. However, most previous works neglect the feature diversity of foreground and the context information of background by using the masked average pooling. To solve these issues, we propose the Hierarchical Context-Agnostic Network (HCNet). It mainly includes two modules: (1) a Hierarchical Pyramid Supportive (HPS) module that generate the hierarchical supportive prototypes from coarse to fine to ensure feature diversity, and (2) a Background Exclusion Supportive (BES) module that explicitly introduces the contrastive information from the background for more precise category features. We conduct extensive experiments on Pascal-5i and COCO-20i to evaluate the performance of HCNet. HCNet achieves the mIoU score of 62.1% on Pascal-5i and 40.7% on COCO-20i and outperforms other works for the challenging one-shot segmentation, which has proved the efficiency of the whole network. Code is available at https://github.com/fangzy97/hcnet.
KW - Background exclusion
KW - Few-shot learning
KW - Hierarchical pyramid
KW - Semantic segmentation
KW - Unsupervised clustering
UR - http://www.scopus.com/inward/record.url?scp=85145978134&partnerID=8YFLogxK
U2 - 10.1016/j.jvcir.2023.103754
DO - 10.1016/j.jvcir.2023.103754
M3 - Article
AN - SCOPUS:85145978134
SN - 1047-3203
VL - 90
JO - Journal of Visual Communication and Image Representation
JF - Journal of Visual Communication and Image Representation
M1 - 103754
ER -