TY - JOUR
T1 - A Multi-View features hinged siamese U-Net for image Co-segmentation
AU - Li, Yushuo
AU - Liu, Xiabi
AU - Gong, Xiaopeng
AU - Wang, Murong
N1 - Publisher Copyright:
© 2020, Springer Science+Business Media, LLC, part of Springer Nature.
PY - 2021/6
Y1 - 2021/6
N2 - This paper proposes a new U-shape structure to extract multi-view features from different images which is incorporated into the network that can be trained end-to-end for image co-segmentation task. The multi-view features integrate global correlations between images, so we can segment the common objects in different images from the features directly. Before getting the multi-view features, we extract the deep features of input images through two weights-shared streams. Then we get pixel-level similarity maps through a similarity layer from deep features. The whole architecture is a Siamese U-net hinged by multi-view features, called iMFNet for short. We further introduce Dice loss and employ both positive and negative examples to train the whole network. Furthermore, a learnable conditional random field (CRF) layer is added to iMFNet for more accurate results. Using the training data from MSRC and PASCAL VOC 2012 datasets, the iMFNet achieves the state-of-the-art performance on the Internet datasets and the competitive performance on the iCoseg datasets.
AB - This paper proposes a new U-shape structure to extract multi-view features from different images which is incorporated into the network that can be trained end-to-end for image co-segmentation task. The multi-view features integrate global correlations between images, so we can segment the common objects in different images from the features directly. Before getting the multi-view features, we extract the deep features of input images through two weights-shared streams. Then we get pixel-level similarity maps through a similarity layer from deep features. The whole architecture is a Siamese U-net hinged by multi-view features, called iMFNet for short. We further introduce Dice loss and employ both positive and negative examples to train the whole network. Furthermore, a learnable conditional random field (CRF) layer is added to iMFNet for more accurate results. Using the training data from MSRC and PASCAL VOC 2012 datasets, the iMFNet achieves the state-of-the-art performance on the Internet datasets and the competitive performance on the iCoseg datasets.
KW - Image co-segmentation
KW - Multi-view features
KW - Siamese U-Net
UR - http://www.scopus.com/inward/record.url?scp=85082859062&partnerID=8YFLogxK
U2 - 10.1007/s11042-020-08794-w
DO - 10.1007/s11042-020-08794-w
M3 - Article
AN - SCOPUS:85082859062
SN - 1380-7501
VL - 80
SP - 22965
EP - 22985
JO - Multimedia Tools and Applications
JF - Multimedia Tools and Applications
IS - 15
ER -