ASA-Net: Deep representation learning between object silhouette and attributes

Shu Yang, Jing Wang*, Lidong Yang, Zesong Fei

*Corresponding author for this work

Research output: Contribution to journal › Article › peer-review

Abstract

Object silhouettes and semantic attributes have each been shown to be effective auxiliary supervision for image recognition. This motivates a novel zero-shot recognition model, the Attribute-Segmentation-Attribute network (ASA-Net), which jointly performs object segmentation, attribute prediction, and recognition in a multi-task learning manner. First, a feature extraction module is pre-trained on smooth attribute and category annotations. This module then initializes the feature encoding module of a multi-scale segmentation CNN, which generates coarse-to-fine object silhouettes. Finally, the segments are multiplied with the original image to obtain regions of interest, and semantic features of these regions are extracted and combined to predict attributes. The predicted attributes are then projected into the category space to accomplish the zero-shot recognition task. Experimental results on two public benchmarks indicate that ASA-Net outperforms the baseline and existing methods in attribute prediction and segmentation, as well as in unseen object recognition. The source code is publicly available online (https://github.com/YsSue/ASA-Net.git).
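The last two steps of the pipeline described in the abstract (multiplying a segment with the image, then projecting predicted attributes into the category space) can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the toy class signatures, and the use of cosine similarity as the projection are all assumptions made for illustration. The abstract does not specify how the projection is computed.

```python
import math

def mask_image(image, silhouette):
    """Scale each pixel of an H x W x C image by its H x W silhouette weight."""
    return [
        [[channel * weight for channel in pixel]
         for pixel, weight in zip(img_row, sil_row)]
        for img_row, sil_row in zip(image, silhouette)
    ]

def cosine(u, v):
    """Cosine similarity between two equal-length vectors (0.0 if either is zero)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0

def zero_shot_classify(attr_pred, class_signatures):
    """Pick the class whose attribute signature best matches the prediction.

    Assumption: nearest-signature matching by cosine similarity stands in
    for the paper's projection into the category space.
    """
    scores = {name: cosine(attr_pred, sig) for name, sig in class_signatures.items()}
    return max(scores, key=scores.get), scores

# Toy 2 x 2 RGB image and a soft silhouette in [0, 1] (hypothetical values).
image = [[[1.0, 1.0, 1.0], [1.0, 1.0, 1.0]],
         [[1.0, 1.0, 1.0], [1.0, 1.0, 1.0]]]
silhouette = [[1.0, 0.0],
              [0.5, 0.0]]
roi = mask_image(image, silhouette)

# Hypothetical per-class attribute signatures for two unseen classes.
class_signatures = {"zebra": [1.0, 0.0, 1.0], "dolphin": [0.0, 1.0, 0.0]}
attr_pred = [0.9, 0.2, 0.7]  # stand-in for the network's attribute output
label, scores = zero_shot_classify(attr_pred, class_signatures)
```

In this sketch the unseen classes never appear during training. They are described only by their attribute signatures, which is what allows recognition without labeled examples of those classes.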

Original language: English
Pages (from-to): 189-199
Number of pages: 11
Journal: Neurocomputing
Volume: 503
DOI
Publication status: Published - 7 Sep 2022
