跳到主要导航 跳到搜索 跳到主要内容

Can Categories and Attributes Be Learned in a Multi-Task Way?

  • Shu Yang
  • , Yaowei Wang
  • , Yemin Shi
  • , Zesong Fei*
  • *此作品的通讯作者

科研成果: 期刊稿件文章同行评审

摘要

Intuitively, we can think of object recognition and attribute prediction as correlated tasks. However, they appeared to conflict in a simple two-branch multi-task framework (a category branch and an attribute branch) with a shared backbone part (convolutional layers and pooling layers). The performance dropped along with the iterative training steps. This result might have been caused by the noncoherent feature distribution between the object recognition features and the attribute prediction features. Recognition features are discriminative for different categories and are not sensitive to intracategory variations, while attribute prediction features are discriminative for different attributes, although these attributes can exist in objects from the same category. Thus, a conflict occurs when we force the network to learn the two kinds of distinct features simultaneously. To address this problem, we propose the category and attribute prediction network (CAP-net), in which a category-constrained attribute prediction structure is introduced to transfer the object recognition knowledge and avoid the conflict between two features. The CAP-net parameters can be learned easily with a regularization method. Extensive experimental results show that the CAP-net outperforms the state-of-the-art methods on object recognition and attribute prediction tasks.

源语言英语
文章编号8723592
页(从-至)3194-3204
页数11
期刊IEEE Transactions on Multimedia
21
12
DOI
出版状态已出版 - 12月 2019

指纹

探究 'Can Categories and Attributes Be Learned in a Multi-Task Way?' 的科研主题。它们共同构成独一无二的指纹。

引用此