Deep Heterogeneous Multi-Task Metric Learning for Visual Recognition and Retrieval

Shikang Gan, Yong Luo, Yonggang Wen, Tongliang Liu, Han Hu*

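*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

2 citations (Scopus)

Abstract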

How to estimate the distance between data instances is a fundamental problem in many artificial intelligence algorithms and is critical in diverse multimedia applications. A major challenge in the estimation is how to find an appropriate distance function when labeled data are insufficient for a certain task. Multi-task metric learning (MTML) can alleviate this data deficiency issue by learning distance metrics for multiple tasks together and sharing information between the different tasks. Recently, heterogeneous MTML (HMTML) has attracted much attention since it can handle multiple tasks with varied data representations. A major drawback of the current HMTML approaches is that only linear transformations are learned to connect different domains. This is suboptimal since the correlations between different domains may be very complex and highly nonlinear. To overcome this drawback, we propose a deep heterogeneous MTML (DHMTML) method, in which a nonlinear mapping is learned for each task by using a deep neural network. The correlations of different domains are exploited by sharing some parameters at the top layers of different networks. More importantly, the auto-encoder scheme and the adversarial learning mechanism are incorporated to help exploit the feature correlations within and between different tasks, and the task-specific properties are preserved by learning additional task-specific layers together with the common layers. Experiments demonstrate that the proposed method consistently outperforms single-task deep metric learning algorithms and other HMTML approaches on several benchmark datasets.
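
The parameter-sharing idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes one task-specific bottom layer per task and a single top layer shared across tasks, uses random weights in place of learned ones, and leaves out the auto-encoder, the adversarial component, and all training. The names `TaskNetwork`, `make_layer`, `apply_layer`, and `distance`, and the layer sizes, are hypothetical.

```python
import math
import random

def make_layer(n_in, n_out, rng):
    """Random weight matrix (n_out rows, n_in columns) standing in for learned weights."""
    scale = 1.0 / math.sqrt(n_in)
    return [[rng.uniform(-scale, scale) for _ in range(n_in)] for _ in range(n_out)]

def apply_layer(weights, x):
    """Linear map (no bias) followed by tanh, so the overall mapping is nonlinear."""
    return [math.tanh(sum(w * v for w, v in zip(row, x))) for row in weights]

class TaskNetwork:
    """Task-specific bottom layer followed by a top layer shared across tasks."""

    def __init__(self, input_dim, hidden_dim, shared_top, rng):
        self.bottom = make_layer(input_dim, hidden_dim, rng)  # private to this task
        self.top = shared_top  # the same object is held by every task network

    def embed(self, x):
        return apply_layer(self.top, apply_layer(self.bottom, x))

def distance(net, x1, x2):
    """Euclidean distance between two instances after the nonlinear mapping."""
    return math.dist(net.embed(x1), net.embed(x2))

rng = random.Random(0)
HIDDEN_DIM, EMBED_DIM = 8, 4  # hypothetical sizes, chosen only for illustration
shared_top = make_layer(HIDDEN_DIM, EMBED_DIM, rng)

# Two tasks whose inputs have different dimensionality (heterogeneous features).
task_a = TaskNetwork(input_dim=10, hidden_dim=HIDDEN_DIM, shared_top=shared_top, rng=rng)
task_b = TaskNetwork(input_dim=6, hidden_dim=HIDDEN_DIM, shared_top=shared_top, rng=rng)

xa1 = [rng.random() for _ in range(10)]
xa2 = [rng.random() for _ in range(10)]
xb1 = [rng.random() for _ in range(6)]

d_a = distance(task_a, xa1, xa2)  # distance between two instances of task A
emb_b = task_b.embed(xb1)         # task B lands in the same embedding space
```

Because both networks hold the same `shared_top` object, any update to it would affect every task, which is one simple way to share information between tasks whose input features differ in dimension.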

Original language: English
Title of host publication: MM 2020 - Proceedings of the 28th ACM International Conference on Multimedia
Publisher: Association for Computing Machinery, Inc
Pages: 1837-1845
Number of pages: 9
ISBN (electronic): 9781450379885
DOI
Publication status: Published - 12 Oct 2020
Event: 28th ACM International Conference on Multimedia, MM 2020 - Virtual, Online, United States
Duration: 12 Oct 2020 to 16 Oct 2020

Publication series

Name: MM 2020 - Proceedings of the 28th ACM International Conference on Multimedia

Conference

Conference: 28th ACM International Conference on Multimedia, MM 2020
Country/Territory: United States
Virtual, Online
Period: 12/10/20 to 16/10/20
