Abstract
The key issue in cross-modal retrieval using cross-modal hashing is how to maximize the consistency of the semantic relationships among heterogeneous media data. This paper presents a self-supervised deep semantics-preserving hashing network (UDSPH) that generates compact hash codes in an end-to-end architecture. Two modality-specific hashing networks are first trained to generate the hash codes and high-level features. The semantic relationship between the different modalities is then measured using cross-modal attention mechanisms that maximize the preservation of the local semantic correlations. Multi-label semantic information in the training data is used to simultaneously guide the training of the two modality-specific hashing networks through self-supervised adversarial learning. This constructs a deep semantic hashing network that preserves the semantic associations in the global view and improves the discriminative capability of the generated hash codes. Tests on three widely used benchmark datasets verify the effectiveness of this method.
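As a rough illustration of the pipeline the abstract describes, the PyTorch sketch below wires together two modality-specific hashing networks, a cross-modal attention block for local semantic alignment, and a discriminator for adversarial alignment of the two code distributions. All names (`ModalityHashNet`, `CrossModalAttention`, `Discriminator`), layer sizes, and losses are illustrative assumptions, not the authors' implementation or training objective.

```python
# Minimal sketch, assuming PyTorch; shapes and module names are hypothetical.
import torch
import torch.nn as nn
import torch.nn.functional as F

K = 64  # hash code length (assumption)

class ModalityHashNet(nn.Module):
    """Maps one modality's features to high-level features and K-bit codes."""
    def __init__(self, in_dim: int, hid_dim: int = 512):
        super().__init__()
        self.backbone = nn.Sequential(
            nn.Linear(in_dim, hid_dim), nn.ReLU(),
            nn.Linear(hid_dim, hid_dim), nn.ReLU(),
        )
        self.hash_head = nn.Linear(hid_dim, K)

    def forward(self, x):
        feat = self.backbone(x)                   # high-level features
        code = torch.tanh(self.hash_head(feat))   # relaxed binary codes in (-1, 1)
        return feat, code

class CrossModalAttention(nn.Module):
    """Scaled dot-product attention from one modality's features to the
    other's, used here to align local semantic correlations."""
    def __init__(self, dim: int = 512):
        super().__init__()
        self.q, self.k, self.v = (nn.Linear(dim, dim) for _ in range(3))
        self.scale = dim ** -0.5

    def forward(self, a, b):
        attn = F.softmax(self.q(a) @ self.k(b).T * self.scale, dim=-1)
        return attn @ self.v(b)                   # b-features attended by a

class Discriminator(nn.Module):
    """Adversary that tries to tell image codes from text codes."""
    def __init__(self):
        super().__init__()
        self.net = nn.Sequential(nn.Linear(K, 128), nn.ReLU(), nn.Linear(128, 1))

    def forward(self, code):
        return self.net(code)

# One illustrative step on random stand-in features (8 image-text pairs).
img_net, txt_net = ModalityHashNet(4096), ModalityHashNet(1024)
attn, disc = CrossModalAttention(), Discriminator()
img, txt = torch.randn(8, 4096), torch.randn(8, 1024)

img_feat, img_code = img_net(img)
txt_feat, txt_code = txt_net(txt)

# Local semantic alignment: pull attended cross-modal features together.
align_loss = F.mse_loss(attn(img_feat, txt_feat), img_feat)

# Adversarial alignment: the discriminator labels image codes 1 and text
# codes 0; the hashing networks would be trained (in a separate step) to
# fool it so the two code distributions become indistinguishable.
d_loss = F.binary_cross_entropy_with_logits(disc(img_code), torch.ones(8, 1)) \
       + F.binary_cross_entropy_with_logits(disc(txt_code), torch.zeros(8, 1))
print(align_loss.item(), d_loss.item())
```

In this sketch, multi-label supervision (which the paper uses to guide both networks) would enter as an additional classification or similarity loss on the codes; it is omitted here to keep the example self-contained.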
| Translated title of the contribution | Self-supervised deep semantics-preserving Hashing for cross-modal retrieval |
| --- | --- |
| Original language | Traditional Chinese |
| Pages (from-to) | 1442-1449 |
| Number of pages | 8 |
| Journal | Qinghua Daxue Xuebao/Journal of Tsinghua University |
| Volume | 62 |
| Issue number | 9 |
| DOI | |
| Publication status | Published - 15 Sep 2022 |
Keywords
- adversarial learning
- cross-modal attention
- deep cross-modal Hashing
- semantic Hashing