面向跨模态检索的自监督深度语义保持Hash

Translated title of the contribution: Self-supervised deep semantics-preserving Hashing for cross-modal retrieval

Bo Lu, Xiaodong Duan, Ye Yuan

Research output: Contribution to journal › Article › peer-review

Abstract

The key issue in cross-modal retrieval based on cross-modal Hashing is how to maximize the consistency of the semantic relationships among heterogeneous media data. This paper presents a self-supervised deep semantics-preserving Hashing network (UDSPH) that generates compact Hash codes in an end-to-end architecture. Two modality-specific Hashing networks are first trained to generate the Hash codes and high-level features. The semantic relationship between the different modalities is then measured using a cross-modal attention mechanism that maximizes preservation of the local semantic correlation. Multi-label semantic information in the training data is used to simultaneously guide the training of the two modality-specific Hashing networks through self-supervised adversarial learning. This constructs a deep semantic Hashing network that preserves semantic associations in the global view and improves the discriminative capability of the generated Hash codes. Tests on three widely used benchmark datasets verify the effectiveness of the method.
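To make the two-tower setup described in the abstract more concrete, the following is a minimal sketch, not the authors' implementation: it only illustrates the general idea of two modality-specific Hashing networks whose continuous outputs are relaxed toward binary Hash codes and binarized at retrieval time. The choice of PyTorch, the layer sizes, the tanh relaxation, the 64-bit code length, and the toy feature dimensions are all assumptions made for illustration; the paper's attention and adversarial-learning components are not shown.

# A minimal sketch under the assumptions stated above, NOT the paper's network.
import torch
import torch.nn as nn

class ModalityHashNet(nn.Module):
    """Maps one modality's features to K-dimensional hash-like codes in (-1, 1)."""
    def __init__(self, in_dim: int, code_len: int = 64, hidden: int = 512):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(in_dim, hidden),
            nn.ReLU(inplace=True),
            nn.Linear(hidden, code_len),
            nn.Tanh(),  # continuous relaxation of binary codes for gradient-based training
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.net(x)

# Two modality-specific towers; the 4096-d image and 1386-d text feature sizes are toy values.
image_net = ModalityHashNet(in_dim=4096, code_len=64)
text_net = ModalityHashNet(in_dim=1386, code_len=64)

images = torch.randn(8, 4096)   # toy batch of image features
texts = torch.randn(8, 1386)    # toy batch of text features

img_codes = image_net(images)   # continuous codes, shape (8, 64)
txt_codes = text_net(texts)

# Binarize only at retrieval time; training would use the continuous codes so that
# similarity-preserving losses can backpropagate through both towers.
img_hash = torch.sign(img_codes)
txt_hash = torch.sign(txt_codes)
print(img_hash.shape, txt_hash.shape)

In this kind of scheme, cross-modal retrieval then reduces to comparing img_hash and txt_hash by Hamming distance, which is why pushing the two towers toward semantically consistent codes is the central training objective.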

Original language: Chinese (Traditional)
Pages (from-to): 1442-1449
Number of pages: 8
Journal: Qinghua Daxue Xuebao/Journal of Tsinghua University
Volume: 62
Issue number: 9
DOIs
Publication status: Published - 15 Sept 2022
