Implicit relative attribute enabled cross-modality hashing for face image-video retrieval

Peng Dai, Xue Wang*, Weihang Zhang, Pengbo Zhang, Wei You

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

2 Citations (Scopus)

Abstract

Face image-video retrieval refers to retrieving videos of a specific person with image query or searching face images of one person by using a video clip query. It has attracted much attention for broad applications like suspect tracking and identifying. This paper proposes a novel implicit relative attribute enabled cross-modality hashing (IRAH) method for large-scale face image-video retrieval. To cope with large-scale data, the proposed IRAH method facilitates fast cross-modality retrieval through embedding two entirely heterogeneous spaces, i.e., face images in Euclidean space and face videos on a Riemannian manifold, into a unified compact Hamming space. In order to resolve the semantic gap, IRAH maps the original low-level kernelized features to discriminative high-level implicit relative attributes. Therefore, the retrieval accuracy can be improved by leveraging both the label information across different modalities and the semantic structure obtained from the implicit relative attributes in each modality. To evaluate the proposed method, we conduct extensive experiments on two publicly available databases, i.e., the Big Bang Theory (BBT) and Buffy the Vampire Slayer (BVS). The experimental results demonstrate the superiority of the proposed method over different state-of-the-art cross-modality hashing methods. The performance gains are especially significant in the case that the hash code length is 8 bits, up to 12% improvements over the second best method among tested methods.

Original languageEnglish
Pages (from-to)23547-23577
Number of pages31
JournalMultimedia Tools and Applications
Volume77
Issue number18
DOIs
Publication statusPublished - 1 Sept 2018
Externally publishedYes

Keywords

  • Cross-modality similarity search
  • Face image-video retrieval
  • Hashing
  • Human attribute

Fingerprint

Dive into the research topics of 'Implicit relative attribute enabled cross-modality hashing for face image-video retrieval'. Together they form a unique fingerprint.

Cite this