Abstract
In the last decade, considerable effort has been devoted to building discriminative image representations. Among these works, the vector of locally aggregated descriptors (VLAD) has proven to be an effective one. However, most VLAD-based methods rely only on SIFT descriptors detected at interest points and therefore capture limited content information, which deteriorates the representation ability. In this work, we propose a novel framework that boosts VLAD with weighted fusion of local descriptors (WF-VLAD), encoding more discriminative cues and achieving higher performance. To obtain an image representation that contains sufficient detail, our approach fuses SIFT descriptors sampled densely (dense SIFT) with those detected at interest points (detected SIFT) during aggregation. Furthermore, we assign each detected SIFT descriptor a weight measured by saliency analysis, so that salient descriptors receive relatively high importance. The proposed method thus captures sufficient image content information and highlights the important image regions. Finally, experiments on publicly available datasets demonstrate that our approach achieves competitive performance in retrieval tasks.
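The abstract does not give the exact WF-VLAD formulation, but the aggregation it describes can be sketched as weighted VLAD over the fused descriptor set. The sketch below is a minimal illustration: the function name, the codebook size, the unit weight assigned to dense SIFT, and the random placeholder inputs are all assumptions, not the authors' implementation.

```python
# Minimal sketch of saliency-weighted VLAD aggregation (NumPy).
# Assumptions: dense SIFT gets unit weight, detected SIFT gets a saliency weight,
# and a K=64 codebook is used; the paper's actual WF-VLAD details may differ.
import numpy as np

def weighted_vlad(descriptors, weights, centers):
    """Aggregate D-dim descriptors into a K*D vector of weighted residuals."""
    k, d = centers.shape
    # Assign each descriptor to its nearest codebook center.
    dists = np.linalg.norm(descriptors[:, None, :] - centers[None, :, :], axis=2)
    assignments = np.argmin(dists, axis=1)
    vlad = np.zeros((k, d))
    for i, c in enumerate(assignments):
        # Accumulate the residual, scaled by the descriptor's importance weight.
        vlad[c] += weights[i] * (descriptors[i] - centers[c])
    vlad = vlad.flatten()
    # Signed square-root (power-law) and L2 normalization, standard for VLAD.
    vlad = np.sign(vlad) * np.sqrt(np.abs(vlad))
    norm = np.linalg.norm(vlad)
    return vlad / norm if norm > 0 else vlad

# Fuse dense SIFT (unit weight, assumed) with saliency-weighted detected SIFT.
dense_sift = np.random.rand(500, 128)       # placeholder dense SIFT descriptors
detected_sift = np.random.rand(200, 128)    # placeholder detected SIFT descriptors
saliency_weights = np.random.rand(200)      # e.g. saliency value at each keypoint
centers = np.random.rand(64, 128)           # illustrative codebook

all_desc = np.vstack([dense_sift, detected_sift])
all_weights = np.concatenate([np.ones(len(dense_sift)), saliency_weights])
image_repr = weighted_vlad(all_desc, all_weights, centers)
```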
Original language | English |
---|---|
Pages (from-to) | 11835-11855 |
Number of pages | 21 |
Journal | Multimedia Tools and Applications |
Volume | 78 |
Issue number | 9 |
Publication status | Published - 1 May 2019 |
Keywords
- Image representation
- Image retrieval
- Saliency weighting
- VLAD