Towards low bit rate mobile visual search with multiple-channel coding

Rongrong Ji*, Ling Yu Duan, Jie Chen, Hongxun Yao, Yong Rui, Shih Fu Chang, Wen Gao

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution > peer-review

46 Citations (Scopus)

Abstract

In this paper, we propose a multiple-channel coding scheme to extract compact visual descriptors for low bit rate mobile visual search. Unlike previous visual search scenarios that send the query image, we make use of the ever-growing mobile computational capability to extract compact visual descriptors directly at the mobile end. Meanwhile, stepping forward from state-of-the-art compact descriptor extraction methods, we exploit the rich contextual cues at the mobile end (such as GPS tags for mobile visual search and 2D barcodes or RFID tags for mobile product search), together with the visual statistics of the reference database, to learn multiple coding channels. We therefore describe the query with one of many forms of high-dimensional visual signature, which is subsequently mapped to one or more channels and compressed. The compression function within each channel is learnt with a novel robust PCA scheme, with specific consideration given to preserving the retrieval ranking capability of the original signature. We have deployed our scheme on both iPhone4 and HTC DESIRE 7 to search ten million landmark images in a low bit rate setting. Quantitative comparisons with state-of-the-art methods demonstrate significant advantages in descriptor compactness (with orders-of-magnitude improvement) and retrieval mAP in mobile landmark, product, and CD/book cover search.

Original language: English
Title of host publication: MM'11 - Proceedings of the 2011 ACM Multimedia Conference and Co-Located Workshops
Pages: 573-582
Number of pages: 10
Publication status: Published - 2011
Externally published: Yes
Event: 19th ACM International Conference on Multimedia, ACM Multimedia 2011, MM'11 - Scottsdale, AZ, United States
Duration: 28 Nov 2011 to 1 Dec 2011

Publication series

Name: MM'11 - Proceedings of the 2011 ACM Multimedia Conference and Co-Located Workshops

Conference

Conference: 19th ACM International Conference on Multimedia, ACM Multimedia 2011, MM'11
Country/Territory: United States
City: Scottsdale, AZ
Period: 28/11/11 to 1/12/11

Keywords

  • Compact descriptor
  • Contextual learning
  • Data compression
  • Mobile visual search
  • Wireless communication
