Towards low bit rate mobile visual search with multiple-channel coding

Rongrong Ji*, Ling Yu Duan, Jie Chen, Hongxun Yao, Yong Rui, Shih Fu Chang, Wen Gao

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

46 Citations (Scopus)

Abstract

In this paper, we propose a multiple-channel coding scheme to extract compact visual descriptors for low bit rate mobile visual search. Unlike previous visual search scenarios that send the query image, we make use of the ever-growing mobile computational capability to extract compact visual descriptors directly at the mobile end. Meanwhile, stepping forward from state-of-the-art compact descriptor extraction, we exploit the rich contextual cues at the mobile end (such as GPS tags for mobile visual search and 2D barcodes or RFID tags for mobile product search), together with the visual statistics of the reference database, to learn multiple coding channels. We therefore describe the query with one of many forms of high-dimensional visual signature, which is subsequently mapped to one or more channels and compressed. The compression function within each channel is learnt with a novel robust PCA scheme, with specific consideration given to preserving the retrieval ranking capability of the original signature. We have deployed our scheme on both iPhone4 and HTC DESIRE 7 to search ten million landmark images in a low bit rate setting. Quantitative comparisons to the state of the art demonstrate our significant advantages in descriptor compactness (with orders of magnitude improvement) and retrieval mAP in mobile landmark, product, and CD/book cover search.
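The channel idea in the abstract can be illustrated with a minimal sketch. It is not the authors' method: it substitutes ordinary PCA (computed by power iteration) for the paper's robust PCA with ranking preservation, and it assumes that a channel is selected by a single context key such as a GPS region identifier. The function names `learn_channel`, `encode`, and `decode`, and the dictionary layout of a channel, are hypothetical.

```python
import math
import random


def mean_vector(rows):
    """Per-dimension mean of a list of equal-length signatures."""
    n = len(rows)
    return [sum(col) / n for col in zip(*rows)]


def covariance(rows, mean):
    """Covariance matrix of the signatures around the given mean."""
    d, n = len(mean), len(rows)
    cov = [[0.0] * d for _ in range(d)]
    for r in rows:
        c = [x - m for x, m in zip(r, mean)]
        for i in range(d):
            for j in range(d):
                cov[i][j] += c[i] * c[j] / n
    return cov


def top_components(cov, k, iters=200, seed=0):
    """Top-k eigenvectors via power iteration with deflation (plain PCA)."""
    rng = random.Random(seed)
    d = len(cov)
    cov = [row[:] for row in cov]  # work on a copy
    comps = []
    for _ in range(k):
        v = [rng.uniform(-1, 1) for _ in range(d)]
        for _ in range(iters):
            w = [sum(cov[i][j] * v[j] for j in range(d)) for i in range(d)]
            norm = math.sqrt(sum(x * x for x in w)) or 1.0
            v = [x / norm for x in w]
        # Rayleigh quotient gives the eigenvalue for the converged vector.
        lam = sum(v[i] * sum(cov[i][j] * v[j] for j in range(d)) for i in range(d))
        comps.append(v)
        # Deflate so the next pass finds the next component.
        for i in range(d):
            for j in range(d):
                cov[i][j] -= lam * v[i] * v[j]
    return comps


def learn_channel(signatures, k):
    """Assumption: one channel = mean + k PCA directions of its training set."""
    mean = mean_vector(signatures)
    return {"mean": mean, "components": top_components(covariance(signatures, mean), k)}


def encode(signature, context, channels):
    """Pick the channel by context key, then keep only k projection coefficients."""
    ch = channels[context]
    centered = [x - m for x, m in zip(signature, ch["mean"])]
    coeffs = [sum(c * x for c, x in zip(comp, centered)) for comp in ch["components"]]
    return context, coeffs


def decode(code, channels):
    """Server-side approximate reconstruction from the compact code."""
    context, coeffs = code
    ch = channels[context]
    out = ch["mean"][:]
    for a, comp in zip(coeffs, ch["components"]):
        for i in range(len(out)):
            out[i] += a * comp[i]
    return out
```

In this sketch the mobile end would transmit only the context key and `k` coefficients instead of the full signature. A real system would also quantize the coefficients and, as the abstract stresses, learn the projection so that retrieval ranking is preserved, which this example does not attempt.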

Original language: English
Title of host publication: MM'11 - Proceedings of the 2011 ACM Multimedia Conference and Co-Located Workshops
Pages: 573-582
Number of pages: 10
Publication status: Published - 2011
Externally published
Event: 19th ACM International Conference on Multimedia ACM Multimedia 2011, MM'11 - Scottsdale, AZ, United States
Duration: 28 Nov 2011 to 1 Dec 2011

Publication series

Name: MM'11 - Proceedings of the 2011 ACM Multimedia Conference and Co-Located Workshops

Conference

Conference: 19th ACM International Conference on Multimedia ACM Multimedia 2011, MM'11
Country/Territory: United States
Scottsdale, AZ
Period: 28/11/11 to 1/12/11
