Hashing based Efficient Inference for Image-Text Matching

Rong Cheng Tu, Lei Ji*, Huaishao Luo, Botian Shi, Heyan Huang, Nan Duan, Xian Ling Mao

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

8 Citations (Scopus)

Abstract

Image-text matching is a popular research topic that bridges vision and language through semantic understanding. Recent works mainly focus on exploring the interactions between images and sentences to improve performance, without considering inference efficiency. Specifically, for large-scale databases, it is unacceptable to run such time-consuming interaction mechanisms between a query (text/image) and every candidate datapoint (image/text) in the whole retrieval set during inference. To tackle this problem, we propose a novel hashing-based efficient inference module called HEI, which can be plugged into existing frameworks to speed up the inference step without reducing retrieval performance. In detail, HEI learns to map the original datapoints into short binary hash codes that coarsely preserve the heterologous matching relationship. Thus, in the inference phase, the proposed HEI module uses the hash codes to quickly select a few candidate datapoints from the retrieval set for a given query. Then, the image-text matching model finely ranks the candidate set to find the matching datapoint. Extensive experiments on two widely used benchmarks, MS-COCO and Flickr30k, with four baseline methods demonstrate the efficiency and effectiveness of our proposed HEI module.
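The two-stage inference described in the abstract (a coarse selection using hash codes, followed by fine ranking with the matching model) can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not say how codes are learned or compared, so the sketch assumes that codes are already available as integers and that they are compared by Hamming distance, a common choice in hashing-based retrieval. The names `hamming_distance`, `coarse_select`, `retrieve`, and `fine_score` are hypothetical, and `fine_score` stands in for the expensive image-text matching model.

```python
from typing import Callable, List, Sequence


def hamming_distance(a: int, b: int) -> int:
    """Count the bits that differ between two hash codes stored as ints."""
    return bin(a ^ b).count("1")


def coarse_select(query_code: int, candidate_codes: Sequence[int], k: int) -> List[int]:
    """Return the indices of the k candidates closest to the query.

    Assumption: closeness is measured by Hamming distance on the codes.
    sorted() is stable, so ties keep their original index order.
    """
    order = sorted(
        range(len(candidate_codes)),
        key=lambda i: hamming_distance(query_code, candidate_codes[i]),
    )
    return order[:k]


def retrieve(
    query_code: int,
    candidate_codes: Sequence[int],
    k: int,
    fine_score: Callable[[int], float],
) -> List[int]:
    """Coarse-to-fine retrieval: shortlist by hash code, then re-rank.

    fine_score is a hypothetical stand-in for the slow matching model;
    it is only called on the k shortlisted candidates.
    """
    shortlist = coarse_select(query_code, candidate_codes, k)
    return sorted(shortlist, key=fine_score, reverse=True)


# Toy data: four 8-bit candidate codes and one query code (all made up).
codes = [0b11110000, 0b11110001, 0b00001111, 0b10110000]
query = 0b11110000

# Made-up fine scores, indexed by candidate; record which ones get evaluated.
toy_scores = {0: 0.2, 1: 0.9, 2: 0.99, 3: 0.5}
scored: List[int] = []


def toy_fine_score(index: int) -> float:
    scored.append(index)
    return toy_scores[index]


ranked = retrieve(query, codes, k=3, fine_score=toy_fine_score)
print(ranked)  # prints [1, 3, 0]
```

In this toy run, candidate 2 is never passed to the fine scorer because its code is far from the query code, which is where the inference-time saving comes from. It also shows the trade-off of a coarse filter: a candidate that the fine model would have scored highly can be dropped if its hash code is a poor match.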

Original language: English
Title of host publication: Findings of the Association for Computational Linguistics
Subtitle of host publication: ACL-IJCNLP 2021
Editors: Chengqing Zong, Fei Xia, Wenjie Li, Roberto Navigli
Publisher: Association for Computational Linguistics (ACL)
Pages: 743-752
Number of pages: 10
ISBN (electronic): 9781954085541
Publication status: Published - 2021
Event: Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021 - Virtual, Online
Duration: 1 Aug 2021 → 6 Aug 2021

Publication series

Name: Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021

Conference

Conference: Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021
Location: Virtual, Online
Period: 1/08/21 → 6/08/21
