Video retrieval based on deep convolutional neural network

Yajiao Dong, Jianguo Li

科研成果: 书/报告/会议事项章节会议稿件同行评审

14 引用 (Scopus)

摘要

Recently, with the enormous growth of online videos, fast video retrieval research has received increasing attention. As an extension of image hashing techniques, traditional video hashing methods mainly depend on hand-crafted features and transform the real-valued features into binary hash codes. As videos provide far more diverse and complex visual information than images, extracting features from videos is much more challenging than that from images. Therefore, high-level semantic features to represent videos are needed rather than low-level hand-crafted methods. In this paper, a deep convolutional neural network is proposed to extract high-level semantic features and a binary hash function is then integrated into this framework to achieve an end-to-end optimization. Particularly, our approach also combines triplet loss function which preserves the relative similarity and difference of videos and classification loss function as the optimization objective. Experiments have been performed on two public datasets and the results demonstrate the superiority of our proposed method compared with other state-of-the-art video retrieval methods.

源语言英语
主期刊名ICMSSP 2018 - 2018 3rd International Conference on Multimedia Systems and Signal Processing
出版商Association for Computing Machinery
12-16
页数5
ISBN(电子版)9781450364577
DOI
出版状态已出版 - 28 4月 2018
活动3rd International Conference on Multimedia Systems and Signal Processing, ICMSSP 2018 - Shenzhen, 中国
期限: 28 4月 201830 4月 2018

出版系列

姓名ACM International Conference Proceeding Series

会议

会议3rd International Conference on Multimedia Systems and Signal Processing, ICMSSP 2018
国家/地区中国
Shenzhen
时期28/04/1830/04/18

指纹

探究 'Video retrieval based on deep convolutional neural network' 的科研主题。它们共同构成独一无二的指纹。

引用此