Static saliency vs. Dynamic saliency: A comparative study

Tam V. Nguyen, Mengdi Xu, Guangyu Gao, Mohan Kankanhalli, Qi Tian, Shuicheng Yan

Research output: Chapter in Book/Report/Conference proceeding; Conference contribution; peer-reviewed

60 Citations (Scopus)

Abstract

Recently, visual saliency has attracted wide attention from researchers in the computer vision and multimedia fields. However, most visual saliency research has been conducted on still images, studying static saliency. In this paper, we present the first comprehensive comparative study of dynamic saliency (video shots) and static saliency (key frames of the corresponding video shots), and obtain two key observations: 1) video saliency is often different from, yet quite related to, image saliency, and 2) camera motions, such as tilting, panning or zooming, affect dynamic saliency significantly. Motivated by these observations, we propose a novel camera motion and image saliency aware model for dynamic saliency prediction. Extensive experiments on two static-vs-dynamic saliency datasets that we collected show that our proposed method outperforms state-of-the-art methods for dynamic saliency prediction. Finally, we introduce an application of dynamic saliency prediction to dynamic video captioning, helping people with hearing impairments better enjoy videos with only off-screen voices, e.g., documentary films, news videos and sports videos.

Original language: English
Title of host publication: MM 2013 - Proceedings of the 2013 ACM Multimedia Conference
Pages: 987-996
Number of pages: 10
Publication status: Published - 2013
Event: 21st ACM International Conference on Multimedia, MM 2013 - Barcelona, Spain
Duration: 21 Oct 2013 to 25 Oct 2013

Publication series

Name: MM 2013 - Proceedings of the 2013 ACM Multimedia Conference

Conference

Conference: 21st ACM International Conference on Multimedia, MM 2013
Country/Territory: Spain
City: Barcelona
Period: 21/10/13 to 25/10/13
