Static saliency vs. Dynamic saliency: A comparative study

Tam V. Nguyen; Mengdi Xu; Guangyu Gao; Mohan Kankanhalli; Qi Tian; Shuicheng Yan

doi:10.1145/2502081.2502128

Static saliency vs. Dynamic saliency: A comparative study

Tam V. Nguyen, Mengdi Xu, Guangyu Gao, Mohan Kankanhalli, Qi Tian, Shuicheng Yan

计算机学院

科研成果: 书/报告/会议事项章节 › 会议稿件 › 同行评审

61 引用（Scopus）

摘要

Recently visual saliency has attracted wide attention of researchers in the computer vision and multimedia field. However, most of the visual saliency-related research was conducted on still images for studying static saliency. In this paper, we give a comprehensive comparative study for the first time of dynamic saliency (video shots) and static saliency (key frames of the corresponding video shots), and two key observations are obtained: 1) video saliency is often different from, yet quite related with, image saliency, and 2) camera motions, such as tilting, panning or zooming, affect dynamic saliency significantly. Motivated by these observations, we propose a novel camera motion and image saliency aware model for dynamic saliency prediction. The extensive experiments on two static-vs-dynamic saliency datasets collected by us show that our proposed method outperforms the state-of-the-art methods for dynamic saliency prediction. Finally, we also introduce the application of dynamic saliency prediction for dynamic video captioning, assisting people with hearing impairments to better entertain videos with only off-screen voices, e.g., documentary films, news videos and sports videos.

源语言	英语
主期刊名	MM 2013 - Proceedings of the 2013 ACM Multimedia Conference
页	987-996
页数	10
DOI	https://doi.org/10.1145/2502081.2502128
出版状态	已出版 - 2013
活动	21st ACM International Conference on Multimedia, MM 2013 - Barcelona, 西班牙期限: 21 10月 2013 → 25 10月 2013

出版系列

姓名	MM 2013 - Proceedings of the 2013 ACM Multimedia Conference

会议

会议	21st ACM International Conference on Multimedia, MM 2013
国家/地区	西班牙
市	Barcelona
时期	21/10/13 → 25/10/13

访问文件

10.1145/2502081.2502128

其它文件与链接

链接到 Scopus 的出版物

引用此

Nguyen, T. V., Xu, M., Gao, G., Kankanhalli, M., Tian, Q., & Yan, S. (2013). Static saliency vs. Dynamic saliency: A comparative study. 在 MM 2013 - Proceedings of the 2013 ACM Multimedia Conference (页码 987-996). (MM 2013 - Proceedings of the 2013 ACM Multimedia Conference). https://doi.org/10.1145/2502081.2502128

@inproceedings{44f32307140345edbb792fe03f74d619,

title = "Static saliency vs. Dynamic saliency: A comparative study",

abstract = "Recently visual saliency has attracted wide attention of researchers in the computer vision and multimedia field. However, most of the visual saliency-related research was conducted on still images for studying static saliency. In this paper, we give a comprehensive comparative study for the first time of dynamic saliency (video shots) and static saliency (key frames of the corresponding video shots), and two key observations are obtained: 1) video saliency is often different from, yet quite related with, image saliency, and 2) camera motions, such as tilting, panning or zooming, affect dynamic saliency significantly. Motivated by these observations, we propose a novel camera motion and image saliency aware model for dynamic saliency prediction. The extensive experiments on two static-vs-dynamic saliency datasets collected by us show that our proposed method outperforms the state-of-the-art methods for dynamic saliency prediction. Finally, we also introduce the application of dynamic saliency prediction for dynamic video captioning, assisting people with hearing impairments to better entertain videos with only off-screen voices, e.g., documentary films, news videos and sports videos.",

keywords = "Camera motion, Cinematography, Dynamic saliency, Static saliency",

author = "Nguyen, {Tam V.} and Mengdi Xu and Guangyu Gao and Mohan Kankanhalli and Qi Tian and Shuicheng Yan",

year = "2013",

doi = "10.1145/2502081.2502128",

language = "English",

isbn = "9781450324045",

series = "MM 2013 - Proceedings of the 2013 ACM Multimedia Conference",

pages = "987--996",

booktitle = "MM 2013 - Proceedings of the 2013 ACM Multimedia Conference",

note = "21st ACM International Conference on Multimedia, MM 2013 ; Conference date: 21-10-2013 Through 25-10-2013",

}

Nguyen, TV, Xu, M, Gao, G, Kankanhalli, M, Tian, Q & Yan, S 2013, Static saliency vs. Dynamic saliency: A comparative study. 在 MM 2013 - Proceedings of the 2013 ACM Multimedia Conference. MM 2013 - Proceedings of the 2013 ACM Multimedia Conference, 页码 987-996, 21st ACM International Conference on Multimedia, MM 2013, Barcelona, 西班牙, 21/10/13. https://doi.org/10.1145/2502081.2502128

TY - GEN

T1 - Static saliency vs. Dynamic saliency

T2 - 21st ACM International Conference on Multimedia, MM 2013

AU - Nguyen, Tam V.

AU - Xu, Mengdi

AU - Gao, Guangyu

AU - Kankanhalli, Mohan

AU - Tian, Qi

AU - Yan, Shuicheng

PY - 2013

Y1 - 2013

N2 - Recently visual saliency has attracted wide attention of researchers in the computer vision and multimedia field. However, most of the visual saliency-related research was conducted on still images for studying static saliency. In this paper, we give a comprehensive comparative study for the first time of dynamic saliency (video shots) and static saliency (key frames of the corresponding video shots), and two key observations are obtained: 1) video saliency is often different from, yet quite related with, image saliency, and 2) camera motions, such as tilting, panning or zooming, affect dynamic saliency significantly. Motivated by these observations, we propose a novel camera motion and image saliency aware model for dynamic saliency prediction. The extensive experiments on two static-vs-dynamic saliency datasets collected by us show that our proposed method outperforms the state-of-the-art methods for dynamic saliency prediction. Finally, we also introduce the application of dynamic saliency prediction for dynamic video captioning, assisting people with hearing impairments to better entertain videos with only off-screen voices, e.g., documentary films, news videos and sports videos.

AB - Recently visual saliency has attracted wide attention of researchers in the computer vision and multimedia field. However, most of the visual saliency-related research was conducted on still images for studying static saliency. In this paper, we give a comprehensive comparative study for the first time of dynamic saliency (video shots) and static saliency (key frames of the corresponding video shots), and two key observations are obtained: 1) video saliency is often different from, yet quite related with, image saliency, and 2) camera motions, such as tilting, panning or zooming, affect dynamic saliency significantly. Motivated by these observations, we propose a novel camera motion and image saliency aware model for dynamic saliency prediction. The extensive experiments on two static-vs-dynamic saliency datasets collected by us show that our proposed method outperforms the state-of-the-art methods for dynamic saliency prediction. Finally, we also introduce the application of dynamic saliency prediction for dynamic video captioning, assisting people with hearing impairments to better entertain videos with only off-screen voices, e.g., documentary films, news videos and sports videos.

KW - Camera motion

KW - Cinematography

KW - Dynamic saliency

KW - Static saliency

UR - http://www.scopus.com/inward/record.url?scp=84887455190&partnerID=8YFLogxK

U2 - 10.1145/2502081.2502128

DO - 10.1145/2502081.2502128

M3 - Conference contribution

AN - SCOPUS:84887455190

SN - 9781450324045

T3 - MM 2013 - Proceedings of the 2013 ACM Multimedia Conference

SP - 987

EP - 996

BT - MM 2013 - Proceedings of the 2013 ACM Multimedia Conference

Y2 - 21 October 2013 through 25 October 2013

ER -

Static saliency vs. Dynamic saliency: A comparative study

摘要

出版系列

会议

访问文件

其它文件与链接

指纹

引用此