A computable visual attention model for video skimming

Longfei Zhang; Yuanda Cao; Gangyi Ding; Yong Wang

doi:10.1109/ISM.2008.117

A computable visual attention model for video skimming

Longfei Zhang^*, Yuanda Cao, Gangyi Ding, Yong Wang

^*Corresponding author for this work

School of Computer Science and Technology

Beijing Institute of Technology

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

17 Citations (Scopus)

Abstract

A novel computable visual attention model (VAM) for video skimming algorithm is proposed. Videos bear more motion features than images do. Objects in videos cause different attention effects, depending on various situations, positions, motions, and appearances. The static visual attention model is based on spatial distribution, visual object, or both, but fall short in solving temporal attention effects. The proposed VAM model adopts the alive-time(AT) of a visual object as a new descriptor to improve the accuracy of locating highlight in a video clip, then produces better video skimming results. The model is represented by a set of descriptors to be computable and provide a generic framework for video analysis. The temporal variations of attention value in a video clip are weighted by non-linear Chi-square distribution. Then the highlights of the frames in thevideo are represented by the attention window (AW) and the attention values of the visual objects (AOs) are tracked and used to generate the attention curve of the video. At last, a video skimming strategy is used to select the highlights of the video by analyzing the attention curve. The experiment result shows that the proposed model makes the skimming results 15%~25% shorter than previous methods.

Original language	English
Title of host publication	Proceedings - 10th IEEE International Symposium on Multimedia, ISM 2008
Pages	667-672
Number of pages	6
DOIs	https://doi.org/10.1109/ISM.2008.117
Publication status	Published - 2008
Event	10th IEEE International Symposium on Multimedia, ISM 2008 - Berkeley, CA, United States Duration: 15 Dec 2008 → 17 Dec 2008

Publication series

Name	Proceedings - 10th IEEE International Symposium on Multimedia, ISM 2008

Conference

Conference	10th IEEE International Symposium on Multimedia, ISM 2008
Country/Territory	United States
City	Berkeley, CA
Period	15/12/08 → 17/12/08

Access to Document

10.1109/ISM.2008.117

Cite this

Zhang, L., Cao, Y., Ding, G., & Wang, Y. (2008). A computable visual attention model for video skimming. In Proceedings - 10th IEEE International Symposium on Multimedia, ISM 2008 (pp. 667-672). Article 4741245 (Proceedings - 10th IEEE International Symposium on Multimedia, ISM 2008). https://doi.org/10.1109/ISM.2008.117

@inproceedings{83ca5d1e26154412a6b50e6035818c9d,

title = "A computable visual attention model for video skimming",

abstract = "A novel computable visual attention model (VAM) for video skimming algorithm is proposed. Videos bear more motion features than images do. Objects in videos cause different attention effects, depending on various situations, positions, motions, and appearances. The static visual attention model is based on spatial distribution, visual object, or both, but fall short in solving temporal attention effects. The proposed VAM model adopts the alive-time(AT) of a visual object as a new descriptor to improve the accuracy of locating highlight in a video clip, then produces better video skimming results. The model is represented by a set of descriptors to be computable and provide a generic framework for video analysis. The temporal variations of attention value in a video clip are weighted by non-linear Chi-square distribution. Then the highlights of the frames in thevideo are represented by the attention window (AW) and the attention values of the visual objects (AOs) are tracked and used to generate the attention curve of the video. At last, a video skimming strategy is used to select the highlights of the video by analyzing the attention curve. The experiment result shows that the proposed model makes the skimming results 15%~25% shorter than previous methods.",

author = "Longfei Zhang and Yuanda Cao and Gangyi Ding and Yong Wang",

year = "2008",

doi = "10.1109/ISM.2008.117",

language = "English",

isbn = "9780769534541",

series = "Proceedings - 10th IEEE International Symposium on Multimedia, ISM 2008",

pages = "667--672",

booktitle = "Proceedings - 10th IEEE International Symposium on Multimedia, ISM 2008",

note = "10th IEEE International Symposium on Multimedia, ISM 2008 ; Conference date: 15-12-2008 Through 17-12-2008",

}

Zhang, L, Cao, Y, Ding, G & Wang, Y 2008, A computable visual attention model for video skimming. in Proceedings - 10th IEEE International Symposium on Multimedia, ISM 2008., 4741245, Proceedings - 10th IEEE International Symposium on Multimedia, ISM 2008, pp. 667-672, 10th IEEE International Symposium on Multimedia, ISM 2008, Berkeley, CA, United States, 15/12/08. https://doi.org/10.1109/ISM.2008.117

A computable visual attention model for video skimming. / Zhang, Longfei; Cao, Yuanda; Ding, Gangyi et al.
Proceedings - 10th IEEE International Symposium on Multimedia, ISM 2008. 2008. p. 667-672 4741245 (Proceedings - 10th IEEE International Symposium on Multimedia, ISM 2008).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

TY - GEN

T1 - A computable visual attention model for video skimming

AU - Zhang, Longfei

AU - Cao, Yuanda

AU - Ding, Gangyi

AU - Wang, Yong

PY - 2008

Y1 - 2008

N2 - A novel computable visual attention model (VAM) for video skimming algorithm is proposed. Videos bear more motion features than images do. Objects in videos cause different attention effects, depending on various situations, positions, motions, and appearances. The static visual attention model is based on spatial distribution, visual object, or both, but fall short in solving temporal attention effects. The proposed VAM model adopts the alive-time(AT) of a visual object as a new descriptor to improve the accuracy of locating highlight in a video clip, then produces better video skimming results. The model is represented by a set of descriptors to be computable and provide a generic framework for video analysis. The temporal variations of attention value in a video clip are weighted by non-linear Chi-square distribution. Then the highlights of the frames in thevideo are represented by the attention window (AW) and the attention values of the visual objects (AOs) are tracked and used to generate the attention curve of the video. At last, a video skimming strategy is used to select the highlights of the video by analyzing the attention curve. The experiment result shows that the proposed model makes the skimming results 15%~25% shorter than previous methods.

AB - A novel computable visual attention model (VAM) for video skimming algorithm is proposed. Videos bear more motion features than images do. Objects in videos cause different attention effects, depending on various situations, positions, motions, and appearances. The static visual attention model is based on spatial distribution, visual object, or both, but fall short in solving temporal attention effects. The proposed VAM model adopts the alive-time(AT) of a visual object as a new descriptor to improve the accuracy of locating highlight in a video clip, then produces better video skimming results. The model is represented by a set of descriptors to be computable and provide a generic framework for video analysis. The temporal variations of attention value in a video clip are weighted by non-linear Chi-square distribution. Then the highlights of the frames in thevideo are represented by the attention window (AW) and the attention values of the visual objects (AOs) are tracked and used to generate the attention curve of the video. At last, a video skimming strategy is used to select the highlights of the video by analyzing the attention curve. The experiment result shows that the proposed model makes the skimming results 15%~25% shorter than previous methods.

UR - http://www.scopus.com/inward/record.url?scp=62949213983&partnerID=8YFLogxK

U2 - 10.1109/ISM.2008.117

DO - 10.1109/ISM.2008.117

M3 - Conference contribution

AN - SCOPUS:62949213983

SN - 9780769534541

T3 - Proceedings - 10th IEEE International Symposium on Multimedia, ISM 2008

SP - 667

EP - 672

BT - Proceedings - 10th IEEE International Symposium on Multimedia, ISM 2008

T2 - 10th IEEE International Symposium on Multimedia, ISM 2008

Y2 - 15 December 2008 through 17 December 2008

ER -

A computable visual attention model for video skimming

Abstract

Publication series

Conference

Access to Document

Other files and links

Fingerprint

Cite this