Multi-modality movie scene detection using Kernel Canonical Correlation Analysis

Guangyu Gao*, Huadong Ma

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

6 Citations (Scopus)

Abstract

Scene detection is the fundamental step for efficient accessing and browsing videos. In this paper, we propose to segment movie into scenes which utilizes fused visual and audio features. The movie is first segmented into shots by an accelerating algorithm, and the key frames are extracted later. While feature movies are often filmed in open and dynamic environments using moving cameras and have continuously changing contents, we focus on the association extraction of visual and audio features. Then, based on the Kernel Canonical Correlation Analysis (KCCA), all these features are fused for scene detection. Finally, spatial-temporal coherent shots construct the similarity graph which is partitioned to generate the scene boundaries. We conduct extensive experiments on several movies, and the results show that our approach can efficiently detect the scene boundaries with a satisfactory performance.

Original languageEnglish
Title of host publicationICPR 2012 - 21st International Conference on Pattern Recognition
Pages3074-3077
Number of pages4
Publication statusPublished - 2012
Externally publishedYes
Event21st International Conference on Pattern Recognition, ICPR 2012 - Tsukuba, Japan
Duration: 11 Nov 201215 Nov 2012

Publication series

NameProceedings - International Conference on Pattern Recognition
ISSN (Print)1051-4651

Conference

Conference21st International Conference on Pattern Recognition, ICPR 2012
Country/TerritoryJapan
CityTsukuba
Period11/11/1215/11/12

Fingerprint

Dive into the research topics of 'Multi-modality movie scene detection using Kernel Canonical Correlation Analysis'. Together they form a unique fingerprint.

Cite this