TY - JOUR
T1 - Movie scene recognition using panoramic frame and representative feature patches
AU - Gao, Guang Yu
AU - Ma, Hua Dong
PY - 2014/1
Y1 - 2014/1
N2 - Recognizing scene information in images or videos, such as locating the objects and answering "Where am I?", has attracted much attention in computer vision research field. Many existing scene recognition methods focus on static images, and cannot achieve satisfactory results on videos which contain more complex scenes features than images. In this paper, we propose a robust movie scene recognition approach based on panoramic frame and representative feature patch. More specifically, the movie is first efficiently segmented into video shots and scenes. Secondly, we introduce a novel key-frame extraction method using panoramic frame and also a local feature extraction process is applied to get the representative feature patches (RFPs) in each video shot. Thirdly, a Latent Dirichlet Allocation (LDA) based recognition model is trained to recognize the scene within each individual video scene clip. The correlations between video clips are considered to enhance the recognition performance. When our proposed approach is implemented to recognize the scene in realistic movies, the experimental results shows that it can achieve satisfactory performance.
AB - Recognizing scene information in images or videos, such as locating the objects and answering "Where am I?", has attracted much attention in computer vision research field. Many existing scene recognition methods focus on static images, and cannot achieve satisfactory results on videos which contain more complex scenes features than images. In this paper, we propose a robust movie scene recognition approach based on panoramic frame and representative feature patch. More specifically, the movie is first efficiently segmented into video shots and scenes. Secondly, we introduce a novel key-frame extraction method using panoramic frame and also a local feature extraction process is applied to get the representative feature patches (RFPs) in each video shot. Thirdly, a Latent Dirichlet Allocation (LDA) based recognition model is trained to recognize the scene within each individual video scene clip. The correlations between video clips are considered to enhance the recognition performance. When our proposed approach is implemented to recognize the scene in realistic movies, the experimental results shows that it can achieve satisfactory performance.
KW - key-frame extraction
KW - movie scene recognition
KW - panoramic frame
KW - representative feature
UR - http://www.scopus.com/inward/record.url?scp=84893300097&partnerID=8YFLogxK
U2 - 10.1007/s11390-014-1418-9
DO - 10.1007/s11390-014-1418-9
M3 - Article
AN - SCOPUS:84893300097
SN - 1000-9000
VL - 29
SP - 155
EP - 164
JO - Journal of Computer Science and Technology
JF - Journal of Computer Science and Technology
IS - 1
ER -