TY - GEN
T1 - A rule-based hierarchical framework for video parsing in compressed domain
AU - Gao, Li
AU - Hou, Chao Huan
AU - Wang, Dong Hui
AU - Yan, Shefeng
PY - 2009
Y1 - 2009
N2 - In this paper, a rule-based hierarchical approach is proposed for video parsing in the compressed domain. In this approach, three core factors of video parsing that is feature space, discontinuity metric and decision rule are stressed to improve the parsing performance. Firstly, the PCA-based feature is extracted in the compressed domain as the input of the framework, which shows superior performance than the traditional features, especially in the compressed domain. Secondly, an adaptive temporal multi-scale discontinuity metric is designed to measure the temporal inconsistency of the visual feature between two video frames. Finally, through well characterizing the local activity of the shot boundary, a reasonable decision rules based on the adaptive threshold is proposed to parse the video into shots. Through this PCA-based framework, the negative influence of significant camera or objects movement to the video parsing can be greatly depressed, and thus the performance is greatly improved. The proposed approach is implemented on the TREC video test repository to validate the performance.
AB - In this paper, a rule-based hierarchical approach is proposed for video parsing in the compressed domain. In this approach, three core factors of video parsing that is feature space, discontinuity metric and decision rule are stressed to improve the parsing performance. Firstly, the PCA-based feature is extracted in the compressed domain as the input of the framework, which shows superior performance than the traditional features, especially in the compressed domain. Secondly, an adaptive temporal multi-scale discontinuity metric is designed to measure the temporal inconsistency of the visual feature between two video frames. Finally, through well characterizing the local activity of the shot boundary, a reasonable decision rules based on the adaptive threshold is proposed to parse the video into shots. Through this PCA-based framework, the negative influence of significant camera or objects movement to the video parsing can be greatly depressed, and thus the performance is greatly improved. The proposed approach is implemented on the TREC video test repository to validate the performance.
KW - Principal component analysis
KW - Shot boundary detection
KW - Video parsing
UR - http://www.scopus.com/inward/record.url?scp=77951945761&partnerID=8YFLogxK
U2 - 10.1109/ICIP.2009.5413542
DO - 10.1109/ICIP.2009.5413542
M3 - Conference contribution
AN - SCOPUS:77951945761
SN - 9781424456543
T3 - Proceedings - International Conference on Image Processing, ICIP
SP - 4357
EP - 4360
BT - 2009 IEEE International Conference on Image Processing, ICIP 2009 - Proceedings
PB - IEEE Computer Society
T2 - 2009 IEEE International Conference on Image Processing, ICIP 2009
Y2 - 7 November 2009 through 10 November 2009
ER -