A Stacking Ensemble Approach for Supervised Video Summarization

Yubo An, Shenghui Zhao*, Guoqiang Zhang

*此作品的通讯作者

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Existing video summarization methods are classified into either shot-level or frame-level methods, which are individually used in a general way. This paper investigates the underlying complementarity between the frame-level and shot-level methods, and a stacking ensemble approach is proposed for supervised video summarization. Firstly, we build up a stacking model to predict both the key frame probabilities and the temporal interest segments simultaneously. The two components are then combined via soft decision fusion to obtain the final scores of each frame in the video. A joint loss function is proposed for the model training. The ablation experimental results show that the proposed method outperforms both the two corresponding individual method. Furthermore, extensive experimental results on two benchmark datasets shows its superior performance in comparison with the state-of-the-art methods.

源语言英语
主期刊名Proceedings of the 2022 4th International Conference on Video, Signal and Image Processing, VSIP 2022
出版商Association for Computing Machinery
122-127
页数6
ISBN(电子版)9781450397810
DOI
出版状态已出版 - 25 11月 2022
活动4th International Conference on Video, Signal and Image Processing, VSIP 2022 - Shanghai, 中国
期限: 25 11月 202227 11月 2022

出版系列

姓名ACM International Conference Proceeding Series

会议

会议4th International Conference on Video, Signal and Image Processing, VSIP 2022
国家/地区中国
Shanghai
时期25/11/2227/11/22

指纹

探究 'A Stacking Ensemble Approach for Supervised Video Summarization' 的科研主题。它们共同构成独一无二的指纹。

引用此