A Cycle Architecture Based on Policy Gradient for Unsupervised Video Summarization

Yubo An, Shenghui Zhao*, Guoqiang Zhang

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

This paper proposes a cycle architecture based on policy gradients for unsupervised video summarization. Specifically, the Modified DSNet and DSN-attention net constitute a cycle architecture and promote each other in the training stage to achieve higher performance compared with the unsupervised methods that formulate video summarization as a sequential decision-making process. In the training stage, the DSN-attention net is trained by the policy gradient in combination with the additional MSE loss between the two outputs of the modified DSNet and DSN-attention net. Then the output of the DSN-attention net is taken for generating the labels to train the modified DSNet. As a result, a cycle architecture is built up for unsupervised video summarization. At the test stage, the final video summary is produced by the average fusion of the outputs of both the Modified DSNet and DSN attention net. Extensive experiments and analysis on two benchmark datasets demonstrate the effectiveness of our method and its superior performance in comparison with the state-of-the-art unsupervised methods.

Original languageEnglish
Title of host publicationProceedings of the 15th International Conference on Digital Image Processing, ICDIP 2023
PublisherAssociation for Computing Machinery
ISBN (Electronic)9798400708237
DOIs
Publication statusPublished - 19 May 2023
Event15th International Conference on Digital Image Processing, ICDIP 2023 - Nanjing, China
Duration: 19 May 202322 May 2023

Publication series

NameACM International Conference Proceeding Series

Conference

Conference15th International Conference on Digital Image Processing, ICDIP 2023
Country/TerritoryChina
CityNanjing
Period19/05/2322/05/23

Keywords

  • Cycle Architecture
  • DSN-Attention Net
  • Modified Dsnet
  • Shot Level
  • Unsupervised Video Summarization

Fingerprint

Dive into the research topics of 'A Cycle Architecture Based on Policy Gradient for Unsupervised Video Summarization'. Together they form a unique fingerprint.

Cite this