Video saliency detection with gated CNN and residual architecture

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

This paper proposes a novel gated recurrently end-to-end model of video salient object detection task to enhance both the performances and efficiency. A context aware aggregation module is first introduced for spatial features learning, these spatial features are then fused directly with a squeeze and excitation block. A two stage gated recurrent module is follow for temporal feature learning and shortcut connections between spatial and temporal features are add to alleviate the degradation problem. Our model has real-time speed of 25 fps on a single GPU which is similar to the approaches for static image salient object detection. We evaluate the proposed method on DAVIS and FBMS dataset. Experimental results show that comparing with other state-of-art saliency detectors, our method is more effective and efficient.

Original languageEnglish
Title of host publicationProceedings - 19th IEEE International Conference on Data Mining Workshops, ICDMW 2019
EditorsPanagiotis Papapetrou, Xueqi Cheng, Qing He
PublisherIEEE Computer Society
Pages898-904
Number of pages7
ISBN (Electronic)9781728146034
DOIs
Publication statusPublished - Nov 2019
Event19th IEEE International Conference on Data Mining Workshops, ICDMW 2019 - Beijing, China
Duration: 8 Nov 201911 Nov 2019

Publication series

NameIEEE International Conference on Data Mining Workshops, ICDMW
Volume2019-November
ISSN (Print)2375-9232
ISSN (Electronic)2375-9259

Conference

Conference19th IEEE International Conference on Data Mining Workshops, ICDMW 2019
Country/TerritoryChina
CityBeijing
Period8/11/1911/11/19

Keywords

  • video saliency detection, gated CNN, shortcut connection

Fingerprint

Dive into the research topics of 'Video saliency detection with gated CNN and residual architecture'. Together they form a unique fingerprint.

Cite this