A Survey on Deep Learning Technique for Video Segmentation

Tianfei Zhou, Fatih Porikli, David J. Crandall, Luc Van Gool, Wenguan Wang*

*此作品的通讯作者

科研成果: 期刊稿件文章同行评审

85 引用 (Scopus)

摘要

Video segmentation - partitioning video frames into multiple segments or objects - plays a critical role in a broad range of practical applications, from enhancing visual effects in movie, to understanding scenes in autonomous driving, to creating virtual background in video conferencing. Recently, with the renaissance of connectionism in computer vision, there has been an influx of deep learning based approaches for video segmentation that have delivered compelling performance. In this survey, we comprehensively review two basic lines of research - generic object segmentation (of unknown categories) in videos, and video semantic segmentation - by introducing their respective task settings, background concepts, perceived need, development history, and main challenges. We also offer a detailed overview of representative literature on both methods and datasets. We further benchmark the reviewed methods on several well-known datasets. Finally, we point out open issues in this field, and suggest opportunities for further research. We also provide a public website to continuously track developments in this fast advancing field: https://github.com/tfzhou/VS-Survey.

源语言英语
页(从-至)7099-7122
页数24
期刊IEEE Transactions on Pattern Analysis and Machine Intelligence
45
6
DOI
出版状态已出版 - 1 6月 2023
已对外发布

指纹

探究 'A Survey on Deep Learning Technique for Video Segmentation' 的科研主题。它们共同构成独一无二的指纹。

引用此