TY - JOUR
T1 - A Survey on Deep Learning Technique for Video Segmentation
AU - Zhou, Tianfei
AU - Porikli, Fatih
AU - Crandall, David J.
AU - Van Gool, Luc
AU - Wang, Wenguan
N1 - Publisher Copyright:
© 1979-2012 IEEE.
PY - 2023/6/1
Y1 - 2023/6/1
N2 - Video segmentation - partitioning video frames into multiple segments or objects - plays a critical role in a broad range of practical applications, from enhancing visual effects in movie, to understanding scenes in autonomous driving, to creating virtual background in video conferencing. Recently, with the renaissance of connectionism in computer vision, there has been an influx of deep learning based approaches for video segmentation that have delivered compelling performance. In this survey, we comprehensively review two basic lines of research - generic object segmentation (of unknown categories) in videos, and video semantic segmentation - by introducing their respective task settings, background concepts, perceived need, development history, and main challenges. We also offer a detailed overview of representative literature on both methods and datasets. We further benchmark the reviewed methods on several well-known datasets. Finally, we point out open issues in this field, and suggest opportunities for further research. We also provide a public website to continuously track developments in this fast advancing field: https://github.com/tfzhou/VS-Survey.
AB - Video segmentation - partitioning video frames into multiple segments or objects - plays a critical role in a broad range of practical applications, from enhancing visual effects in movie, to understanding scenes in autonomous driving, to creating virtual background in video conferencing. Recently, with the renaissance of connectionism in computer vision, there has been an influx of deep learning based approaches for video segmentation that have delivered compelling performance. In this survey, we comprehensively review two basic lines of research - generic object segmentation (of unknown categories) in videos, and video semantic segmentation - by introducing their respective task settings, background concepts, perceived need, development history, and main challenges. We also offer a detailed overview of representative literature on both methods and datasets. We further benchmark the reviewed methods on several well-known datasets. Finally, we point out open issues in this field, and suggest opportunities for further research. We also provide a public website to continuously track developments in this fast advancing field: https://github.com/tfzhou/VS-Survey.
KW - Video segmentation
KW - deep learning
KW - video object segmentation
KW - video semantic segmentation
UR - http://www.scopus.com/inward/record.url?scp=85144074257&partnerID=8YFLogxK
U2 - 10.1109/TPAMI.2022.3225573
DO - 10.1109/TPAMI.2022.3225573
M3 - Article
AN - SCOPUS:85144074257
SN - 0162-8828
VL - 45
SP - 7099
EP - 7122
JO - IEEE Transactions on Pattern Analysis and Machine Intelligence
JF - IEEE Transactions on Pattern Analysis and Machine Intelligence
IS - 6
ER -