A Survey on Deep Learning Technique for Video Segmentation

Tianfei Zhou; Fatih Porikli; David J. Crandall; Luc Van Gool; Wenguan Wang

doi:10.1109/TPAMI.2022.3225573

A Survey on Deep Learning Technique for Video Segmentation

Tianfei Zhou, Fatih Porikli, David J. Crandall, Luc Van Gool, Wenguan Wang^*

^*此作品的通讯作者

科研成果: 期刊稿件 › 文章 › 同行评审

85 引用（Scopus）

摘要

Video segmentation - partitioning video frames into multiple segments or objects - plays a critical role in a broad range of practical applications, from enhancing visual effects in movie, to understanding scenes in autonomous driving, to creating virtual background in video conferencing. Recently, with the renaissance of connectionism in computer vision, there has been an influx of deep learning based approaches for video segmentation that have delivered compelling performance. In this survey, we comprehensively review two basic lines of research - generic object segmentation (of unknown categories) in videos, and video semantic segmentation - by introducing their respective task settings, background concepts, perceived need, development history, and main challenges. We also offer a detailed overview of representative literature on both methods and datasets. We further benchmark the reviewed methods on several well-known datasets. Finally, we point out open issues in this field, and suggest opportunities for further research. We also provide a public website to continuously track developments in this fast advancing field: https://github.com/tfzhou/VS-Survey.

源语言	英语
页（从-至）	7099-7122
页数	24
期刊	IEEE Transactions on Pattern Analysis and Machine Intelligence
卷	45
期	6
DOI	https://doi.org/10.1109/TPAMI.2022.3225573
出版状态	已出版 - 1 6月 2023
已对外发布	是

访问文件

10.1109/TPAMI.2022.3225573

其它文件与链接

链接到 Scopus 的出版物

引用此

@article{1bd731e9ed3c4d0f878f776a4186a60a,

title = "A Survey on Deep Learning Technique for Video Segmentation",

abstract = "Video segmentation - partitioning video frames into multiple segments or objects - plays a critical role in a broad range of practical applications, from enhancing visual effects in movie, to understanding scenes in autonomous driving, to creating virtual background in video conferencing. Recently, with the renaissance of connectionism in computer vision, there has been an influx of deep learning based approaches for video segmentation that have delivered compelling performance. In this survey, we comprehensively review two basic lines of research - generic object segmentation (of unknown categories) in videos, and video semantic segmentation - by introducing their respective task settings, background concepts, perceived need, development history, and main challenges. We also offer a detailed overview of representative literature on both methods and datasets. We further benchmark the reviewed methods on several well-known datasets. Finally, we point out open issues in this field, and suggest opportunities for further research. We also provide a public website to continuously track developments in this fast advancing field: https://github.com/tfzhou/VS-Survey.",

keywords = "Video segmentation, deep learning, video object segmentation, video semantic segmentation",

author = "Tianfei Zhou and Fatih Porikli and Crandall, {David J.} and {Van Gool}, Luc and Wenguan Wang",

note = "Publisher Copyright: {\textcopyright} 1979-2012 IEEE.",

year = "2023",

month = jun,

day = "1",

doi = "10.1109/TPAMI.2022.3225573",

language = "English",

volume = "45",

pages = "7099--7122",

journal = "IEEE Transactions on Pattern Analysis and Machine Intelligence",

issn = "0162-8828",

publisher = "IEEE Computer Society",

number = "6",

}

TY - JOUR

T1 - A Survey on Deep Learning Technique for Video Segmentation

AU - Zhou, Tianfei

AU - Porikli, Fatih

AU - Crandall, David J.

AU - Van Gool, Luc

AU - Wang, Wenguan

PY - 2023/6/1

Y1 - 2023/6/1

N2 - Video segmentation - partitioning video frames into multiple segments or objects - plays a critical role in a broad range of practical applications, from enhancing visual effects in movie, to understanding scenes in autonomous driving, to creating virtual background in video conferencing. Recently, with the renaissance of connectionism in computer vision, there has been an influx of deep learning based approaches for video segmentation that have delivered compelling performance. In this survey, we comprehensively review two basic lines of research - generic object segmentation (of unknown categories) in videos, and video semantic segmentation - by introducing their respective task settings, background concepts, perceived need, development history, and main challenges. We also offer a detailed overview of representative literature on both methods and datasets. We further benchmark the reviewed methods on several well-known datasets. Finally, we point out open issues in this field, and suggest opportunities for further research. We also provide a public website to continuously track developments in this fast advancing field: https://github.com/tfzhou/VS-Survey.

AB - Video segmentation - partitioning video frames into multiple segments or objects - plays a critical role in a broad range of practical applications, from enhancing visual effects in movie, to understanding scenes in autonomous driving, to creating virtual background in video conferencing. Recently, with the renaissance of connectionism in computer vision, there has been an influx of deep learning based approaches for video segmentation that have delivered compelling performance. In this survey, we comprehensively review two basic lines of research - generic object segmentation (of unknown categories) in videos, and video semantic segmentation - by introducing their respective task settings, background concepts, perceived need, development history, and main challenges. We also offer a detailed overview of representative literature on both methods and datasets. We further benchmark the reviewed methods on several well-known datasets. Finally, we point out open issues in this field, and suggest opportunities for further research. We also provide a public website to continuously track developments in this fast advancing field: https://github.com/tfzhou/VS-Survey.

KW - Video segmentation

KW - deep learning

KW - video object segmentation

KW - video semantic segmentation

UR - http://www.scopus.com/inward/record.url?scp=85144074257&partnerID=8YFLogxK

U2 - 10.1109/TPAMI.2022.3225573

DO - 10.1109/TPAMI.2022.3225573

M3 - Article

AN - SCOPUS:85144074257

SN - 0162-8828

VL - 45

SP - 7099

EP - 7122

JO - IEEE Transactions on Pattern Analysis and Machine Intelligence

JF - IEEE Transactions on Pattern Analysis and Machine Intelligence

IS - 6

ER -

A Survey on Deep Learning Technique for Video Segmentation

摘要

访问文件

其它文件与链接

指纹

引用此