TY - GEN
T1 - Adaptive Online Learning for Video Object Segmentation
AU - Wei, Li
AU - Xu, Chunyan
AU - Zhang, Tong
N1 - Publisher Copyright:
© 2019, Springer Nature Switzerland AG.
PY - 2019
Y1 - 2019
N2 - In this work, we address the problem of video object segmentation (VOS), namely segmenting specific objects throughout a video sequence when given only an annotated first frame. Previous VOS methods based on deep neural networks often solve this problem by fine-tuning the segmentation model on the first frame of the test video sequence, which is time-consuming and cannot adapt well to the current target video. In this paper, we propose adaptive online learning for video object segmentation (AOL-VOS), which adaptively optimizes the network parameters and hyperparameters of the segmentation model to better predict the segmentation results. Specifically, we first pre-train the segmentation model with static video frames and then learn an effective adaptation strategy on the training set by optimizing both network parameters and hyperparameters. In the testing process, we learn how to adapt the learned segmentation model online to the specific test video sequence and its future frames, where confidence patterns are employed to constrain and guide the adaptive learning process by fusing both object appearance and motion cue information. Comprehensive evaluations on the DAVIS 2016 and SegTrack v2 datasets demonstrate the significant superiority of our proposed AOL-VOS over other state-of-the-art methods on the video object segmentation task.
AB - In this work, we address the problem of video object segmentation (VOS), namely segmenting specific objects throughout a video sequence when given only an annotated first frame. Previous VOS methods based on deep neural networks often solve this problem by fine-tuning the segmentation model on the first frame of the test video sequence, which is time-consuming and cannot adapt well to the current target video. In this paper, we propose adaptive online learning for video object segmentation (AOL-VOS), which adaptively optimizes the network parameters and hyperparameters of the segmentation model to better predict the segmentation results. Specifically, we first pre-train the segmentation model with static video frames and then learn an effective adaptation strategy on the training set by optimizing both network parameters and hyperparameters. In the testing process, we learn how to adapt the learned segmentation model online to the specific test video sequence and its future frames, where confidence patterns are employed to constrain and guide the adaptive learning process by fusing both object appearance and motion cue information. Comprehensive evaluations on the DAVIS 2016 and SegTrack v2 datasets demonstrate the significant superiority of our proposed AOL-VOS over other state-of-the-art methods on the video object segmentation task.
KW - Adaptation
KW - Online-learning
KW - Video object segmentation
UR - http://www.scopus.com/inward/record.url?scp=85077116951&partnerID=8YFLogxK
U2 - 10.1007/978-3-030-36189-1_2
DO - 10.1007/978-3-030-36189-1_2
M3 - Conference contribution
AN - SCOPUS:85077116951
SN - 9783030361884
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 22
EP - 34
BT - Intelligence Science and Big Data Engineering. Visual Data Engineering - 9th International Conference, IScIDE 2019, Proceedings, Part I
A2 - Cui, Zhen
A2 - Pan, Jinshan
A2 - Zhang, Shanshan
A2 - Xiao, Liang
A2 - Yang, Jian
PB - Springer
T2 - 9th International Conference on Intelligence Science and Big Data Engineering, IScIDE 2019
Y2 - 17 October 2019 through 20 October 2019
ER -