Adaptive Online Learning for Video Object Segmentation

Li Wei, Chunyan Xu*, Tong Zhang

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

In this work, we address the problem of video object segmentation (VOS), namely segmenting specific objects throughout a video sequence when given only an annotated first frame. Previous VOS methods based on deep neural networks often solves this problem by fine-tuning the segmentation model in the first frame of the test video sequence, which is time-consuming and can not be well adapted to the current target video. In this paper, we proposed the adaptive online learning for video object segmentation (AOL-VOS), which adaptively optimizes the network parameters and hyperparameters of segmentation model for better predicting the segmentation results. Specifically, we first pre-train the segmentation model with the static video frames and then learn the effective adaptation strategy on the training set by optimizing both network parameters and hyperparameters. In the testing process, we learn how to online adapt the learned segmentation model to the specific testing video sequence and the corresponding future video frames, where the confidence patterns is employed to constrain/guide the implementation of adaptive learning process by fusing both object appearance and motion cue information. Comprehensive evaluations on Davis 16 and SegTrack V2 datasets well demonstrate the significant superiority of our proposed AOL-VOS over other state-of-the-arts for video object segmentation task.

Original languageEnglish
Title of host publicationIntelligence Science and Big Data Engineering. Visual Data Engineering - 9th International Conference, IScIDE 2019, Proceedings, Part 1
EditorsZhen Cui, Jinshan Pan, Shanshan Zhang, Liang Xiao, Jian Yang
PublisherSpringer
Pages22-34
Number of pages13
ISBN (Print)9783030361884
DOIs
Publication statusPublished - 2019
Externally publishedYes
Event9th International Conference on Intelligence Science and Big Data Engineering, IScIDE 2019 - Nanjing, China
Duration: 17 Oct 201920 Oct 2019

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume11935 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference9th International Conference on Intelligence Science and Big Data Engineering, IScIDE 2019
Country/TerritoryChina
CityNanjing
Period17/10/1920/10/19

Keywords

  • Adaptation
  • Online-learning
  • Video object segmentation

Fingerprint

Dive into the research topics of 'Adaptive Online Learning for Video Object Segmentation'. Together they form a unique fingerprint.

Cite this

Wei, L., Xu, C., & Zhang, T. (2019). Adaptive Online Learning for Video Object Segmentation. In Z. Cui, J. Pan, S. Zhang, L. Xiao, & J. Yang (Eds.), Intelligence Science and Big Data Engineering. Visual Data Engineering - 9th International Conference, IScIDE 2019, Proceedings, Part 1 (pp. 22-34). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 11935 LNCS). Springer. https://doi.org/10.1007/978-3-030-36189-1_2