Target-Aware Object Discovery and Association for Unsupervised Video Multi-Object Segmentation

Tianfei Zhou, Jianwu Li*, Xueyi Li, Ling Shao

*此作品的通讯作者

科研成果: 书/报告/会议事项章节会议稿件同行评审

38 引用 (Scopus)

摘要

This paper addresses the task of unsupervised video multi-object segmentation. Current approaches follow a two-stage paradigm: 1) detect object proposals using pre-trained Mask R-CNN, and 2) conduct generic feature matching for temporal association using re-identification techniques. However, the generic features, widely used in both stages, are not reliable for characterizing unseen objects, leading to poor generalization. To address this, we introduce a novel approach for more accurate and efficient spatio-temporal segmentation. In particular, to address instance discrimination, we propose to combine foreground region estimation and instance grouping together in one network, and additionally introduce temporal guidance for segmenting each frame, enabling more accurate object discovery. For temporal association, we complement current video object segmentation architectures with a discriminative appearance model, capable of capturing more fine-grained target-specific information. Given object proposals from the instance discrimination network, three essential strategies are adopted to achieve accurate segmentation: 1) target-specific tracking using a memory-augmented appearance model; 2) target-agnostic verification to trace possible tracklets for the proposal; 3) adaptive memory updating using the verified segments. We evaluate the proposed approach on DAVIS17 and YouTube-VIS, and the results demonstrate that it outperforms state-of-the-art methods both in segmentation accuracy and inference speed.

源语言英语
主期刊名Proceedings - 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2021
出版商IEEE Computer Society
6981-6990
页数10
ISBN(电子版)9781665445092
DOI
出版状态已出版 - 2021
活动2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2021 - Virtual, Online, 美国
期限: 19 6月 202125 6月 2021

出版系列

姓名Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition
ISSN(印刷版)1063-6919

会议

会议2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2021
国家/地区美国
Virtual, Online
时期19/06/2125/06/21

指纹

探究 'Target-Aware Object Discovery and Association for Unsupervised Video Multi-Object Segmentation' 的科研主题。它们共同构成独一无二的指纹。

引用此