Learning to generate video object segment proposals

科研成果: 书/报告/会议事项章节会议稿件同行评审

4 引用 (Scopus)

摘要

This paper proposes a fully automatic pipeline to generate accurate object segment proposals in realistic videos. Our approach first detects generic object proposals for all video frames and then learns to rank them using a Convolutional Neural Networks (CNN) descriptor built on appearance and motion cues. The ambiguity of the proposal set can be reduced while the quality can be retained as highly as possible Next, high-scoring proposals are greedily tracked over the entire sequence into distinct tracklets. Observing that the proposal tracklet set at this stage is noisy and redundant, we perform a tracklet selection scheme to suppress the highly overlapped tracklets, and detect occlusions based on appearance and location information. Finally, we exploit holistic appearance cues for refinement of video segment proposals to obtain pixel-accurate segmentation. Our method is evaluated on two video segmentation datasets i.e. SegTrack v1 and FBMS-59 and achieves competitive results in comparison with other state-of-the-art methods.

源语言英语
主期刊名2017 IEEE International Conference on Multimedia and Expo, ICME 2017
出版商IEEE Computer Society
787-792
页数6
ISBN(电子版)9781509060672
DOI
出版状态已出版 - 28 8月 2017
活动2017 IEEE International Conference on Multimedia and Expo, ICME 2017 - Hong Kong, 香港
期限: 10 7月 201714 7月 2017

出版系列

姓名Proceedings - IEEE International Conference on Multimedia and Expo
ISSN(印刷版)1945-7871
ISSN(电子版)1945-788X

会议

会议2017 IEEE International Conference on Multimedia and Expo, ICME 2017
国家/地区香港
Hong Kong
时期10/07/1714/07/17

指纹

探究 'Learning to generate video object segment proposals' 的科研主题。它们共同构成独一无二的指纹。

引用此