Video annotation by incremental learning from grouped heterogeneous sources

Han Wang*, Hao Song, Xinxiao Wu, Yunde Jia

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

2 Citations (Scopus)

Abstract

Transfer learning has shown promising results in leveraging loosely labeledWeb images (source domain) to learn a robust classifier for the unlabeled consumer videos (target domain). Existing transfer learning methods typically apply source domain data to learn a fixed model for predicting target domain data once and for all, ignoring rapidly updating Web data and continuously changes of users requirements. We propose an incremental transfer learning framework, in which heterogeneous knowledge are integrated and incrementally added to update the target classifier during learning process. Under the framework, images (image source domain) queried from Web image search engine and videos (video source domain) from existing action datasets are adopted to provide static information and motion information of the target video, respectively. For the image source domain, images are partitioned into several groups according to their semantic information. And for the video source domain, videos are divided in the same way. Unlike traditional methods which measure relevance between the source group and the whole target domain videos, the group weights in this paper are treated as latent variables for each target domain video and learned automatically according to the probability distribution difference between the individual source group and target domain videos. Experimental results on the two challenging video datasets (i.e., CCV and Kodak) demonstrate the effectiveness of our proposed method.

Original languageEnglish
Title of host publicationComputer Vision - ACCV 2014 - 12th Asian Conference on Computer Vision, Revised Selected Papers
EditorsDaniel Cremers, Hideo Saito, Ian Reid, Ming-Hsuan Yang
PublisherSpringer Verlag
Pages493-507
Number of pages15
ISBN (Electronic)9783319168135
DOIs
Publication statusPublished - 2015
Event12th Asian Conference on Computer Vision, ACCV 2014 - Singapore, Singapore
Duration: 1 Nov 20145 Nov 2014

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume9007
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference12th Asian Conference on Computer Vision, ACCV 2014
Country/TerritorySingapore
CitySingapore
Period1/11/145/11/14

Fingerprint

Dive into the research topics of 'Video annotation by incremental learning from grouped heterogeneous sources'. Together they form a unique fingerprint.

Cite this