Adaptive learning for celebrity identification with video context

Chao Xiong; Guangyu Gao; Zhengjun Zha; Shuicheng Yan; Huadong Ma; Tae Kyun Kim

doi:10.1109/TMM.2014.2316475

Corresponding author: Guangyu Gao

School of Computer Science

Research output: Contribution to journal › Article › peer-review

7 Citations (Scopus)

Abstract

In this paper, we propose a novel semi-supervised learning strategy to address the problem of celebrity identification. The video context information is explored to facilitate the learning process based on the assumption that faces in the same video track share the same identity. Once a frame within a track is recognized confidently, the label can be propagated through the whole track, referred to as the confident track. More specifically, given a few static images and vast face videos, an initial weak classifier is trained and gradually evolves by iteratively promoting the confident tracks into the 'labeled' set. The iterative selection process enriches the diversity of the 'labeled' set such that the performance of the classifier is gradually improved. This learning theme may suffer from semantic drifting caused by errors in selecting the confident tracks. To address this issue, we propose to treat the selected frames as related samples - an intermediate state between labeled and unlabeled instead of labeled as in the traditional approach. To evaluate the performance, we construct a new dataset, which includes 3000 static images and 2700 face tracks of 30 celebrities. Comprehensive evaluations on this dataset and a public video dataset indicate significant improvement of our approach over established baseline methods.
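The iterative promotion of confident tracks described in the abstract can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the nearest-centroid classifier, the use of the distance margin as a confidence score, and all names (`centroids`, `predict`, `self_train`, `threshold`, `related_weight`) are hypothetical. In particular, down-weighting promoted frames with `related_weight` is only a simple stand-in for the paper's "related samples" idea, whose actual formulation is not given in the abstract.

```python
from math import dist


def centroids(samples):
    """Weighted class centroids from (feature, label, weight) triples."""
    sums, totals = {}, {}
    for x, y, w in samples:
        acc = sums.setdefault(y, [0.0] * len(x))
        for i, v in enumerate(x):
            acc[i] += w * v
        totals[y] = totals.get(y, 0.0) + w
    return {y: [v / totals[y] for v in acc] for y, acc in sums.items()}


def predict(cents, x):
    """Return (label, margin): nearest centroid and its gap to the runner-up."""
    ranked = sorted((dist(x, c), y) for y, c in cents.items())
    margin = ranked[1][0] - ranked[0][0] if len(ranked) > 1 else float("inf")
    return ranked[0][1], margin


def self_train(labeled, tracks, threshold=1.0, related_weight=0.5, max_rounds=10):
    """labeled: list of (feature, label); tracks: list of lists of features."""
    # Static images are fully trusted (weight 1.0).
    samples = [(x, y, 1.0) for x, y in labeled]
    pending = list(range(len(tracks)))
    track_labels = {}
    for _ in range(max_rounds):
        cents = centroids(samples)
        promoted = []
        for t in pending:
            # A track counts as confident if its best frame is recognized confidently.
            label, margin = max((predict(cents, x) for x in tracks[t]),
                                key=lambda p: p[1])
            if margin >= threshold:
                # Propagate the label to every frame of the track, but with a
                # reduced weight (assumption: a crude proxy for "related samples").
                track_labels[t] = label
                promoted.append(t)
                samples.extend((x, label, related_weight) for x in tracks[t])
        if not promoted:
            break  # no new confident tracks, so the classifier has stopped evolving
        pending = [t for t in pending if t not in promoted]
    return centroids(samples), track_labels
```

Keeping `related_weight` below 1.0 limits how far a wrongly promoted track can pull a class centroid, which is one simple way to dampen the semantic drift the abstract mentions.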

Original language	English
Article number	6786434
Pages (from-to)	1473-1485
Number of pages	13
Journal	IEEE Transactions on Multimedia
Volume	16
Issue number	5
DOI	https://doi.org/10.1109/TMM.2014.2316475
Publication status	Published - Aug 2014

Cite this

Xiong, C., Gao, G., Zha, Z., Yan, S., Ma, H., & Kim, T. K. (2014). Adaptive learning for celebrity identification with video context. IEEE Transactions on Multimedia, 16(5), 1473-1485. Article 6786434. https://doi.org/10.1109/TMM.2014.2316475

@article{d04ecd83c5594ef1a83e55a2910358da,

title = "Adaptive learning for celebrity identification with video context",

abstract = "In this paper, we propose a novel semi-supervised learning strategy to address the problem of celebrity identification. The video context information is explored to facilitate the learning process based on the assumption that faces in the same video track share the same identity. Once a frame within a track is recognized confidently, the label can be propagated through the whole track, referred to as the confident track. More specifically, given a few static images and vast face videos, an initial weak classifier is trained and gradually evolves by iteratively promoting the confident tracks into the 'labeled' set. The iterative selection process enriches the diversity of the 'labeled' set such that the performance of the classifier is gradually improved. This learning theme may suffer from semantic drifting caused by errors in selecting the confident tracks. To address this issue, we propose to treat the selected frames as related samples - an intermediate state between labeled and unlabeled instead of labeled as in the traditional approach. To evaluate the performance, we construct a new dataset, which includes 3000 static images and 2700 face tracks of 30 celebrities. Comprehensive evaluations on this dataset and a public video dataset indicate significant improvement of our approach over established baseline methods.",

keywords = "Adaptive learning, celebrity identification, related samples, semi-supervised learning, video context",

author = "Chao Xiong and Guangyu Gao and Zhengjun Zha and Shuicheng Yan and Huadong Ma and Kim, {Tae Kyun}",

year = "2014",

month = aug,

doi = "10.1109/TMM.2014.2316475",

language = "English",

volume = "16",

pages = "1473--1485",

journal = "IEEE Transactions on Multimedia",

issn = "1520-9210",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

number = "5",

}

TY - JOUR

T1 - Adaptive learning for celebrity identification with video context

AU - Xiong, Chao

AU - Gao, Guangyu

AU - Zha, Zhengjun

AU - Yan, Shuicheng

AU - Ma, Huadong

AU - Kim, Tae Kyun

PY - 2014/8

Y1 - 2014/8

AB - In this paper, we propose a novel semi-supervised learning strategy to address the problem of celebrity identification. The video context information is explored to facilitate the learning process based on the assumption that faces in the same video track share the same identity. Once a frame within a track is recognized confidently, the label can be propagated through the whole track, referred to as the confident track. More specifically, given a few static images and vast face videos, an initial weak classifier is trained and gradually evolves by iteratively promoting the confident tracks into the 'labeled' set. The iterative selection process enriches the diversity of the 'labeled' set such that the performance of the classifier is gradually improved. This learning theme may suffer from semantic drifting caused by errors in selecting the confident tracks. To address this issue, we propose to treat the selected frames as related samples - an intermediate state between labeled and unlabeled instead of labeled as in the traditional approach. To evaluate the performance, we construct a new dataset, which includes 3000 static images and 2700 face tracks of 30 celebrities. Comprehensive evaluations on this dataset and a public video dataset indicate significant improvement of our approach over established baseline methods.

KW - Adaptive learning

KW - celebrity identification

KW - related samples

KW - semi-supervised learning

KW - video context

UR - http://www.scopus.com/inward/record.url?scp=84904732573&partnerID=8YFLogxK

U2 - 10.1109/TMM.2014.2316475

DO - 10.1109/TMM.2014.2316475

M3 - Article

AN - SCOPUS:84904732573

SN - 1520-9210

VL - 16

SP - 1473

EP - 1485

JO - IEEE Transactions on Multimedia

JF - IEEE Transactions on Multimedia

IS - 5

M1 - 6786434

ER -
