Deep neural network based unsupervised video representation

Xinxiao Wu, Kun Wu

Research output: Contribution to journal, Review article, peer-reviewed

Abstract

Most video representation methods in computer vision are supervised and require large labeled training video sets, which are expensive to scale to rapidly growing data. To address this problem, this paper proposes an unsupervised video representation method based on a deep convolutional neural network. Improved dense trajectories (iDT) are used to extract video blocks, which are then used to train the convolutional neural network and the clusters in alternation. The deep convolutional neural network model is trained with this iterative algorithm to obtain unsupervised video representations. The proposed model is applied to extract features on the HMDB 51 and CCV datasets for the tasks of motion recognition and event detection, respectively. In the experiments, the method obtains a mean accuracy of 62.6% and a mean average precision (mAP) of 43.6%, respectively, which demonstrates its effectiveness.
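
The alternation between clustering and network training described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not name the clustering algorithm or the network architecture, so k-means and a tiny two-layer network are assumptions here, random vectors stand in for iDT video blocks, and the names `kmeans`, `train_step`, and `alternate` are hypothetical.

```python
import numpy as np

def kmeans(x, k, iters=10, seed=0):
    """Plain k-means (assumed clustering step); returns a cluster index per row."""
    rng = np.random.default_rng(seed)
    centers = x[rng.choice(len(x), size=k, replace=False)]
    for _ in range(iters):
        # Squared distance from every point to every center.
        d = ((x[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
        labels = d.argmin(axis=1)
        for j in range(k):
            if np.any(labels == j):
                centers[j] = x[labels == j].mean(axis=0)
    return labels

def train_step(x, labels, w1, w2, lr=0.1):
    """One gradient step of softmax cross-entropy against the pseudo-labels."""
    n = len(x)
    h = np.tanh(x @ w1)                       # embedding, a stand-in for CNN features
    logits = h @ w2
    logits -= logits.max(axis=1, keepdims=True)
    p = np.exp(logits)
    p /= p.sum(axis=1, keepdims=True)
    loss = -np.log(p[np.arange(n), labels] + 1e-12).mean()
    g = p.copy()
    g[np.arange(n), labels] -= 1.0
    g /= n                                    # gradient of the loss w.r.t. logits
    grad_w2 = h.T @ g
    grad_w1 = x.T @ ((g @ w2.T) * (1.0 - h ** 2))
    w1 -= lr * grad_w1                        # in-place parameter updates
    w2 -= lr * grad_w2
    return loss

def alternate(x, k=3, rounds=3, steps=50, hidden=8, seed=0):
    """Alternate between clustering the embeddings and training on the clusters."""
    rng = np.random.default_rng(seed)
    w1 = rng.normal(scale=0.1, size=(x.shape[1], hidden))
    for _ in range(rounds):
        labels = kmeans(np.tanh(x @ w1), k, seed=seed)
        # Fresh classifier head each round: cluster indices are not
        # aligned from one clustering run to the next.
        w2 = rng.normal(scale=0.1, size=(hidden, k))
        for _ in range(steps):
            loss = train_step(x, labels, w1, w2)
    return w1, labels, loss

# Random vectors stand in for descriptors of extracted video blocks.
blocks = np.random.default_rng(1).normal(size=(60, 16))
w1, pseudo_labels, final_loss = alternate(blocks)
```

After training, `np.tanh(blocks @ w1)` plays the role of the learned representation that would be passed to a downstream recognition or detection classifier.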

Original language: English
Pages (from-to): 8-12
Number of pages: 5
Journal: Beijing Jiaotong Daxue Xuebao/Journal of Beijing Jiaotong University
Volume: 41
Issue number: 6
Publication status: Published - 1 Dec 2017
