TY - JOUR
T1 - Deep neural network based unsupervised video representation
AU - Wu, Xinxiao
AU - Wu, Kun
N1 - Publisher Copyright:
© 2017, Editorial Department of Journal of Beijing Jiaotong University. All right reserved.
PY - 2017/12/1
Y1 - 2017/12/1
N2 - Most video representation methods are supervised in the field of computer vision, requiring large amounts of labeled training video sets which is expensive to scale up to rapidly growing data. To solve this problem, this paper proposes an unsupervised video representation method using deep convolutional neural network. The improved dense trajectory (iDT) is utilized to extract the video blocks which alternately train the convolutional neural network and clusters. The deep convolutional neural network model is trained by iteratively algorithm to get the unsupervised video representations. The proposed model is applied to extract features in HMDB 51 and CCV datasets for tasks of motion recognition and event detection respectively. In the experiments, a 62.6% mean accuracy and a 43.6% mean average prevision (mAP) are obtained respectively which proves the effectiveness of the proposed method.
AB - Most video representation methods are supervised in the field of computer vision, requiring large amounts of labeled training video sets which is expensive to scale up to rapidly growing data. To solve this problem, this paper proposes an unsupervised video representation method using deep convolutional neural network. The improved dense trajectory (iDT) is utilized to extract the video blocks which alternately train the convolutional neural network and clusters. The deep convolutional neural network model is trained by iteratively algorithm to get the unsupervised video representations. The proposed model is applied to extract features in HMDB 51 and CCV datasets for tasks of motion recognition and event detection respectively. In the experiments, a 62.6% mean accuracy and a 43.6% mean average prevision (mAP) are obtained respectively which proves the effectiveness of the proposed method.
KW - Convolution neural networks
KW - Unsupervised learning
KW - Video representation
UR - http://www.scopus.com/inward/record.url?scp=85048325119&partnerID=8YFLogxK
U2 - 10.11860/j.issn.1673-0291.2017.06.002
DO - 10.11860/j.issn.1673-0291.2017.06.002
M3 - Review article
AN - SCOPUS:85048325119
SN - 1673-0291
VL - 41
SP - 8
EP - 12
JO - Beijing Jiaotong Daxue Xuebao/Journal of Beijing Jiaotong University
JF - Beijing Jiaotong Daxue Xuebao/Journal of Beijing Jiaotong University
IS - 6
ER -