TY - GEN
T1 - A pre-training strategy for convolutional neural network applied to Chinese digital gesture recognition
AU - Li, Yawei
AU - Yang, Yuliang
AU - Chen, Yueyun
AU - Zhu, Mengyu
N1 - Publisher Copyright:
© 2016 IEEE.
PY - 2016/10/7
Y1 - 2016/10/7
N2 - In this paper, we present an approach to classify Chinese digital gesture based on convolutional neural network (CNN). Principal Component Analysis (PCA) is employed to learn convolution kernels as the pre-training strategy. The learned convolution kernels are used for extracting features instead of the random convolution kernels. The convolutional layers can be directly implemented without any further training, such as Back Propagation (BP). For better understanding, we name the proposed architecture for PCA-based Convolutional Neural Network (PCNN). The dataset is divided into six gesture classes including 14500 gesture images, with 12000 images for training and 2500 images for testing. We examine the robustness of the PCNN against noises and distortions. In addition, the MNIST database of handwritten digits is employed to assess the suitability of the PCNN. Different from the CNN, the PCNN reduces the high computational cost of convolution kernels training. About one-fifth of the training time is shortened. The result shows that our approach classifies six gesture classes with 99.92% accuracy. Multiple experiments manifest the PCNN serving as an efficient approach for image processing and object recognition.
AB - In this paper, we present an approach to classify Chinese digital gesture based on convolutional neural network (CNN). Principal Component Analysis (PCA) is employed to learn convolution kernels as the pre-training strategy. The learned convolution kernels are used for extracting features instead of the random convolution kernels. The convolutional layers can be directly implemented without any further training, such as Back Propagation (BP). For better understanding, we name the proposed architecture for PCA-based Convolutional Neural Network (PCNN). The dataset is divided into six gesture classes including 14500 gesture images, with 12000 images for training and 2500 images for testing. We examine the robustness of the PCNN against noises and distortions. In addition, the MNIST database of handwritten digits is employed to assess the suitability of the PCNN. Different from the CNN, the PCNN reduces the high computational cost of convolution kernels training. About one-fifth of the training time is shortened. The result shows that our approach classifies six gesture classes with 99.92% accuracy. Multiple experiments manifest the PCNN serving as an efficient approach for image processing and object recognition.
KW - Chinese digital gesture recognition
KW - convolution kernels
KW - convolutional neural network
KW - principal component analysis
UR - http://www.scopus.com/inward/record.url?scp=84994478024&partnerID=8YFLogxK
U2 - 10.1109/ICCSN.2016.7586597
DO - 10.1109/ICCSN.2016.7586597
M3 - Conference contribution
AN - SCOPUS:84994478024
T3 - Proceedings of 2016 8th IEEE International Conference on Communication Software and Networks, ICCSN 2016
SP - 620
EP - 624
BT - Proceedings of 2016 8th IEEE International Conference on Communication Software and Networks, ICCSN 2016
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 8th IEEE International Conference on Communication Software and Networks, ICCSN 2016
Y2 - 4 June 2016 through 6 June 2016
ER -