TY - JOUR
T1 - Deep Manifold-To-Manifold Transforming Network for Skeleton-Based Action Recognition
AU - Zhang, Tong
AU - Zheng, Wenming
AU - Cui, Zhen
AU - Zong, Yuan
AU - Li, Chaolong
AU - Zhou, Xiaoyan
AU - Yang, Jian
N1 - Publisher Copyright:
© 1999-2012 IEEE.
PY - 2020/11
Y1 - 2020/11
N2 - In this paper, we will investigate skeleton-based action recognition by employing high-order statistics feature and first-order statistics feature, where the high-order statistics feature is characterized by symmetric positive definite (SPD) matrices. Noting that SPD matrices are theoretically embedded on Riemannian manifolds, we propose an end-To-end deep manifold-To-manifold transforming network (DMT-Net), which can make SPD matrices flow from one Riemannian manifold to another one for facilitating the action recognition task. To learn discriminative SPD features from both spatial and temporal dependencies, we propose a neural network model with three novel layers on manifolds: i.e., (1) the local SPD convolutional layer, (2) the non-linear SPD activation layer, and (3) the Riemannian-preserved recursive layer. The SPD property is preserved through all layers without the singular value decomposition (SVD) operation, which has to be conducted in the existing methods with expensive computation cost. Furthermore, a diagonalizing SPD layer is designed to efficiently calculate the final metric for the classification task. Finally, DMT-Net is further fused with a first order layer to capture temporal evolution information. To evaluate our proposed method, we conduct extensive experiments on the task of action recognition, where the input signals are represented as SPD matrices. The experimental results demonstrate that the proposed method is competitive over state-of-The-Art methods.
AB - In this paper, we will investigate skeleton-based action recognition by employing high-order statistics feature and first-order statistics feature, where the high-order statistics feature is characterized by symmetric positive definite (SPD) matrices. Noting that SPD matrices are theoretically embedded on Riemannian manifolds, we propose an end-To-end deep manifold-To-manifold transforming network (DMT-Net), which can make SPD matrices flow from one Riemannian manifold to another one for facilitating the action recognition task. To learn discriminative SPD features from both spatial and temporal dependencies, we propose a neural network model with three novel layers on manifolds: i.e., (1) the local SPD convolutional layer, (2) the non-linear SPD activation layer, and (3) the Riemannian-preserved recursive layer. The SPD property is preserved through all layers without the singular value decomposition (SVD) operation, which has to be conducted in the existing methods with expensive computation cost. Furthermore, a diagonalizing SPD layer is designed to efficiently calculate the final metric for the classification task. Finally, DMT-Net is further fused with a first order layer to capture temporal evolution information. To evaluate our proposed method, we conduct extensive experiments on the task of action recognition, where the input signals are represented as SPD matrices. The experimental results demonstrate that the proposed method is competitive over state-of-The-Art methods.
KW - Riemannian manifold
KW - SPD matrix
KW - action recognition
KW - deep learning
UR - http://www.scopus.com/inward/record.url?scp=85094888571&partnerID=8YFLogxK
U2 - 10.1109/TMM.2020.2966878
DO - 10.1109/TMM.2020.2966878
M3 - Article
AN - SCOPUS:85094888571
SN - 1520-9210
VL - 22
SP - 2926
EP - 2937
JO - IEEE Transactions on Multimedia
JF - IEEE Transactions on Multimedia
IS - 11
M1 - 8960323
ER -