跳到主要导航 跳到搜索 跳到主要内容

MD-STGCN: Dynamic topology and multi-scale temporal modeling for skeleton-based action recognition

  • Shiran Zhu
  • , Ronghua Li
  • , Henan Hu*
  • *此作品的通讯作者
  • Dalian Jiaotong University

科研成果: 期刊稿件文章同行评审

摘要

Skeleton-based action recognition is a cornerstone for complex activity analysis, yet current Graph Convolutional Network (GCN) paradigms are constrained by three bottlenecks: rigid, predefined spatial topologies that fail to adapt to dynamic motion variations; channel-wise topology homogenization that limits multi-level semantic expressiveness; and disjointed spatio-temporal modeling hampered by restricted temporal receptive fields. To address these gaps, we propose the Multi-scale Dynamic Topology Spatio-Temporal Graph Convolutional Network (MD-STGCN). MD-STGCN integrates two complementary modules: a Dynamic Topology Graph Convolution (DT-GC) module that employs a learnable channel-wise topology gate to produce channel-adaptive sub-topologies and adaptive edge weights for finer spatial reasoning, and a Multi-Scale Temporal Convolution (MS-TC) module that uses a lightweight multi-branch design with depthwise-separable temporal convolutions to efficiently capture both short- and long-range temporal dependencies. Extensive experiments on NTU RGB+D and NTU RGB+D 120 demonstrate that MD-STGCN achieves state-of-the-art performance: the joint stream achieves a Top-1 accuracy of 99.0% on NTU cross-view, yielding a 1.6% absolute gain over STGCN++. Moreover, MD-STGCN requires 0.33M parameters and 0.45 GFLOPs, representing a substantial reduction in model complexity compared with CTR-GCN and MS-G3D, thereby offering a superior accuracy–efficiency trade-off. MD-STGCN is a promising candidate for resource-constrained scenarios, particularly in real-time surveillance and human–computer interaction systems.

源语言英语
文章编号123572
期刊Information Sciences
752
DOI
出版状态已出版 - 5 10月 2026
已对外发布

指纹

探究 'MD-STGCN: Dynamic topology and multi-scale temporal modeling for skeleton-based action recognition' 的科研主题。它们共同构成独一无二的指纹。

引用此