跳到主要导航 跳到搜索 跳到主要内容

MoViSense: Multiview Spatiotemporal Transformer for 3-D Human Kinematics Sensing

  • Dezhi Zheng
  • , Zhiyi Yang
  • , Wenfeng Liu
  • , Xiao Liang
  • , Chun Hu*
  • , Kang Ma*
  • *此作品的通讯作者
  • Beijing Institute of Technology

科研成果: 期刊稿件文章同行评审

摘要

Human pose estimation has wide applications in health monitoring, disease diagnosis, and motion rehabilitation. These applications rely on motion assessment using kinematic parameters, which can be obtained from the coordinates of human body key points. Most current noncontact key point measurement methods rely on multiple cameras to reconstruct the human body in realistic multiperson interaction and occlusion scenarios. However, sparse camera configurations often lead to limited pose estimation accuracy, while increasing the number of viewpoints to support high-accuracy kinematic analysis reduces efficiency. In this study, MoViSense is proposed for 3-D human pose estimation and kinematic analysis under sparse camera configurations by exploiting the spatial and temporal continuity. Based on the Transformer, the encoder integrates a multiscale gated feedforward (MSFF) module to enhance spatial representations and cross-view alignment, while a dynamic history fusion-deformable multiscale attention (DHF-DMA) module utilizes temporal continuity of human motion to improve robustness under occlusion. In addition, a biomechanical constraint mechanism (BCM) enforces bone-length consistency. A motion kinetics extractor (MKE) converts estimated 3-D key points into interpretable kinematic parameters. Experiments on the CMU Panoptic dataset show that MoViSense achieves an AP50 of 86.49 and an MPJPE of 27.85 mm, outperforming the other representative methods under sparse camera configurations. The relative deviation (RD) of stride length was 5.37%, and the RD of cadence was 1.45%.

源语言英语
页(从-至)16027-16036
页数10
期刊IEEE Sensors Journal
26
10
DOI
出版状态已出版 - 1 5月 2026

指纹

探究 'MoViSense: Multiview Spatiotemporal Transformer for 3-D Human Kinematics Sensing' 的科研主题。它们共同构成独一无二的指纹。

引用此