Abstract
Human pose estimation has wide applications in health monitoring, disease diagnosis, and motion rehabilitation. These applications rely on motion assessment using kinematic parameters, which can be obtained from the coordinates of human body key points. Most current noncontact key point measurement methods rely on multiple cameras to reconstruct the human body in realistic multiperson interaction and occlusion scenarios. However, sparse camera configurations often lead to limited pose estimation accuracy, while increasing the number of viewpoints to support high-accuracy kinematic analysis reduces efficiency. In this study, MoViSense is proposed for 3-D human pose estimation and kinematic analysis under sparse camera configurations by exploiting the spatial and temporal continuity. Based on the Transformer, the encoder integrates a multiscale gated feedforward (MSFF) module to enhance spatial representations and cross-view alignment, while a dynamic history fusion-deformable multiscale attention (DHF-DMA) module utilizes temporal continuity of human motion to improve robustness under occlusion. In addition, a biomechanical constraint mechanism (BCM) enforces bone-length consistency. A motion kinetics extractor (MKE) converts estimated 3-D key points into interpretable kinematic parameters. Experiments on the CMU Panoptic dataset show that MoViSense achieves an AP50 of 86.49 and an MPJPE of 27.85 mm, outperforming the other representative methods under sparse camera configurations. The relative deviation (RD) of stride length was 5.37%, and the RD of cadence was 1.45%.
| Original language | English |
|---|---|
| Pages (from-to) | 16027-16036 |
| Number of pages | 10 |
| Journal | IEEE Sensors Journal |
| Volume | 26 |
| Issue number | 10 |
| DOIs | |
| Publication status | Published - 1 May 2026 |
Keywords
- Kinematic analysis
- Transformer
- motion assessment
- multiview 3-D human pose estimation
Fingerprint
Dive into the research topics of 'MoViSense: Multiview Spatiotemporal Transformer for 3-D Human Kinematics Sensing'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver