Abstract
In the field of skeleton-based gesture recognition, occlusion remains a significant challenge, significantly degrading performance when key joints are occluded or disturbed. To tackle this issue, we propose DiffTrans, a practical conditional diffusion model for occlusion recognition, which enables skeleton-based gesture recognition under high occlusion by generating more likely samples. This study addresses the hand skeleton occlusion problem by framing it as a conditional denoising problem, where unoccluded data serve as observations and occluded data as repair targets. We employ a conditional diffusion model to impute the missing skeleton data and the DSTANet model, which is based on the transformer, to learn the skeleton feature representations. Research results show that the DiffTrans outperforms existing methods under various occlusion modes, maintaining high performance even in scenarios with a high missing rate.
| Original language | English |
|---|---|
| Pages (from-to) | 1970-1974 |
| Number of pages | 5 |
| Journal | IEEE Signal Processing Letters |
| Volume | 32 |
| DOIs | |
| Publication status | Published - 2025 |
| Externally published | Yes |
Keywords
- Conditional diffusion model
- occlusion recognition
- skeleton-based gesture recognition
- transformer
Fingerprint
Dive into the research topics of 'Conditional Diffusion Model for Skeleton-Based Gesture Recognition With Severe Occlusions'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver