TY - JOUR
T1 - Conditional Diffusion Model for Skeleton-Based Gesture Recognition With Severe Occlusions
AU - Liu, Jinting
AU - Gan, Minggang
AU - Du, Yao
AU - Guan, Keyi
AU - Guo, Jia
N1 - Publisher Copyright:
© 1994-2012 IEEE.
PY - 2025
Y1 - 2025
N2 - In the field of skeleton-based gesture recognition, occlusion remains a significant challenge, significantly degrading performance when key joints are occluded or disturbed. To tackle this issue, we propose DiffTrans, a practical conditional diffusion model for occlusion recognition, which enables skeleton-based gesture recognition under high occlusion by generating more likely samples. This study addresses the hand skeleton occlusion problem by framing it as a conditional denoising problem, where unoccluded data serve as observations and occluded data as repair targets. We employ a conditional diffusion model to impute the missing skeleton data and the DSTANet model, which is based on the transformer, to learn the skeleton feature representations. Research results show that the DiffTrans outperforms existing methods under various occlusion modes, maintaining high performance even in scenarios with a high missing rate.
AB - In the field of skeleton-based gesture recognition, occlusion remains a significant challenge, significantly degrading performance when key joints are occluded or disturbed. To tackle this issue, we propose DiffTrans, a practical conditional diffusion model for occlusion recognition, which enables skeleton-based gesture recognition under high occlusion by generating more likely samples. This study addresses the hand skeleton occlusion problem by framing it as a conditional denoising problem, where unoccluded data serve as observations and occluded data as repair targets. We employ a conditional diffusion model to impute the missing skeleton data and the DSTANet model, which is based on the transformer, to learn the skeleton feature representations. Research results show that the DiffTrans outperforms existing methods under various occlusion modes, maintaining high performance even in scenarios with a high missing rate.
KW - Conditional diffusion model
KW - occlusion recognition
KW - skeleton-based gesture recognition
KW - transformer
UR - http://www.scopus.com/inward/record.url?scp=105003376697&partnerID=8YFLogxK
U2 - 10.1109/LSP.2025.3563445
DO - 10.1109/LSP.2025.3563445
M3 - Article
AN - SCOPUS:105003376697
SN - 1070-9908
VL - 32
SP - 1970
EP - 1974
JO - IEEE Signal Processing Letters
JF - IEEE Signal Processing Letters
ER -