TY - JOUR
T1 - Prior Normality Prompt Transformer for Multiclass Industrial Image Anomaly Detection
AU - Yao, Haiming
AU - Cao, Yunkang
AU - Luo, Wei
AU - Zhang, Weihang
AU - Yu, Wenyong
AU - Shen, Weiming
N1 - Publisher Copyright:
© 2005-2012 IEEE.
PY - 2024
Y1 - 2024
N2 - Image anomaly detection plays a pivotal role in industrial inspection. Traditional approaches often demand distinct models for specific categories, resulting in substantial deployment costs. This raises concerns about multiclass anomaly detection, where a unified model is developed for multiple classes. However, applying conventional methods, particularly reconstruction-based models, directly to multiclass scenarios encounters challenges, such as identical shortcut learning, hindering effective discrimination between normal and abnormal instances. To tackle this issue, our study introduces the prior normality prompt transformer (PNPT) method for multiclass image anomaly detection. PNPT strategically incorporates normal semantics prompting to mitigate the 'identical mapping' problem. This entails integrating a prior normality prompt into the reconstruction process, yielding a dual-stream model. This innovative architecture combines normal prior semantics with abnormal samples, enabling dual-stream reconstruction grounded in both prior knowledge and intrinsic sample characteristics. PNPT comprises four essential modules: 1) class-specific normality prompting pool, 2) hierarchical patch embedding, 3) semantic alignment coupling encoding, and 4) contextual semantic conditional decoding. Experimental validation on diverse benchmark datasets and real-world industrial applications highlights PNPT's superior performance in multiclass industrial anomaly detection.
AB - Image anomaly detection plays a pivotal role in industrial inspection. Traditional approaches often demand distinct models for specific categories, resulting in substantial deployment costs. This raises concerns about multiclass anomaly detection, where a unified model is developed for multiple classes. However, applying conventional methods, particularly reconstruction-based models, directly to multiclass scenarios encounters challenges, such as identical shortcut learning, hindering effective discrimination between normal and abnormal instances. To tackle this issue, our study introduces the prior normality prompt transformer (PNPT) method for multiclass image anomaly detection. PNPT strategically incorporates normal semantics prompting to mitigate the 'identical mapping' problem. This entails integrating a prior normality prompt into the reconstruction process, yielding a dual-stream model. This innovative architecture combines normal prior semantics with abnormal samples, enabling dual-stream reconstruction grounded in both prior knowledge and intrinsic sample characteristics. PNPT comprises four essential modules: 1) class-specific normality prompting pool, 2) hierarchical patch embedding, 3) semantic alignment coupling encoding, and 4) contextual semantic conditional decoding. Experimental validation on diverse benchmark datasets and real-world industrial applications highlights PNPT's superior performance in multiclass industrial anomaly detection.
KW - Defect detection
KW - image anomaly detection
KW - prompting
KW - vision Transformer (ViT)
UR - http://www.scopus.com/inward/record.url?scp=85197659514&partnerID=8YFLogxK
U2 - 10.1109/TII.2024.3413322
DO - 10.1109/TII.2024.3413322
M3 - Article
AN - SCOPUS:85197659514
SN - 1551-3203
VL - 20
SP - 11866
EP - 11876
JO - IEEE Transactions on Industrial Informatics
JF - IEEE Transactions on Industrial Informatics
IS - 10
ER -