Two-way feature-aligned and attention-rectified adversarial training

Haitao Zhang, Fan Jia, Quanxin Zhang, Yahong Han, Xiaohui Kuang, Yu An Tan

科研成果: 书/报告/会议事项章节会议稿件同行评审

2 引用 (Scopus)

摘要

Adversarial training increases robustness by augmenting training data with adversarial examples. However, vanilla adversarial training may be overfitting to certain adversarial attacks. Small perturbations in images bring in error which is gradually amplified when forwarded through the model so that the error leads to wrong classification. Besides, small perturbations will also distract classifier's attention to significant features that are relevant to the true label. In this paper, we propose a novel two-way feature-aligned and attention-rectified adversarial training (FAAR) to improve adversarial training (AT). FAAR utilizes two-way feature alignment and attention rectification to mitigate the problems mentioned above. FAAR effectively suppresses perturbations in lowlevel, high-level and global features by moving features of perturbed images towards those of clean images with twoway feature alignment. It also leads the model into focusing more on useful features which are correlated with true label through rectifying gradient-weighted attention. Besides, feature alignment activates attention rectification by reducing perturbations in high-level feature. Our proposed method FAAR surpasses other existing AT methods in three aspects. First, it pushes the model to keep invariant when dealing with different adversarial attacks and different magnitude of perturbations. Second, it can be applied to any convolution neural networks. Third, the training process is end-to-end. For experiments, FAAR shows promising defense performance on CIFAR-10 and ImageNet.

源语言英语
主期刊名2020 IEEE International Conference on Multimedia and Expo, ICME 2020
出版商IEEE Computer Society
ISBN(电子版)9781728113319
DOI
出版状态已出版 - 7月 2020
活动2020 IEEE International Conference on Multimedia and Expo, ICME 2020 - London, 英国
期限: 6 7月 202010 7月 2020

出版系列

姓名Proceedings - IEEE International Conference on Multimedia and Expo
2020-July
ISSN(印刷版)1945-7871
ISSN(电子版)1945-788X

会议

会议2020 IEEE International Conference on Multimedia and Expo, ICME 2020
国家/地区英国
London
时期6/07/2010/07/20

指纹

探究 'Two-way feature-aligned and attention-rectified adversarial training' 的科研主题。它们共同构成独一无二的指纹。

引用此