TY - JOUR
T1 - Similarity Knowledge Distillation with Calibrated Mask
AU - Wang, Qi
AU - Yu, Wenxin
AU - Che, Lu
AU - Liu, Chang
AU - Zhang, Zhiqiang
AU - Gong, Jun
AU - Chen, Peng
N1 - Publisher Copyright:
© 2024 IEEE.
PY - 2024
Y1 - 2024
N2 - In this paper, we propose a novel and efficient method for knowledge distillation, which is structurally simple and requires negligible computation overhead. Our method includes three modules. The first module is the calibrated mask, which avoids the teacher model's incorrect representation to disturb the student model's training; the second module and the third module improve the performance of the student model by the similarity of the sample and the process, respectively. The student model attains better performance in qualitative and quantitative evaluation through the judicious amalgamation of these three modules. Our method is experimented with through rigorous validation of canonical datasets, including CIFAR-100 and TinyImageNet. The experimental corroboration conclusively attests to the better performance of our method, soaring above the extant most state-of-the-art on both subjective and objective dimensions.
AB - In this paper, we propose a novel and efficient method for knowledge distillation, which is structurally simple and requires negligible computation overhead. Our method includes three modules. The first module is the calibrated mask, which avoids the teacher model's incorrect representation to disturb the student model's training; the second module and the third module improve the performance of the student model by the similarity of the sample and the process, respectively. The student model attains better performance in qualitative and quantitative evaluation through the judicious amalgamation of these three modules. Our method is experimented with through rigorous validation of canonical datasets, including CIFAR-100 and TinyImageNet. The experimental corroboration conclusively attests to the better performance of our method, soaring above the extant most state-of-the-art on both subjective and objective dimensions.
KW - Classification
KW - Computer Vision
KW - Deep Learning
KW - Knowledge Distillation
KW - Model Compression
UR - https://www.scopus.com/pages/publications/105001573144
U2 - 10.1109/ICASSP48485.2024.10447709
DO - 10.1109/ICASSP48485.2024.10447709
M3 - Conference article
AN - SCOPUS:105001573144
SN - 0736-7791
SP - 5770
EP - 5774
JO - Proceedings - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing
JF - Proceedings - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing
T2 - 2024 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2024
Y2 - 14 April 2024 through 19 April 2024
ER -