TY - GEN
T1 - Partially Occluded Face Expression Recognition with CBAM-Based Residual Network for Teaching Scene
AU - Bai, Yanxing
AU - Chen, Luefeng
AU - Li, Min
AU - Wu, Min
AU - Pedrycz, Witold
AU - Hirota, Kaoru
N1 - Publisher Copyright:
© 2023 IEEE.
PY - 2023
Y1 - 2023
N2 - In this paper, a deep residual network based on convolutional block attention module (CBAM) is proposed, which is utilized for feature extraction of partially occluded face expression data. The proposed method overcomes the problem of localized occlusion face feature extraction by focusing on the regions and channels containing important information in the occluded face data through CBAM. Multi-task cascaded convolutional networks (MTCNN) are firstly utilized to localize the key regions of face emotion, and then deep emotion features are extracted by CBAM-ResNet network. The final emotion labels are generated. The effectiveness of this paper's method is verified on the RAF-DB dataset and the occluded CK+ dataset. The experimental accuracy in the RAF-DB dataset is 76.3%, which is 3.74% and 1.64% higher than the accuracy produced by the method of RGBT, and the WLS-RF, respectively. Application experiments are carried out in the real teaching scenario, which verifies the applicability of the algorithm in the real teaching scene.
AB - In this paper, a deep residual network based on convolutional block attention module (CBAM) is proposed, which is utilized for feature extraction of partially occluded face expression data. The proposed method overcomes the problem of localized occlusion face feature extraction by focusing on the regions and channels containing important information in the occluded face data through CBAM. Multi-task cascaded convolutional networks (MTCNN) are firstly utilized to localize the key regions of face emotion, and then deep emotion features are extracted by CBAM-ResNet network. The final emotion labels are generated. The effectiveness of this paper's method is verified on the RAF-DB dataset and the occluded CK+ dataset. The experimental accuracy in the RAF-DB dataset is 76.3%, which is 3.74% and 1.64% higher than the accuracy produced by the method of RGBT, and the WLS-RF, respectively. Application experiments are carried out in the real teaching scenario, which verifies the applicability of the algorithm in the real teaching scene.
KW - Attention mechanism
KW - CBAM
KW - Partially occluded facial emotion recognition
KW - Teaching scene
UR - http://www.scopus.com/inward/record.url?scp=85189324781&partnerID=8YFLogxK
U2 - 10.1109/CAC59555.2023.10450205
DO - 10.1109/CAC59555.2023.10450205
M3 - Conference contribution
AN - SCOPUS:85189324781
T3 - Proceedings - 2023 China Automation Congress, CAC 2023
SP - 6052
EP - 6057
BT - Proceedings - 2023 China Automation Congress, CAC 2023
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2023 China Automation Congress, CAC 2023
Y2 - 17 November 2023 through 19 November 2023
ER -