Two-Stage Self-Supervised Learning for Facial Action Unit Recognition

Hao Cheng, Xiang Xie, Shuang Liang

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

This paper proposes a two-stage self-supervised method for facial action unit recognition. First, an auto-encoder approach is applied, with an encoder which operates on a small proportion e.g., 40% of images patches. The decoder reconstructs the original image from latent features and learnable mask tokens. After training, the encoder is adapted to the task of AU recognition, yet poor results are observed in certain AU classes. To address the problem, contrastive learning is proposed to learn discriminative features. This method uses images from the VGG-Face2 dataset, which vary in terms of head pose, age and background. Experiments on AU recognition show that the two-stage method strengthens the representation quality. Compared to previous self-supervised methods, the pre-trained encoder achieves the best linear probing result on DISFA dataset, with the F1-score of 53.8%. Fine-tuning experiment is also conducted, and obtains the F1-score of 59.9%, with a roughly 3% gap to existing state-of-the-art method. The two-stage training method is easy to implement and expandable for further research.

源语言英语
主期刊名IVSP 2022 - 2022 4th International Conference on Image, Video and Signal Processing
出版商Association for Computing Machinery
80-84
页数5
ISBN(电子版)9781450387415
DOI
出版状态已出版 - 18 3月 2022
活动4th International Conference on Image, Video and Signal Processing, IVSP 2022 - Virtual, Online, 新加坡
期限: 18 3月 202220 3月 2022

出版系列

姓名ACM International Conference Proceeding Series

会议

会议4th International Conference on Image, Video and Signal Processing, IVSP 2022
国家/地区新加坡
Virtual, Online
时期18/03/2220/03/22

指纹

探究 'Two-Stage Self-Supervised Learning for Facial Action Unit Recognition' 的科研主题。它们共同构成独一无二的指纹。

引用此