TY - GEN
T1 - Design of Face Image Super-Resolution Network Based on Multi-scale Perceptive Inception Dense Block
AU - Liu, Jiayu
AU - Qu, Xiujie
AU - He, Xinyan
AU - Zhu, Qiang
N1 - Publisher Copyright:
© 2023 IEEE.
PY - 2023
Y1 - 2023
N2 - Within the realm of image data manipulation, enhancing resolution and reconstructing facial visuals stand as pivotal methods designed to restore poorly defined images impacted by unidentified deterioration mechanisms. Traditional non-generative adversarial network (non-GAN) models suffer from high-frequency information reconstruction deficiencies when restoring high-frequency information in images. Utilizing the adversarial learning process between generators and discriminators, GANs are capable of synthesizing high-definition imagery enriched with authentic texture and enhanced clarity. However, they have inherent limitations in terms of pixel-level changes, line distortions and detail recovery. In the context of Mixed Residual of Multi-scale Perceptive Inception Dense Block GAN (MRMPIDBGAN), a Multi-scale Perceptive Inception Block (MPIB) is engineered for the robust retrieval of features and multi-scale data from facial images. Based on MPIB, a Mixed Residual of Multi-scale Perceptive Inception Dense Block (MRMPIDB) is further constructed to form the generator, which facilitates better capture and reconstruction of facial image details. Secondly, to more accurately capture the local texture information of the image, the original VGG discriminator in ESRGAN is replaced by a U-Net discriminator, and spectral normalization techniques are applied to improve the stability of the discriminator. Subsequently, an innovative upsampling technique is formulated to optimize efficacy yet minimize computational demands. Further improvements in the model's efficacy are achieved through the replacement of the traditional VGG loss by a metric grounded in perceptual criteria. Experimental results confirm the effectiveness of this algorithm in super-resolution facial image restoration and demonstrate significant advantages in facial image reconstruction, clear texture capture and feature detailing compared to existing methods.
AB - Within the realm of image data manipulation, enhancing resolution and reconstructing facial visuals stand as pivotal methods designed to restore poorly defined images impacted by unidentified deterioration mechanisms. Traditional non-generative adversarial network (non-GAN) models suffer from high-frequency information reconstruction deficiencies when restoring high-frequency information in images. Utilizing the adversarial learning process between generators and discriminators, GANs are capable of synthesizing high-definition imagery enriched with authentic texture and enhanced clarity. However, they have inherent limitations in terms of pixel-level changes, line distortions and detail recovery. In the context of Mixed Residual of Multi-scale Perceptive Inception Dense Block GAN (MRMPIDBGAN), a Multi-scale Perceptive Inception Block (MPIB) is engineered for the robust retrieval of features and multi-scale data from facial images. Based on MPIB, a Mixed Residual of Multi-scale Perceptive Inception Dense Block (MRMPIDB) is further constructed to form the generator, which facilitates better capture and reconstruction of facial image details. Secondly, to more accurately capture the local texture information of the image, the original VGG discriminator in ESRGAN is replaced by a U-Net discriminator, and spectral normalization techniques are applied to improve the stability of the discriminator. Subsequently, an innovative upsampling technique is formulated to optimize efficacy yet minimize computational demands. Further improvements in the model's efficacy are achieved through the replacement of the traditional VGG loss by a metric grounded in perceptual criteria. Experimental results confirm the effectiveness of this algorithm in super-resolution facial image restoration and demonstrate significant advantages in facial image reconstruction, clear texture capture and feature detailing compared to existing methods.
KW - GANs
KW - facial image enhancement
KW - hybrid upsampling
KW - perceptual metrics
UR - http://www.scopus.com/inward/record.url?scp=85186082669&partnerID=8YFLogxK
U2 - 10.1109/ITAIC58329.2023.10408957
DO - 10.1109/ITAIC58329.2023.10408957
M3 - Conference contribution
AN - SCOPUS:85186082669
T3 - IEEE Joint International Information Technology and Artificial Intelligence Conference (ITAIC)
SP - 1977
EP - 1982
BT - IEEE ITAIC 2023 - IEEE 11th Joint International Information Technology and Artificial Intelligence Conference
A2 - Xu, Bing
A2 - Mou, Kefen
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 11th Joint International Information Technology and Artificial Intelligence Conference, ITAIC 2023
Y2 - 8 December 2023 through 10 December 2023
ER -