Design of Face Image Super-Resolution Network Based on Multi-scale Perceptive Inception Dense Block

Jiayu Liu*, Xiujie Qu, Xinyan He, Qiang Zhu

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Within the realm of image data manipulation, enhancing resolution and reconstructing facial visuals stand as pivotal methods designed to restore poorly defined images impacted by unidentified deterioration mechanisms. Traditional non-generative adversarial network (non-GAN) models suffer from high-frequency information reconstruction deficiencies when restoring high-frequency information in images. Utilizing the adversarial learning process between generators and discriminators, GANs are capable of synthesizing high-definition imagery enriched with authentic texture and enhanced clarity. However, they have inherent limitations in terms of pixel-level changes, line distortions and detail recovery. In the context of Mixed Residual of Multi-scale Perceptive Inception Dense Block GAN (MRMPIDBGAN), a Multi-scale Perceptive Inception Block (MPIB) is engineered for the robust retrieval of features and multi-scale data from facial images. Based on MPIB, a Mixed Residual of Multi-scale Perceptive Inception Dense Block (MRMPIDB) is further constructed to form the generator, which facilitates better capture and reconstruction of facial image details. Secondly, to more accurately capture the local texture information of the image, the original VGG discriminator in ESRGAN is replaced by a U-Net discriminator, and spectral normalization techniques are applied to improve the stability of the discriminator. Subsequently, an innovative upsampling technique is formulated to optimize efficacy yet minimize computational demands. Further improvements in the model's efficacy are achieved through the replacement of the traditional VGG loss by a metric grounded in perceptual criteria. Experimental results confirm the effectiveness of this algorithm in super-resolution facial image restoration and demonstrate significant advantages in facial image reconstruction, clear texture capture and feature detailing compared to existing methods.

Original languageEnglish
Title of host publicationIEEE ITAIC 2023 - IEEE 11th Joint International Information Technology and Artificial Intelligence Conference
EditorsBing Xu, Kefen Mou
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages1977-1982
Number of pages6
ISBN (Electronic)9798350333664
DOIs
Publication statusPublished - 2023
Event11th Joint International Information Technology and Artificial Intelligence Conference, ITAIC 2023 - Chongqing, China
Duration: 8 Dec 202310 Dec 2023

Publication series

NameIEEE Joint International Information Technology and Artificial Intelligence Conference (ITAIC)
ISSN (Print)2693-2865

Conference

Conference11th Joint International Information Technology and Artificial Intelligence Conference, ITAIC 2023
Country/TerritoryChina
CityChongqing
Period8/12/2310/12/23

Keywords

  • GANs
  • facial image enhancement
  • hybrid upsampling
  • perceptual metrics

Fingerprint

Dive into the research topics of 'Design of Face Image Super-Resolution Network Based on Multi-scale Perceptive Inception Dense Block'. Together they form a unique fingerprint.

Cite this