Structure-Guided Image Inpainting Based on Multi-Scale Attention Pyramid Network

Jun Gong, Senlin Luo, Wenxin Yu, Liang Nie*

*Corresponding author for this work

Research output: Contribution to journal › Article › Peer-reviewed

Abstract

Current single-view image inpainting methods often suffer from low image information utilization and suboptimal repair outcomes. To address these challenges, this paper introduces a novel image inpainting framework that leverages a structure-guided multi-scale attention pyramid network. This network consists of a structural repair network and a multi-scale attention pyramid semantic repair network. The structural repair component utilizes a dual-branch U-Net network for robust structure prediction under strong constraints. The predicted structural view then serves as auxiliary information for the semantic repair network. This latter network exploits the pyramid structure to extract multi-scale features of the image, which are further refined through an attention feature fusion module. Additionally, a separable gated convolution strategy is employed during feature extraction to minimize the impact of invalid information from missing areas, thereby enhancing the restoration quality. Experiments conducted on standard datasets such as Paris Street View and CelebA demonstrate the superiority of our approach over existing methods through quantitative and qualitative comparisons. Further ablation studies, by incrementally integrating proposed mechanisms into a baseline model, substantiate the effectiveness of our multi-view restoration strategy, separable gated convolution, and multi-scale attention feature fusion.

Original language: English
Article number: 8325
Journal: Applied Sciences (Switzerland)
Volume: 14
Issue: 18
Publication status: Published - September 2024