Stimulating Diffusion Model for Image Denoising via Adaptive Embedding and Ensembling

Tong Li, Hansen Feng, Lizhi Wang*, Lin Zhu, Zhiwei Xiong, Hua Huang

*此作品的通讯作者

科研成果: 期刊稿件文章同行评审

摘要

Image denoising is a fundamental problem in computational photography, where achieving high perception with low distortion is highly demanding. Current methods either struggle with perceptual quality or suffer from significant distortion. Recently, the emerging diffusion model has achieved state-of-the-art performance in various tasks and demonstrates great potential for image denoising. However, stimulating diffusion models for image denoising is not straightforward and requires solving several critical problems. For one thing, the input inconsistency hinders the connection between diffusion models and image denoising. For another, the content inconsistency between the generated image and the desired denoised image introduces distortion. To tackle these problems, we present a novel strategy called the Diffusion Model for Image Denoising (DMID) by understanding and rethinking the diffusion model from a denoising perspective. Our DMID strategy includes an adaptive embedding method that embeds the noisy image into a pre-trained unconditional diffusion model and an adaptive ensembling method that reduces distortion in the denoised image. Our DMID strategy achieves state-of-the-art performance on both distortion-based and perception-based metrics, for both Gaussian and real-world image denoising.

源语言英语
页(从-至)8240-8257
页数18
期刊IEEE Transactions on Pattern Analysis and Machine Intelligence
46
12
DOI
出版状态已出版 - 2024

指纹

探究 'Stimulating Diffusion Model for Image Denoising via Adaptive Embedding and Ensembling' 的科研主题。它们共同构成独一无二的指纹。

引用此