Stimulating Diffusion Model for Image Denoising via Adaptive Embedding and Ensembling

Tong Li; Hansen Feng; Lizhi Wang; Lin Zhu; Zhiwei Xiong; Hua Huang

doi:10.1109/TPAMI.2024.3432812

Tong Li, Hansen Feng, Lizhi Wang^*, Lin Zhu, Zhiwei Xiong, Hua Huang

^*Corresponding author for this work

School of Computer Science

Research output: Contribution to journal › Article › peer-review

Abstract

Image denoising is a fundamental problem in computational photography, where achieving high perception with low distortion is highly demanding. Current methods either struggle with perceptual quality or suffer from significant distortion. Recently, the emerging diffusion model has achieved state-of-the-art performance in various tasks and demonstrates great potential for image denoising. However, stimulating diffusion models for image denoising is not straightforward and requires solving several critical problems. For one thing, the input inconsistency hinders the connection between diffusion models and image denoising. For another, the content inconsistency between the generated image and the desired denoised image introduces distortion. To tackle these problems, we present a novel strategy called the Diffusion Model for Image Denoising (DMID) by understanding and rethinking the diffusion model from a denoising perspective. Our DMID strategy includes an adaptive embedding method that embeds the noisy image into a pre-trained unconditional diffusion model and an adaptive ensembling method that reduces distortion in the denoised image. Our DMID strategy achieves state-of-the-art performance on both distortion-based and perception-based metrics, for both Gaussian and real-world image denoising.

Original language	English
Pages (from-to)	8240-8257
Number of pages	18
Journal	IEEE Transactions on Pattern Analysis and Machine Intelligence
Volume	46
Issue number	12
DOI	https://doi.org/10.1109/TPAMI.2024.3432812
Publication status	Published - 2024

Cite this

@article{5f7d2b48339b419da9dd3589ac2fde3d,

title = "Stimulating Diffusion Model for Image Denoising via Adaptive Embedding and Ensembling",

abstract = "Image denoising is a fundamental problem in computational photography, where achieving high perception with low distortion is highly demanding. Current methods either struggle with perceptual quality or suffer from significant distortion. Recently, the emerging diffusion model has achieved state-of-the-art performance in various tasks and demonstrates great potential for image denoising. However, stimulating diffusion models for image denoising is not straightforward and requires solving several critical problems. For one thing, the input inconsistency hinders the connection between diffusion models and image denoising. For another, the content inconsistency between the generated image and the desired denoised image introduces distortion. To tackle these problems, we present a novel strategy called the Diffusion Model for Image Denoising (DMID) by understanding and rethinking the diffusion model from a denoising perspective. Our DMID strategy includes an adaptive embedding method that embeds the noisy image into a pre-trained unconditional diffusion model and an adaptive ensembling method that reduces distortion in the denoised image. Our DMID strategy achieves state-of-the-art performance on both distortion-based and perception-based metrics, for both Gaussian and real-world image denoising.",

keywords = "Computational photography, diffusion model, distortion-perception, image denoising, self-supervised",

author = "Tong Li and Hansen Feng and Lizhi Wang and Lin Zhu and Zhiwei Xiong and Hua Huang",

note = "Publisher Copyright: {\textcopyright} 1979-2012 IEEE.",

year = "2024",

doi = "10.1109/TPAMI.2024.3432812",

language = "English",

volume = "46",

pages = "8240--8257",

journal = "IEEE Transactions on Pattern Analysis and Machine Intelligence",

issn = "0162-8828",

publisher = "IEEE Computer Society",

number = "12",

}

TY - JOUR

T1 - Stimulating Diffusion Model for Image Denoising via Adaptive Embedding and Ensembling

AU - Li, Tong

AU - Feng, Hansen

AU - Wang, Lizhi

AU - Zhu, Lin

AU - Xiong, Zhiwei

AU - Huang, Hua

PY - 2024

Y1 - 2024

AB - Image denoising is a fundamental problem in computational photography, where achieving high perception with low distortion is highly demanding. Current methods either struggle with perceptual quality or suffer from significant distortion. Recently, the emerging diffusion model has achieved state-of-the-art performance in various tasks and demonstrates great potential for image denoising. However, stimulating diffusion models for image denoising is not straightforward and requires solving several critical problems. For one thing, the input inconsistency hinders the connection between diffusion models and image denoising. For another, the content inconsistency between the generated image and the desired denoised image introduces distortion. To tackle these problems, we present a novel strategy called the Diffusion Model for Image Denoising (DMID) by understanding and rethinking the diffusion model from a denoising perspective. Our DMID strategy includes an adaptive embedding method that embeds the noisy image into a pre-trained unconditional diffusion model and an adaptive ensembling method that reduces distortion in the denoised image. Our DMID strategy achieves state-of-the-art performance on both distortion-based and perception-based metrics, for both Gaussian and real-world image denoising.

KW - Computational photography

KW - diffusion model

KW - distortion-perception

KW - image denoising

KW - self-supervised

UR - http://www.scopus.com/inward/record.url?scp=85199381444&partnerID=8YFLogxK

U2 - 10.1109/TPAMI.2024.3432812

DO - 10.1109/TPAMI.2024.3432812

M3 - Article

AN - SCOPUS:85199381444

SN - 0162-8828

VL - 46

SP - 8240

EP - 8257

JO - IEEE Transactions on Pattern Analysis and Machine Intelligence

JF - IEEE Transactions on Pattern Analysis and Machine Intelligence

IS - 12

ER -
