CSPDAE: Colorization via SuperPixel Downsampler Denoising AutoEncoder towards enhanced color structural clarity

Wenqu Zhao, Lingxue Wang*, Miao Dong, Yi Cai

*此作品的通讯作者

科研成果: 期刊稿件文章同行评审

摘要

Based on the various structural textures, sizes, and shapes of the gray image, the SuperPixel (SP)-based methods convert the rigid pixels into adaptable image patches that share common features. As a result, the SP-based colorization technique not only enhances the color appearance but also preserves the topological integrity of the images. Despite the effectiveness of neural network-based colorization methods, their integration with SP segmentation has traditionally been complex and cumbersome. To address this issue, we propose the Colorization SP Downsampler Denoising AutoEncoder (CSPDAE), where the SP downsampler integrates SP segmentation directly into the network, eliminating the need for any prior input. The SP downsampler addresses the challenges of computational complexity and small region clustering, which are the main obstacles preventing Transformer architectures from being applied to pixel-level segmentation, through the use of SP cross-attention and aggregated positional embedding (APE). Furthermore, we have incorporated a Color Weight (CW) loss, based on the CIEDE2000 color difference formula, to ensure balanced pixel sampling and to improve the precision of detailed color representation. The experimental results confirm the effectiveness of our method, demonstrating its capacity to produce colors with greater structural accuracy and visually appealing details.

源语言英语
文章编号129028
期刊Neurocomputing
618
DOI
出版状态已出版 - 14 2月 2025

指纹

探究 'CSPDAE: Colorization via SuperPixel Downsampler Denoising AutoEncoder towards enhanced color structural clarity' 的科研主题。它们共同构成独一无二的指纹。

引用此