CSPDAE: Colorization via SuperPixel Downsampler Denoising AutoEncoder towards enhanced color structural clarity

Wenqu Zhao; Lingxue Wang; Miao Dong; Yi Cai

doi:10.1016/j.neucom.2024.129028

Wenqu Zhao, Lingxue Wang^*, Miao Dong, Yi Cai

^*Corresponding author for this work

School of Optics and Photonics

Beijing Institute of Technology

Research output: Contribution to journal › Article › Peer-reviewed

Abstract

Based on the various structural textures, sizes, and shapes of the gray image, the SuperPixel (SP)-based methods convert the rigid pixels into adaptable image patches that share common features. As a result, the SP-based colorization technique not only enhances the color appearance but also preserves the topological integrity of the images. Despite the effectiveness of neural network-based colorization methods, their integration with SP segmentation has traditionally been complex and cumbersome. To address this issue, we propose the Colorization SP Downsampler Denoising AutoEncoder (CSPDAE), where the SP downsampler integrates SP segmentation directly into the network, eliminating the need for any prior input. The SP downsampler addresses the challenges of computational complexity and small region clustering, which are the main obstacles preventing Transformer architectures from being applied to pixel-level segmentation, through the use of SP cross-attention and aggregated positional embedding (APE). Furthermore, we have incorporated a Color Weight (CW) loss, based on the CIEDE2000 color difference formula, to ensure balanced pixel sampling and to improve the precision of detailed color representation. The experimental results confirm the effectiveness of our method, demonstrating its capacity to produce colors with greater structural accuracy and visually appealing details.

Original language	English
Article number	129028
Journal	Neurocomputing
Volume	618
DOI	https://doi.org/10.1016/j.neucom.2024.129028
Publication status	Published - 14 Feb 2025


Cite this

@article{485832f7282b423899d1407b694b5bbe,

title = "CSPDAE: Colorization via SuperPixel Downsampler Denoising AutoEncoder towards enhanced color structural clarity",

abstract = "Based on the various structural textures, sizes, and shapes of the gray image, the SuperPixel (SP)-based methods convert the rigid pixels into adaptable image patches that share common features. As a result, the SP-based colorization technique not only enhances the color appearance but also preserves the topological integrity of the images. Despite the effectiveness of neural network-based colorization methods, their integration with SP segmentation has traditionally been complex and cumbersome. To address this issue, we propose the Colorization SP Downsampler Denoising AutoEncoder (CSPDAE), where the SP downsampler integrates SP segmentation directly into the network, eliminating the need for any prior input. The SP downsampler addresses the challenges of computational complexity and small region clustering, which are the main obstacles preventing Transformer architectures from being applied to pixel-level segmentation, through the use of SP cross-attention and aggregated positional embedding (APE). Furthermore, we have incorporated a Color Weight (CW) loss, based on the CIEDE2000 color difference formula, to ensure balanced pixel sampling and to improve the precision of detailed color representation. The experimental results confirm the effectiveness of our method, demonstrating its capacity to produce colors with greater structural accuracy and visually appealing details.",

keywords = "CIEDE2000, Colorization, Deeplearning, Superpixel",

author = "Wenqu Zhao and Lingxue Wang and Miao Dong and Yi Cai",

note = "Publisher Copyright: {\textcopyright} 2024",

year = "2025",

month = feb,

day = "14",

doi = "10.1016/j.neucom.2024.129028",

language = "English",

volume = "618",

journal = "Neurocomputing",

issn = "0925-2312",

publisher = "Elsevier B.V.",

}

TY - JOUR

T1 - CSPDAE

T2 - Colorization via SuperPixel Downsampler Denoising AutoEncoder towards enhanced color structural clarity

AU - Zhao, Wenqu

AU - Wang, Lingxue

AU - Dong, Miao

AU - Cai, Yi

PY - 2025/2/14

Y1 - 2025/2/14

AB - Based on the various structural textures, sizes, and shapes of the gray image, the SuperPixel (SP)-based methods convert the rigid pixels into adaptable image patches that share common features. As a result, the SP-based colorization technique not only enhances the color appearance but also preserves the topological integrity of the images. Despite the effectiveness of neural network-based colorization methods, their integration with SP segmentation has traditionally been complex and cumbersome. To address this issue, we propose the Colorization SP Downsampler Denoising AutoEncoder (CSPDAE), where the SP downsampler integrates SP segmentation directly into the network, eliminating the need for any prior input. The SP downsampler addresses the challenges of computational complexity and small region clustering, which are the main obstacles preventing Transformer architectures from being applied to pixel-level segmentation, through the use of SP cross-attention and aggregated positional embedding (APE). Furthermore, we have incorporated a Color Weight (CW) loss, based on the CIEDE2000 color difference formula, to ensure balanced pixel sampling and to improve the precision of detailed color representation. The experimental results confirm the effectiveness of our method, demonstrating its capacity to produce colors with greater structural accuracy and visually appealing details.

KW - CIEDE2000

KW - Colorization

KW - Deeplearning

KW - Superpixel

UR - http://www.scopus.com/inward/record.url?scp=85211172656&partnerID=8YFLogxK

U2 - 10.1016/j.neucom.2024.129028

DO - 10.1016/j.neucom.2024.129028

M3 - Article

AN - SCOPUS:85211172656

SN - 0925-2312

VL - 618

JO - Neurocomputing

JF - Neurocomputing

M1 - 129028

ER -
