TY - JOUR
T1 - Channel and spatial attention-guided network for deep high dynamic range imaging with large motions
AU - Zhang, Pingwei
AU - Zhou, Wenbiao
AU - Fan, Luyao
N1 - Publisher Copyright:
© The Author(s), under exclusive licence to Springer-Verlag GmbH Germany, part of Springer Nature 2023.
PY - 2024/3
Y1 - 2024/3
N2 - Multi-exposure fusion (MEF) is widely researched and applied in high dynamic range (HDR) imaging, where one of the most challenging problems is the artifacts caused by object motion between input images. Recently, deep learning methods have been applied to HDR imaging with excellent results, showing significant advantages. However, many methods cannot avoid artifacts owing to inaccurate alignment before merging HDR images. In this paper, we propose an end-to-end network (C-ED-GMNET) comprising a channel and spatial attention network, an encoder–decoder network and a gradual merging network for generating artifact-free HDR images in dynamic scenes. The attention module consists of two submodules that identify useful features and exclude harmful components in the inputs along the channel and spatial dimensions, respectively. The attention-guided feature maps are sent to the encoders for further feature extraction, and the outputs are then passed to a two-step gradual merging module that generates deep features progressively. In addition, global residual learning over all the inputs captures the differences between the merged image and the original images, and the merged image features are recovered by the decoder to obtain the final HDR image. Quantitative and qualitative experiments on two public datasets show that the proposed C-ED-GMNET produces better results than existing state-of-the-art methods and significantly reduces runtime, owing to the encoders, which reduce the amount of computation.
AB - Multi-exposure fusion (MEF) is widely researched and applied in high dynamic range (HDR) imaging, where one of the most challenging problems is the artifacts caused by object motion between input images. Recently, deep learning methods have been applied to HDR imaging with excellent results, showing significant advantages. However, many methods cannot avoid artifacts owing to inaccurate alignment before merging HDR images. In this paper, we propose an end-to-end network (C-ED-GMNET) comprising a channel and spatial attention network, an encoder–decoder network and a gradual merging network for generating artifact-free HDR images in dynamic scenes. The attention module consists of two submodules that identify useful features and exclude harmful components in the inputs along the channel and spatial dimensions, respectively. The attention-guided feature maps are sent to the encoders for further feature extraction, and the outputs are then passed to a two-step gradual merging module that generates deep features progressively. In addition, global residual learning over all the inputs captures the differences between the merged image and the original images, and the merged image features are recovered by the decoder to obtain the final HDR image. Quantitative and qualitative experiments on two public datasets show that the proposed C-ED-GMNET produces better results than existing state-of-the-art methods and significantly reduces runtime, owing to the encoders, which reduce the amount of computation.
KW - Attention mechanism
KW - Convolutional neural network
KW - Ghosting artifacts
KW - High dynamic range imaging
KW - Multi-exposure fusion
UR - http://www.scopus.com/inward/record.url?scp=85153590429&partnerID=8YFLogxK
U2 - 10.1007/s00371-023-02871-5
DO - 10.1007/s00371-023-02871-5
M3 - Article
AN - SCOPUS:85153590429
SN - 0178-2789
VL - 40
SP - 1583
EP - 1599
JO - The Visual Computer
JF - The Visual Computer
IS - 3
ER -