Joint Learning of Image Deblurring and Depth Estimation Through Adversarial Multi-Task Network

Shengyu Hou; Mengyin Fu; Wenjie Song

doi:10.1109/TCSVT.2023.3279981

Joint Learning of Image Deblurring and Depth Estimation Through Adversarial Multi-Task Network

Shengyu Hou, Mengyin Fu, Wenjie Song^*

^*此作品的通讯作者

自动化学院

科研成果: 期刊稿件 › 文章 › 同行评审

4 引用（Scopus）

摘要

Self-supervised monocular depth estimation methods have achieved remarkable results on natural clear images. However, it is still a serious challenge to directly recover depth information from blurred images caused by long-time exposure while camera fast moving. To address this issue, we propose a unified framework for simultaneous deblurring and depth estimation (SDDE), which has higher coupling performance and flexibility compared with the simple concatenation strategy of deblurring model and depth estimation model. This framework mainly benefits from three features: 1) a novel Task-aware Fusion Module (TFM) to adaptively select the most relevant intermediate shared features for the dual decoder network by aggregating multi-scale features, 2) a unique Spatial Interaction Module (SIM) to learn higher-order representation in the encoder stage to better describe complex boundaries of different classes in high-dimensional space, and focuses on the task-related region by modeling the pairwise spatial correlation of the holistic tensor, 3) a Priors-Based Composite Regularization term to jointly optimize the shared encoder-dual decoder network. This work was evaluated on multiple datasets, including: Stereo blur, KITTI,NYUv2, REDS and our own large-scale stereo blur dataset, resulting in state-of-the-art results for depth estimation and image deblurring, respectively.

源语言	英语
文章编号	3279981
页（从-至）	7327-7341
页数	15
期刊	IEEE Transactions on Circuits and Systems for Video Technology
卷	33
期	12
DOI	https://doi.org/10.1109/TCSVT.2023.3279981
出版状态	已出版 - 1 12月 2023

访问文件

10.1109/TCSVT.2023.3279981

其它文件与链接

链接到 Scopus 的出版物

引用此

@article{9a4a57b0929e4d7ab2a3f99cef6a7e5f,

title = "Joint Learning of Image Deblurring and Depth Estimation Through Adversarial Multi-Task Network",

abstract = "Self-supervised monocular depth estimation methods have achieved remarkable results on natural clear images. However, it is still a serious challenge to directly recover depth information from blurred images caused by long-time exposure while camera fast moving. To address this issue, we propose a unified framework for simultaneous deblurring and depth estimation (SDDE), which has higher coupling performance and flexibility compared with the simple concatenation strategy of deblurring model and depth estimation model. This framework mainly benefits from three features: 1) a novel Task-aware Fusion Module (TFM) to adaptively select the most relevant intermediate shared features for the dual decoder network by aggregating multi-scale features, 2) a unique Spatial Interaction Module (SIM) to learn higher-order representation in the encoder stage to better describe complex boundaries of different classes in high-dimensional space, and focuses on the task-related region by modeling the pairwise spatial correlation of the holistic tensor, 3) a Priors-Based Composite Regularization term to jointly optimize the shared encoder-dual decoder network. This work was evaluated on multiple datasets, including: Stereo blur, KITTI,NYUv2, REDS and our own large-scale stereo blur dataset, resulting in state-of-the-art results for depth estimation and image deblurring, respectively.",

keywords = "Monocular depth estimation, generative adversarial network, image deblurring, multi-task learning",

author = "Shengyu Hou and Mengyin Fu and Wenjie Song",

note = "Publisher Copyright: {\textcopyright} 1991-2012 IEEE.",

year = "2023",

month = dec,

day = "1",

doi = "10.1109/TCSVT.2023.3279981",

language = "English",

volume = "33",

pages = "7327--7341",

journal = "IEEE Transactions on Circuits and Systems for Video Technology",

issn = "1051-8215",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

number = "12",

}

TY - JOUR

T1 - Joint Learning of Image Deblurring and Depth Estimation Through Adversarial Multi-Task Network

AU - Hou, Shengyu

AU - Fu, Mengyin

AU - Song, Wenjie

PY - 2023/12/1

Y1 - 2023/12/1

N2 - Self-supervised monocular depth estimation methods have achieved remarkable results on natural clear images. However, it is still a serious challenge to directly recover depth information from blurred images caused by long-time exposure while camera fast moving. To address this issue, we propose a unified framework for simultaneous deblurring and depth estimation (SDDE), which has higher coupling performance and flexibility compared with the simple concatenation strategy of deblurring model and depth estimation model. This framework mainly benefits from three features: 1) a novel Task-aware Fusion Module (TFM) to adaptively select the most relevant intermediate shared features for the dual decoder network by aggregating multi-scale features, 2) a unique Spatial Interaction Module (SIM) to learn higher-order representation in the encoder stage to better describe complex boundaries of different classes in high-dimensional space, and focuses on the task-related region by modeling the pairwise spatial correlation of the holistic tensor, 3) a Priors-Based Composite Regularization term to jointly optimize the shared encoder-dual decoder network. This work was evaluated on multiple datasets, including: Stereo blur, KITTI,NYUv2, REDS and our own large-scale stereo blur dataset, resulting in state-of-the-art results for depth estimation and image deblurring, respectively.

AB - Self-supervised monocular depth estimation methods have achieved remarkable results on natural clear images. However, it is still a serious challenge to directly recover depth information from blurred images caused by long-time exposure while camera fast moving. To address this issue, we propose a unified framework for simultaneous deblurring and depth estimation (SDDE), which has higher coupling performance and flexibility compared with the simple concatenation strategy of deblurring model and depth estimation model. This framework mainly benefits from three features: 1) a novel Task-aware Fusion Module (TFM) to adaptively select the most relevant intermediate shared features for the dual decoder network by aggregating multi-scale features, 2) a unique Spatial Interaction Module (SIM) to learn higher-order representation in the encoder stage to better describe complex boundaries of different classes in high-dimensional space, and focuses on the task-related region by modeling the pairwise spatial correlation of the holistic tensor, 3) a Priors-Based Composite Regularization term to jointly optimize the shared encoder-dual decoder network. This work was evaluated on multiple datasets, including: Stereo blur, KITTI,NYUv2, REDS and our own large-scale stereo blur dataset, resulting in state-of-the-art results for depth estimation and image deblurring, respectively.

KW - Monocular depth estimation

KW - generative adversarial network

KW - image deblurring

KW - multi-task learning

UR - http://www.scopus.com/inward/record.url?scp=85161080246&partnerID=8YFLogxK

U2 - 10.1109/TCSVT.2023.3279981

DO - 10.1109/TCSVT.2023.3279981

M3 - Article

AN - SCOPUS:85161080246

SN - 1051-8215

VL - 33

SP - 7327

EP - 7341

JO - IEEE Transactions on Circuits and Systems for Video Technology

JF - IEEE Transactions on Circuits and Systems for Video Technology

IS - 12

M1 - 3279981

ER -

Joint Learning of Image Deblurring and Depth Estimation Through Adversarial Multi-Task Network

摘要

访问文件

其它文件与链接

指纹

引用此