Joint Learning of Image Deblurring and Depth Estimation Through Adversarial Multi-Task Network

Shengyu Hou, Mengyin Fu, Wenjie Song*

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

4 Citations (Scopus)

Abstract

Self-supervised monocular depth estimation methods have achieved remarkable results on natural clear images. However, it is still a serious challenge to directly recover depth information from blurred images caused by long-time exposure while camera fast moving. To address this issue, we propose a unified framework for simultaneous deblurring and depth estimation (SDDE), which has higher coupling performance and flexibility compared with the simple concatenation strategy of deblurring model and depth estimation model. This framework mainly benefits from three features: 1) a novel Task-aware Fusion Module (TFM) to adaptively select the most relevant intermediate shared features for the dual decoder network by aggregating multi-scale features, 2) a unique Spatial Interaction Module (SIM) to learn higher-order representation in the encoder stage to better describe complex boundaries of different classes in high-dimensional space, and focuses on the task-related region by modeling the pairwise spatial correlation of the holistic tensor, 3) a Priors-Based Composite Regularization term to jointly optimize the shared encoder-dual decoder network. This work was evaluated on multiple datasets, including: Stereo blur, KITTI,NYUv2, REDS and our own large-scale stereo blur dataset, resulting in state-of-the-art results for depth estimation and image deblurring, respectively.

Original languageEnglish
Article number3279981
Pages (from-to)7327-7341
Number of pages15
JournalIEEE Transactions on Circuits and Systems for Video Technology
Volume33
Issue number12
DOIs
Publication statusPublished - 1 Dec 2023

Keywords

  • Monocular depth estimation
  • generative adversarial network
  • image deblurring
  • multi-task learning

Fingerprint

Dive into the research topics of 'Joint Learning of Image Deblurring and Depth Estimation Through Adversarial Multi-Task Network'. Together they form a unique fingerprint.

Cite this