Human-aware motion deblurring

Ziyi Shen; Wenguan Wang; Xiankai Lu; Jianbing Shen; Haibin Ling; Tingfa Xu; Ling Shao

doi:10.1109/ICCV.2019.00567

Human-aware motion deblurring

Ziyi Shen, Wenguan Wang, Xiankai Lu, Jianbing Shen^*, Haibin Ling, Tingfa Xu, Ling Shao

^*Corresponding author for this work

School of Optics and Photonics

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

254 Citations (Scopus)

Abstract

This paper proposes a human-aware deblurring model that disentangles the motion blur between foreground (FG) humans and background (BG). The proposed model is based on a triple-branch encoder-decoder architecture. The first two branches are learned for sharpening FG humans and BG details, respectively; while the third one produces global, harmonious results by comprehensively fusing multi-scale deblurring information from the two domains. The proposed model is further endowed with a supervised, human-aware attention mechanism in an end-to-end fashion. It learns a soft mask that encodes FG human information and explicitly drives the FG/BG decoder-branches to focus on their specific domains. Above designs lead to a fully differentiable motion deblurring network, which can be trained end-to-end. To further benefit the research towards Human-aware Image Deblurring, we introduce a large-scale dataset, named HIDE, which consists of 8,422 blurry and sharp image pairs with 65,784 densely annotated FG human bounding boxes. HIDE is specifically built to span a broad range of scenes, human object sizes, motion patterns, and background complexities. Extensive experiments on public benchmarks and our dataset demonstrate that our model performs favorably against the state-of-the-art motion deblurring methods, especially in capturing semantic details.

Original language	English
Title of host publication	Proceedings - 2019 International Conference on Computer Vision, ICCV 2019
Publisher	Institute of Electrical and Electronics Engineers Inc.
Pages	5571-5580
Number of pages	10
ISBN (Electronic)	9781728148038
DOIs	https://doi.org/10.1109/ICCV.2019.00567
Publication status	Published - Oct 2019
Event	17th IEEE/CVF International Conference on Computer Vision, ICCV 2019 - Seoul, Korea, Republic of Duration: 27 Oct 2019 → 2 Nov 2019

Publication series

Name	Proceedings of the IEEE International Conference on Computer Vision
Volume	2019-October
ISSN (Print)	1550-5499

Conference

Conference	17th IEEE/CVF International Conference on Computer Vision, ICCV 2019
Country/Territory	Korea, Republic of
City	Seoul
Period	27/10/19 → 2/11/19

Access to Document

10.1109/ICCV.2019.00567

Cite this

Shen, Z., Wang, W., Lu, X., Shen, J., Ling, H., Xu, T., & Shao, L. (2019). Human-aware motion deblurring. In Proceedings - 2019 International Conference on Computer Vision, ICCV 2019 (pp. 5571-5580). Article 9010839 (Proceedings of the IEEE International Conference on Computer Vision; Vol. 2019-October). Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/ICCV.2019.00567

@inproceedings{cd9d4fd8ff78456cbef3bdde3189999d,

title = "Human-aware motion deblurring",

abstract = "This paper proposes a human-aware deblurring model that disentangles the motion blur between foreground (FG) humans and background (BG). The proposed model is based on a triple-branch encoder-decoder architecture. The first two branches are learned for sharpening FG humans and BG details, respectively; while the third one produces global, harmonious results by comprehensively fusing multi-scale deblurring information from the two domains. The proposed model is further endowed with a supervised, human-aware attention mechanism in an end-to-end fashion. It learns a soft mask that encodes FG human information and explicitly drives the FG/BG decoder-branches to focus on their specific domains. Above designs lead to a fully differentiable motion deblurring network, which can be trained end-to-end. To further benefit the research towards Human-aware Image Deblurring, we introduce a large-scale dataset, named HIDE, which consists of 8,422 blurry and sharp image pairs with 65,784 densely annotated FG human bounding boxes. HIDE is specifically built to span a broad range of scenes, human object sizes, motion patterns, and background complexities. Extensive experiments on public benchmarks and our dataset demonstrate that our model performs favorably against the state-of-the-art motion deblurring methods, especially in capturing semantic details.",

author = "Ziyi Shen and Wenguan Wang and Xiankai Lu and Jianbing Shen and Haibin Ling and Tingfa Xu and Ling Shao",

note = "Publisher Copyright: {\textcopyright} 2019 IEEE.; 17th IEEE/CVF International Conference on Computer Vision, ICCV 2019 ; Conference date: 27-10-2019 Through 02-11-2019",

year = "2019",

month = oct,

doi = "10.1109/ICCV.2019.00567",

language = "English",

series = "Proceedings of the IEEE International Conference on Computer Vision",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

pages = "5571--5580",

booktitle = "Proceedings - 2019 International Conference on Computer Vision, ICCV 2019",

address = "United States",

}

Shen, Z, Wang, W, Lu, X, Shen, J, Ling, H, Xu, T & Shao, L 2019, Human-aware motion deblurring. in Proceedings - 2019 International Conference on Computer Vision, ICCV 2019., 9010839, Proceedings of the IEEE International Conference on Computer Vision, vol. 2019-October, Institute of Electrical and Electronics Engineers Inc., pp. 5571-5580, 17th IEEE/CVF International Conference on Computer Vision, ICCV 2019, Seoul, Korea, Republic of, 27/10/19. https://doi.org/10.1109/ICCV.2019.00567

Human-aware motion deblurring. / Shen, Ziyi; Wang, Wenguan; Lu, Xiankai et al.
Proceedings - 2019 International Conference on Computer Vision, ICCV 2019. Institute of Electrical and Electronics Engineers Inc., 2019. p. 5571-5580 9010839 (Proceedings of the IEEE International Conference on Computer Vision; Vol. 2019-October).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

TY - GEN

T1 - Human-aware motion deblurring

AU - Shen, Ziyi

AU - Wang, Wenguan

AU - Lu, Xiankai

AU - Shen, Jianbing

AU - Ling, Haibin

AU - Xu, Tingfa

AU - Shao, Ling

PY - 2019/10

Y1 - 2019/10

N2 - This paper proposes a human-aware deblurring model that disentangles the motion blur between foreground (FG) humans and background (BG). The proposed model is based on a triple-branch encoder-decoder architecture. The first two branches are learned for sharpening FG humans and BG details, respectively; while the third one produces global, harmonious results by comprehensively fusing multi-scale deblurring information from the two domains. The proposed model is further endowed with a supervised, human-aware attention mechanism in an end-to-end fashion. It learns a soft mask that encodes FG human information and explicitly drives the FG/BG decoder-branches to focus on their specific domains. Above designs lead to a fully differentiable motion deblurring network, which can be trained end-to-end. To further benefit the research towards Human-aware Image Deblurring, we introduce a large-scale dataset, named HIDE, which consists of 8,422 blurry and sharp image pairs with 65,784 densely annotated FG human bounding boxes. HIDE is specifically built to span a broad range of scenes, human object sizes, motion patterns, and background complexities. Extensive experiments on public benchmarks and our dataset demonstrate that our model performs favorably against the state-of-the-art motion deblurring methods, especially in capturing semantic details.

AB - This paper proposes a human-aware deblurring model that disentangles the motion blur between foreground (FG) humans and background (BG). The proposed model is based on a triple-branch encoder-decoder architecture. The first two branches are learned for sharpening FG humans and BG details, respectively; while the third one produces global, harmonious results by comprehensively fusing multi-scale deblurring information from the two domains. The proposed model is further endowed with a supervised, human-aware attention mechanism in an end-to-end fashion. It learns a soft mask that encodes FG human information and explicitly drives the FG/BG decoder-branches to focus on their specific domains. Above designs lead to a fully differentiable motion deblurring network, which can be trained end-to-end. To further benefit the research towards Human-aware Image Deblurring, we introduce a large-scale dataset, named HIDE, which consists of 8,422 blurry and sharp image pairs with 65,784 densely annotated FG human bounding boxes. HIDE is specifically built to span a broad range of scenes, human object sizes, motion patterns, and background complexities. Extensive experiments on public benchmarks and our dataset demonstrate that our model performs favorably against the state-of-the-art motion deblurring methods, especially in capturing semantic details.

UR - http://www.scopus.com/inward/record.url?scp=85081895122&partnerID=8YFLogxK

U2 - 10.1109/ICCV.2019.00567

DO - 10.1109/ICCV.2019.00567

M3 - Conference contribution

AN - SCOPUS:85081895122

T3 - Proceedings of the IEEE International Conference on Computer Vision

SP - 5571

EP - 5580

BT - Proceedings - 2019 International Conference on Computer Vision, ICCV 2019

PB - Institute of Electrical and Electronics Engineers Inc.

T2 - 17th IEEE/CVF International Conference on Computer Vision, ICCV 2019

Y2 - 27 October 2019 through 2 November 2019

ER -

Human-aware motion deblurring

Abstract

Publication series

Conference

Access to Document

Other files and links

Fingerprint

Cite this