Joint estimation of depth and motion from a monocular endoscopy image sequence using a multi-loss rebalancing network

Shiyuan Liu; Jingfan Fan; Dengpan Song; Tianyu Fu; Yucong Lin; Deqiang Xiao; Hong Song; Yongtian Wang; Jian Yang

doi:10.1364/BOE.457475

Joint estimation of depth and motion from a monocular endoscopy image sequence using a multi-loss rebalancing network

Shiyuan Liu, Jingfan Fan, Dengpan Song, Tianyu Fu, Yucong Lin, Deqiang Xiao, Hong Song, Yongtian Wang, Jian Yang

Beijing Institute of Technology

科研成果: 期刊稿件 › 文章 › 同行评审

6 引用（Scopus）

摘要

Building an in vivo three-dimensional (3D) surface model from a monocular endoscopy is an effective technology to improve the intuitiveness and precision of clinical laparoscopic surgery. This paper proposes a multi-loss rebalancing-based method for joint estimation of depth and motion from a monocular endoscopy image sequence. The feature descriptors are used to provide monitoring signals for the depth estimation network and motion estimation network. The epipolar constraints of the sequence frame is considered in the neighborhood spatial information by depth estimation network to enhance the accuracy of depth estimation. The reprojection information of depth estimation is used to reconstruct the camera motion by motion estimation network with a multi-view relative pose fusion mechanism. The relative response loss, feature consistency loss, and epipolar consistency loss function are defined to improve the robustness and accuracy of the proposed unsupervised learning-based method. Evaluations are implemented on public datasets. The error of motion estimation in three scenes decreased by 42.1%,53.6%, and 50.2%, respectively. And the average error of 3D reconstruction is 6.456 ± 1.798mm. This demonstrates its capability to generate reliable depth estimation and trajectory reconstruction results for endoscopy images and meaningful applications in clinical.

源语言	英语
页（从-至）	2707-2727
页数	21
期刊	Biomedical Optics Express
卷	13
期	5
DOI	https://doi.org/10.1364/BOE.457475
出版状态	已出版 - 1 5月 2022

访问文件

10.1364/BOE.457475

其它文件与链接

链接到 Scopus 的出版物

引用此

@article{d26ed95c05d84413ac8ddc47815599b7,

title = "Joint estimation of depth and motion from a monocular endoscopy image sequence using a multi-loss rebalancing network",

abstract = "Building an in vivo three-dimensional (3D) surface model from a monocular endoscopy is an effective technology to improve the intuitiveness and precision of clinical laparoscopic surgery. This paper proposes a multi-loss rebalancing-based method for joint estimation of depth and motion from a monocular endoscopy image sequence. The feature descriptors are used to provide monitoring signals for the depth estimation network and motion estimation network. The epipolar constraints of the sequence frame is considered in the neighborhood spatial information by depth estimation network to enhance the accuracy of depth estimation. The reprojection information of depth estimation is used to reconstruct the camera motion by motion estimation network with a multi-view relative pose fusion mechanism. The relative response loss, feature consistency loss, and epipolar consistency loss function are defined to improve the robustness and accuracy of the proposed unsupervised learning-based method. Evaluations are implemented on public datasets. The error of motion estimation in three scenes decreased by 42.1%,53.6%, and 50.2%, respectively. And the average error of 3D reconstruction is 6.456 ± 1.798mm. This demonstrates its capability to generate reliable depth estimation and trajectory reconstruction results for endoscopy images and meaningful applications in clinical.",

author = "Shiyuan Liu and Jingfan Fan and Dengpan Song and Tianyu Fu and Yucong Lin and Deqiang Xiao and Hong Song and Yongtian Wang and Jian Yang",

note = "Publisher Copyright: {\textcopyright} 2022 Optica Publishing Group under the terms of the Optica Open Access Publishing Agreement",

year = "2022",

month = may,

day = "1",

doi = "10.1364/BOE.457475",

language = "English",

volume = "13",

pages = "2707--2727",

journal = "Biomedical Optics Express",

issn = "2156-7085",

publisher = "Optica Publishing Group (formerly OSA)",

number = "5",

}

TY - JOUR

T1 - Joint estimation of depth and motion from a monocular endoscopy image sequence using a multi-loss rebalancing network

AU - Liu, Shiyuan

AU - Fan, Jingfan

AU - Song, Dengpan

AU - Fu, Tianyu

AU - Lin, Yucong

AU - Xiao, Deqiang

AU - Song, Hong

AU - Wang, Yongtian

AU - Yang, Jian

PY - 2022/5/1

Y1 - 2022/5/1

N2 - Building an in vivo three-dimensional (3D) surface model from a monocular endoscopy is an effective technology to improve the intuitiveness and precision of clinical laparoscopic surgery. This paper proposes a multi-loss rebalancing-based method for joint estimation of depth and motion from a monocular endoscopy image sequence. The feature descriptors are used to provide monitoring signals for the depth estimation network and motion estimation network. The epipolar constraints of the sequence frame is considered in the neighborhood spatial information by depth estimation network to enhance the accuracy of depth estimation. The reprojection information of depth estimation is used to reconstruct the camera motion by motion estimation network with a multi-view relative pose fusion mechanism. The relative response loss, feature consistency loss, and epipolar consistency loss function are defined to improve the robustness and accuracy of the proposed unsupervised learning-based method. Evaluations are implemented on public datasets. The error of motion estimation in three scenes decreased by 42.1%,53.6%, and 50.2%, respectively. And the average error of 3D reconstruction is 6.456 ± 1.798mm. This demonstrates its capability to generate reliable depth estimation and trajectory reconstruction results for endoscopy images and meaningful applications in clinical.

AB - Building an in vivo three-dimensional (3D) surface model from a monocular endoscopy is an effective technology to improve the intuitiveness and precision of clinical laparoscopic surgery. This paper proposes a multi-loss rebalancing-based method for joint estimation of depth and motion from a monocular endoscopy image sequence. The feature descriptors are used to provide monitoring signals for the depth estimation network and motion estimation network. The epipolar constraints of the sequence frame is considered in the neighborhood spatial information by depth estimation network to enhance the accuracy of depth estimation. The reprojection information of depth estimation is used to reconstruct the camera motion by motion estimation network with a multi-view relative pose fusion mechanism. The relative response loss, feature consistency loss, and epipolar consistency loss function are defined to improve the robustness and accuracy of the proposed unsupervised learning-based method. Evaluations are implemented on public datasets. The error of motion estimation in three scenes decreased by 42.1%,53.6%, and 50.2%, respectively. And the average error of 3D reconstruction is 6.456 ± 1.798mm. This demonstrates its capability to generate reliable depth estimation and trajectory reconstruction results for endoscopy images and meaningful applications in clinical.

UR - http://www.scopus.com/inward/record.url?scp=85128621199&partnerID=8YFLogxK

U2 - 10.1364/BOE.457475

DO - 10.1364/BOE.457475

M3 - Article

AN - SCOPUS:85128621199

SN - 2156-7085

VL - 13

SP - 2707

EP - 2727

JO - Biomedical Optics Express

JF - Biomedical Optics Express

IS - 5

ER -

Joint estimation of depth and motion from a monocular endoscopy image sequence using a multi-loss rebalancing network

摘要

访问文件

其它文件与链接

指纹

引用此