Self-Supervised Monocular Depth Estimation for Endoscopic Imaging

Changsheng Li, Xue Li, Kaifeng Wang, Wenxin Chen, Qingyao Liu, Xingguang Duan

Research output: Contribution to journal › Article › peer-review

2 Citations (Scopus)

Abstract

Endoscopy plays a pivotal role in the early detection and treatment of many diseases, and artificial intelligence (AI)-assisted methods are gaining prominence in disease screening. Among these methods, depth estimation from endoscopic sequences is crucial for a range of AI-assisted surgical techniques. However, developing endoscopic depth estimation algorithms is challenging because of the complexity of the endoscopic environment and the constraints of the available datasets. This paper proposes a self-supervised depth estimation network that comprehensively explores the brightness changes in endoscopic images and fuses features at multiple levels to predict endoscopic depth accurately. First, a FlowNet is designed to evaluate the brightness changes between adjacent frames by calculating the multi-scale structural similarity. Second, a feature fusion module is presented to capture multi-scale contextual information. Experiments show that the algorithm achieves an average accuracy of 97.03% on the Stereo Correspondence and Reconstruction of Endoscopic Data (SCARED) dataset. Using the parameters trained on the SCARED dataset, the algorithm also achieves superior performance on two other datasets (EndoSLAM and KVASIR), indicating good generalization.
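
To make the idea of comparing adjacent frames by structural similarity at several scales more concrete, the following is a minimal NumPy sketch. It is not the paper's FlowNet or its loss: the function names (`global_ssim`, `downsample2`, `multiscale_similarity`), the use of whole-image statistics instead of a sliding window, the 2x2 average pooling, and the unweighted mean over scales are all simplifying assumptions made here for illustration. The constants 0.01 and 0.03 are the conventional SSIM stabilizers.

```python
import numpy as np

def global_ssim(x, y, data_range=1.0):
    """SSIM from whole-image statistics (assumption: no sliding window)."""
    c1 = (0.01 * data_range) ** 2  # stabilizes the luminance term
    c2 = (0.03 * data_range) ** 2  # stabilizes the contrast/structure term
    mu_x, mu_y = x.mean(), y.mean()
    var_x, var_y = x.var(), y.var()
    cov_xy = ((x - mu_x) * (y - mu_y)).mean()
    numerator = (2 * mu_x * mu_y + c1) * (2 * cov_xy + c2)
    denominator = (mu_x ** 2 + mu_y ** 2 + c1) * (var_x + var_y + c2)
    return numerator / denominator

def downsample2(img):
    """Halve the resolution by averaging non-overlapping 2x2 blocks."""
    h, w = img.shape[0] // 2 * 2, img.shape[1] // 2 * 2
    img = img[:h, :w]  # drop an odd trailing row/column if present
    return img.reshape(h // 2, 2, w // 2, 2).mean(axis=(1, 3))

def multiscale_similarity(x, y, scales=3):
    """Unweighted mean of global SSIM over several resolutions (assumption)."""
    scores = []
    for _ in range(scales):
        scores.append(global_ssim(x, y))
        x, y = downsample2(x), downsample2(y)
    return float(np.mean(scores))

# Hypothetical grayscale frames in [0, 1]; the second is uniformly brighter.
rng = np.random.default_rng(0)
frame_a = rng.random((64, 64)) * 0.5
frame_b = frame_a + 0.2

same = multiscale_similarity(frame_a, frame_a)
shifted = multiscale_similarity(frame_a, frame_b)
print(same, shifted)
```

In this sketch a uniform brightness offset lowers the score even though the scene structure is unchanged, which illustrates why brightness inconsistency between adjacent endoscopic frames matters for a similarity-based self-supervised signal.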

Original language: English
Pages (from-to): 1-11
Number of pages: 11
Journal: IEEE Journal of Biomedical and Health Informatics
Publication status: Accepted/In press - 2024

Keywords

  • Accuracy
  • Brightness
  • brightness inconsistency
  • depth estimation
  • Endoscopes
  • Estimation
  • feature fusion
  • self-supervised learning
  • Surgery
  • Surgical vision
  • Three-dimensional displays
  • Training
