Domain Adaptation for Semantic Segmentation of Cataract Surgical Images Based on Masked Image Consistency

Yuzhu Zhang, Yijie Pan*, Mingyang Ou, Guanghui Gong, Qinhu Zhang, Haojin Li, Heng Li*

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Cataract surgery is a complex procedure requiring precise execution of multiple steps. To improve the accuracy and efficiency of cataract surgeries, we present a semantic segmentation model for cataract surgery scenes. Our model leverages Unsupervised Domain Adaptation (UDA) techniques to enhance segmentation performance in clinical surgical environments, addressing challenges such as domain shift, occlusions between surgical tools and tissues, and long tail problem. The model utilizes a Teacher-Student model, where a student model is trained in the target domain with pseudo-labels generated by an Exponential Moving Average (EMA) teacher model, ensuring robust learning across domains. Additionally, we utilize a Masked Image Consistency (MIC) module to improve the model’s understanding of occluded regions by enforcing consistency between masked and unmasked predictions. To mitigate class imbalance between anatomical structures and surgical tools, we employ a maximum squares loss, enabling the model to achieve balanced learning. Our results demonstrate that the proposed model improves segmentation accuracy and robustness in cataract surgery scenarios.

Original languageEnglish
Title of host publicationApplied Intelligence - 2nd International Conference, ICAI 2024, Proceedings
EditorsDe-Shuang Huang, Wei Chen, Chuanlei Zhang, Yijie Pan, Qinhu Zhang, Xiangzeng Kong
PublisherSpringer Science and Business Media Deutschland GmbH
Pages243-254
Number of pages12
ISBN (Print)9789819619061
DOIs
Publication statusPublished - 2025
Externally publishedYes
Event2nd International Conference on Applied Intelligence, ICAI 2024 - Zhenzhou, China
Duration: 22 Nov 202425 Nov 2024

Publication series

NameCommunications in Computer and Information Science
Volume2387 CCIS
ISSN (Print)1865-0929
ISSN (Electronic)1865-0937

Conference

Conference2nd International Conference on Applied Intelligence, ICAI 2024
Country/TerritoryChina
CityZhenzhou
Period22/11/2425/11/24

Keywords

  • Cataract Surgery
  • Semantic Segmentation
  • Unsupervised Domain Adaptation

Fingerprint

Dive into the research topics of 'Domain Adaptation for Semantic Segmentation of Cataract Surgical Images Based on Masked Image Consistency'. Together they form a unique fingerprint.

Cite this