Multi-view Test-time Adaptation for Semantic Segmentation in Clinical Cataract Surgery

Heng Li; Mingyang Ou; Haojin Li; Zhongxi Qiu; Ke Niu; Huazhu Fu; Jiang Liu

doi:10.1109/TMI.2025.3529875

Multi-view Test-time Adaptation for Semantic Segmentation in Clinical Cataract Surgery

Heng Li^*, Mingyang Ou, Haojin Li, Zhongxi Qiu, Ke Niu, Huazhu Fu, Jiang Liu^*

^*Corresponding author for this work

Research output: Contribution to journal › Article › peer-review

Abstract

Cataract surgery, a widely performed operation worldwide, is incorporating semantic segmentation to advance computer-assisted intervention. However, the tissue appearance and illumination in cataract surgery often differ among clinical centers, intensifying the issue of domain shifts. While domain adaptation offers remedies to the shifts, the necessity for data centralization raises additional privacy concerns. To overcome these challenges, we propose a Multi-view Test-time Adaptation algorithm (MUTA) to segment cataract surgical scenes, which leverages multi-view learning to enhance model training within the source domain and model adaptation within the target domain. In the training phase, the segmentation model is equipped with multi-view decoders to boost its robustness against variations in cataract surgery. During the inference phase, test-time adaptation is implemented using multi-view knowledge distillation, enabling model updates in clinics without data centralization or privacy concerns. We conducted experiments in a simulated cross-center scenario using several cataract surgery datasets to evaluate the effectiveness of MUTA. Through comparisons and investigations, we have validated that MUTA effectively learns a robust source model and adapts the model to target data during the practical inference phase. Code and datasets are available at https://github.com/liamheng/CAI-algorithms.

Original language	English
Journal	IEEE Transactions on Medical Imaging
DOIs	https://doi.org/10.1109/TMI.2025.3529875
Publication status	Accepted/In press - 2025
Externally published	Yes

Keywords

Cataract Surgery
multi-view learning
semantic segmentation
test-time adaptation

Access to Document

10.1109/TMI.2025.3529875

Cite this

Li, H., Ou, M., Li, H., Qiu, Z., Niu, K., Fu, H., & Liu, J. (Accepted/In press). Multi-view Test-time Adaptation for Semantic Segmentation in Clinical Cataract Surgery. IEEE Transactions on Medical Imaging. https://doi.org/10.1109/TMI.2025.3529875

@article{8dbd614823e1488885621a5e8d31d3c9,

title = "Multi-view Test-time Adaptation for Semantic Segmentation in Clinical Cataract Surgery",

abstract = "Cataract surgery, a widely performed operation worldwide, is incorporating semantic segmentation to advance computer-assisted intervention. However, the tissue appearance and illumination in cataract surgery often differ among clinical centers, intensifying the issue of domain shifts. While domain adaptation offers remedies to the shifts, the necessity for data centralization raises additional privacy concerns. To overcome these challenges, we propose a Multi-view Test-time Adaptation algorithm (MUTA) to segment cataract surgical scenes, which leverages multi-view learning to enhance model training within the source domain and model adaptation within the target domain. In the training phase, the segmentation model is equipped with multi-view decoders to boost its robustness against variations in cataract surgery. During the inference phase, test-time adaptation is implemented using multi-view knowledge distillation, enabling model updates in clinics without data centralization or privacy concerns. We conducted experiments in a simulated cross-center scenario using several cataract surgery datasets to evaluate the effectiveness of MUTA. Through comparisons and investigations, we have validated that MUTA effectively learns a robust source model and adapts the model to target data during the practical inference phase. Code and datasets are available at https://github.com/liamheng/CAI-algorithms.",

keywords = "Cataract Surgery, multi-view learning, semantic segmentation, test-time adaptation",

author = "Heng Li and Mingyang Ou and Haojin Li and Zhongxi Qiu and Ke Niu and Huazhu Fu and Jiang Liu",

note = "Publisher Copyright: {\textcopyright} 1982-2012 IEEE.",

year = "2025",

doi = "10.1109/TMI.2025.3529875",

language = "English",

journal = "IEEE Transactions on Medical Imaging",

issn = "0278-0062",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

}

TY - JOUR

T1 - Multi-view Test-time Adaptation for Semantic Segmentation in Clinical Cataract Surgery

AU - Li, Heng

AU - Ou, Mingyang

AU - Li, Haojin

AU - Qiu, Zhongxi

AU - Niu, Ke

AU - Fu, Huazhu

AU - Liu, Jiang

PY - 2025

Y1 - 2025

N2 - Cataract surgery, a widely performed operation worldwide, is incorporating semantic segmentation to advance computer-assisted intervention. However, the tissue appearance and illumination in cataract surgery often differ among clinical centers, intensifying the issue of domain shifts. While domain adaptation offers remedies to the shifts, the necessity for data centralization raises additional privacy concerns. To overcome these challenges, we propose a Multi-view Test-time Adaptation algorithm (MUTA) to segment cataract surgical scenes, which leverages multi-view learning to enhance model training within the source domain and model adaptation within the target domain. In the training phase, the segmentation model is equipped with multi-view decoders to boost its robustness against variations in cataract surgery. During the inference phase, test-time adaptation is implemented using multi-view knowledge distillation, enabling model updates in clinics without data centralization or privacy concerns. We conducted experiments in a simulated cross-center scenario using several cataract surgery datasets to evaluate the effectiveness of MUTA. Through comparisons and investigations, we have validated that MUTA effectively learns a robust source model and adapts the model to target data during the practical inference phase. Code and datasets are available at https://github.com/liamheng/CAI-algorithms.

AB - Cataract surgery, a widely performed operation worldwide, is incorporating semantic segmentation to advance computer-assisted intervention. However, the tissue appearance and illumination in cataract surgery often differ among clinical centers, intensifying the issue of domain shifts. While domain adaptation offers remedies to the shifts, the necessity for data centralization raises additional privacy concerns. To overcome these challenges, we propose a Multi-view Test-time Adaptation algorithm (MUTA) to segment cataract surgical scenes, which leverages multi-view learning to enhance model training within the source domain and model adaptation within the target domain. In the training phase, the segmentation model is equipped with multi-view decoders to boost its robustness against variations in cataract surgery. During the inference phase, test-time adaptation is implemented using multi-view knowledge distillation, enabling model updates in clinics without data centralization or privacy concerns. We conducted experiments in a simulated cross-center scenario using several cataract surgery datasets to evaluate the effectiveness of MUTA. Through comparisons and investigations, we have validated that MUTA effectively learns a robust source model and adapts the model to target data during the practical inference phase. Code and datasets are available at https://github.com/liamheng/CAI-algorithms.

KW - Cataract Surgery

KW - multi-view learning

KW - semantic segmentation

KW - test-time adaptation

UR - http://www.scopus.com/inward/record.url?scp=85215692916&partnerID=8YFLogxK

U2 - 10.1109/TMI.2025.3529875

DO - 10.1109/TMI.2025.3529875

M3 - Article

AN - SCOPUS:85215692916

SN - 0278-0062

JO - IEEE Transactions on Medical Imaging

JF - IEEE Transactions on Medical Imaging

ER -

Multi-view Test-time Adaptation for Semantic Segmentation in Clinical Cataract Surgery

Abstract

Keywords

Access to Document

Other files and links

Fingerprint

Cite this