TY - GEN
T1 - MCRLe
T2 - 20th IEEE International Symposium on Biomedical Imaging, ISBI 2023
AU - Liao, Weibin
AU - Jiang, Peirong
AU - Lv, Yi
AU - Xue, Yunjing
AU - Chen, Zhensen
AU - Li, Xuesong
N1 - Publisher Copyright:
© 2023 IEEE.
PY - 2023
Y1 - 2023
N2 - Multi-modal medical image analysis task with deep neural network (DNN) models has become an area of growing interest. While some works proposed to utilize significant "mismatch"between multi-modal medical images for stroke onset time diagnosis within 4.5 hours, few devoted to diagnosis on dataset with insignificant "mismatch". We tried to promote the development of this problem and overcome some challenges in it. Specifically, we proposed Multi-modal Contrastive Representation Learning, namely MCRLe, which leverages momentum contrastive representation learning to learn "mismatch"between different modalities from the same subject. To achieve the best performance, it eliminates the bias generated during imaging process between modalities using a cross-modal registration technology, and enriches image data using a well-designed data augmentation procedure. We carried out extensive experiments to evaluate MCRLe using a dataset of stroke patients with 136 subjects, and made a validation on three backbone networks including 3D CNN, 3D ResNet-18 and 3D ResNet-50. Experimental results shows that MCRLe could improves the performance of DNN on stroke onset time diagnosis task, and it assists DNN in focusing more on stroke regions with "mismatch"even without using segmentation results of lesion as an auxiliary. Results of cross validation and various backbone network settings further confirm the superiority of MCRLe.
AB - Multi-modal medical image analysis task with deep neural network (DNN) models has become an area of growing interest. While some works proposed to utilize significant "mismatch"between multi-modal medical images for stroke onset time diagnosis within 4.5 hours, few devoted to diagnosis on dataset with insignificant "mismatch". We tried to promote the development of this problem and overcome some challenges in it. Specifically, we proposed Multi-modal Contrastive Representation Learning, namely MCRLe, which leverages momentum contrastive representation learning to learn "mismatch"between different modalities from the same subject. To achieve the best performance, it eliminates the bias generated during imaging process between modalities using a cross-modal registration technology, and enriches image data using a well-designed data augmentation procedure. We carried out extensive experiments to evaluate MCRLe using a dataset of stroke patients with 136 subjects, and made a validation on three backbone networks including 3D CNN, 3D ResNet-18 and 3D ResNet-50. Experimental results shows that MCRLe could improves the performance of DNN on stroke onset time diagnosis task, and it assists DNN in focusing more on stroke regions with "mismatch"even without using segmentation results of lesion as an auxiliary. Results of cross validation and various backbone network settings further confirm the superiority of MCRLe.
KW - Multi-modal analysis
KW - contrastive representation learning
KW - stroke onset time diagnosis
UR - http://www.scopus.com/inward/record.url?scp=85172122722&partnerID=8YFLogxK
U2 - 10.1109/ISBI53787.2023.10230570
DO - 10.1109/ISBI53787.2023.10230570
M3 - Conference contribution
AN - SCOPUS:85172122722
T3 - Proceedings - International Symposium on Biomedical Imaging
BT - 2023 IEEE International Symposium on Biomedical Imaging, ISBI 2023
PB - IEEE Computer Society
Y2 - 18 April 2023 through 21 April 2023
ER -