Improving image segmentation with contextual and structural similarity

Xiaoyang Chen; Qin Liu; Hannah H. Deng; Tianshu Kuang; Henry Hung Ying Lin; Deqiang Xiao; Jaime Gateno; James J. Xia; Pew Thian Yap

doi:10.1016/j.patcog.2024.110489

Xiaoyang Chen, Qin Liu, Hannah H. Deng, Tianshu Kuang, Henry Hung Ying Lin, Deqiang Xiao, Jaime Gateno, James J. Xia, Pew Thian Yap^*

^*Corresponding author for this work

Research output: Contribution to journal › Article › peer-review

1 Citation (Scopus)

Abstract

Deep learning models for medical image segmentation are usually trained with voxel-wise losses, e.g., cross-entropy loss, focusing on unary supervision without considering inter-voxel relationships. This oversight potentially leads to semantically inconsistent predictions. Here, we propose a contextual similarity loss (CSL) and a structural similarity loss (SSL) to explicitly and efficiently incorporate inter-voxel relationships for improved performance. The CSL promotes consistency in predicted object categories for each image sub-region compared to ground truth. The SSL enforces compatibility between the predictions of voxel pairs by computing pair-wise distances between them, ensuring that voxels of the same class are close together whereas those from different classes are separated by a wide margin in the distribution space. The effectiveness of the CSL and SSL is evaluated using a clinical cone-beam computed tomography (CBCT) dataset of patients with various craniomaxillofacial (CMF) deformities and a public pancreas dataset. Experimental results show that the CSL and SSL outperform state-of-the-art regional loss functions in preserving segmentation semantics.
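The two ideas in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not give the formulas, so everything below is an assumption made for illustration. The pairwise term is the standard contrastive (margin) loss applied to Euclidean distances between predicted class-probability vectors, and the regional term is a binary cross-entropy on whether each class appears in a sub-region. The function names `pairwise_margin_loss` and `region_presence_loss`, the `margin` parameter, and the use of the maximum probability as a soft presence score are all hypothetical choices.

```python
import math
from itertools import combinations


def pairwise_margin_loss(probs, labels, margin=1.0):
    """Contrastive-style loss over all voxel pairs (illustrative assumption).

    probs  : list of per-voxel class-probability vectors
    labels : list of ground-truth class indices, one per voxel
    """
    total, count = 0.0, 0
    for i, j in combinations(range(len(probs)), 2):
        # Euclidean distance between the two predicted distributions
        d = math.dist(probs[i], probs[j])
        if labels[i] == labels[j]:
            total += d ** 2                     # same class: pull together
        else:
            total += max(0.0, margin - d) ** 2  # different class: push apart up to the margin
        count += 1
    return total / count if count else 0.0


def region_presence_loss(probs, labels, num_classes, eps=1e-7):
    """Binary cross-entropy on class presence within one sub-region
    (illustrative assumption)."""
    loss = 0.0
    for c in range(num_classes):
        # Soft score for "class c appears somewhere in this region"
        predicted = max(p[c] for p in probs)
        predicted = min(max(predicted, eps), 1.0 - eps)
        target = 1.0 if c in labels else 0.0
        loss -= target * math.log(predicted) + (1.0 - target) * math.log(1.0 - predicted)
    return loss / num_classes


# Three voxels, two classes: confident predictions versus uninformative ones.
confident = [[0.9, 0.1], [0.8, 0.2], [0.1, 0.9]]
uniform = [[0.5, 0.5], [0.5, 0.5], [0.5, 0.5]]
truth = [0, 0, 1]
print(pairwise_margin_loss(confident, truth), pairwise_margin_loss(uniform, truth))
print(region_presence_loss(confident, truth, num_classes=2))
```

In this sketch both terms would be added to a voxel-wise loss such as cross-entropy; enumerating every pair is quadratic in the number of voxels, so a practical version would sample pairs or restrict them to a local neighbourhood.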

Original language	English
Article number	110489
Journal	Pattern Recognition
Volume	152
DOI	https://doi.org/10.1016/j.patcog.2024.110489
Publication status	Published - Aug 2024
Externally published	Yes

Cite this

@article{b1f34a38774a47e98489cb9536ea037f,

title = "Improving image segmentation with contextual and structural similarity",

abstract = "Deep learning models for medical image segmentation are usually trained with voxel-wise losses, e.g., cross-entropy loss, focusing on unary supervision without considering inter-voxel relationships. This oversight potentially leads to semantically inconsistent predictions. Here, we propose a contextual similarity loss (CSL) and a structural similarity loss (SSL) to explicitly and efficiently incorporate inter-voxel relationships for improved performance. The CSL promotes consistency in predicted object categories for each image sub-region compared to ground truth. The SSL enforces compatibility between the predictions of voxel pairs by computing pair-wise distances between them, ensuring that voxels of the same class are close together whereas those from different classes are separated by a wide margin in the distribution space. The effectiveness of the CSL and SSL is evaluated using a clinical cone-beam computed tomography (CBCT) dataset of patients with various craniomaxillofacial (CMF) deformities and a public pancreas dataset. Experimental results show that the CSL and SSL outperform state-of-the-art regional loss functions in preserving segmentation semantics.",

keywords = "Cone-beam computed tomography, Image segmentation, Inter-voxel relationships, Pancreas segmentation",

author = "Xiaoyang Chen and Qin Liu and Deng, {Hannah H.} and Tianshu Kuang and Lin, {Henry Hung Ying} and Deqiang Xiao and Jaime Gateno and Xia, {James J.} and Yap, {Pew Thian}",

note = "Publisher Copyright: {\textcopyright} 2024 Elsevier Ltd",

year = "2024",

month = aug,

doi = "10.1016/j.patcog.2024.110489",

language = "English",

volume = "152",

journal = "Pattern Recognition",

issn = "0031-3203",

publisher = "Elsevier Ltd.",

}

TY - JOUR

T1 - Improving image segmentation with contextual and structural similarity

AU - Chen, Xiaoyang

AU - Liu, Qin

AU - Deng, Hannah H.

AU - Kuang, Tianshu

AU - Lin, Henry Hung Ying

AU - Xiao, Deqiang

AU - Gateno, Jaime

AU - Xia, James J.

AU - Yap, Pew Thian

PY - 2024/8

Y1 - 2024/8

N2 - Deep learning models for medical image segmentation are usually trained with voxel-wise losses, e.g., cross-entropy loss, focusing on unary supervision without considering inter-voxel relationships. This oversight potentially leads to semantically inconsistent predictions. Here, we propose a contextual similarity loss (CSL) and a structural similarity loss (SSL) to explicitly and efficiently incorporate inter-voxel relationships for improved performance. The CSL promotes consistency in predicted object categories for each image sub-region compared to ground truth. The SSL enforces compatibility between the predictions of voxel pairs by computing pair-wise distances between them, ensuring that voxels of the same class are close together whereas those from different classes are separated by a wide margin in the distribution space. The effectiveness of the CSL and SSL is evaluated using a clinical cone-beam computed tomography (CBCT) dataset of patients with various craniomaxillofacial (CMF) deformities and a public pancreas dataset. Experimental results show that the CSL and SSL outperform state-of-the-art regional loss functions in preserving segmentation semantics.

AB - Deep learning models for medical image segmentation are usually trained with voxel-wise losses, e.g., cross-entropy loss, focusing on unary supervision without considering inter-voxel relationships. This oversight potentially leads to semantically inconsistent predictions. Here, we propose a contextual similarity loss (CSL) and a structural similarity loss (SSL) to explicitly and efficiently incorporate inter-voxel relationships for improved performance. The CSL promotes consistency in predicted object categories for each image sub-region compared to ground truth. The SSL enforces compatibility between the predictions of voxel pairs by computing pair-wise distances between them, ensuring that voxels of the same class are close together whereas those from different classes are separated by a wide margin in the distribution space. The effectiveness of the CSL and SSL is evaluated using a clinical cone-beam computed tomography (CBCT) dataset of patients with various craniomaxillofacial (CMF) deformities and a public pancreas dataset. Experimental results show that the CSL and SSL outperform state-of-the-art regional loss functions in preserving segmentation semantics.

KW - Cone-beam computed tomography

KW - Image segmentation

KW - Inter-voxel relationships

KW - Pancreas segmentation

UR - http://www.scopus.com/inward/record.url?scp=85189936261&partnerID=8YFLogxK

U2 - 10.1016/j.patcog.2024.110489

DO - 10.1016/j.patcog.2024.110489

M3 - Article

AN - SCOPUS:85189936261

SN - 0031-3203

VL - 152

JO - Pattern Recognition

JF - Pattern Recognition

M1 - 110489

ER -
