Improving image segmentation with contextual and structural similarity

Xiaoyang Chen; Qin Liu; Hannah H. Deng; Tianshu Kuang; Henry Hung Ying Lin; Deqiang Xiao; Jaime Gateno; James J. Xia; Pew Thian Yap

doi:10.1016/j.patcog.2024.110489

Corresponding author: Pew Thian Yap

Research output: Contribution to journal › Article › peer-review

1 Citation (Scopus)

Abstract

Deep learning models for medical image segmentation are usually trained with voxel-wise losses, e.g., cross-entropy loss, focusing on unary supervision without considering inter-voxel relationships. This oversight potentially leads to semantically inconsistent predictions. Here, we propose a contextual similarity loss (CSL) and a structural similarity loss (SSL) to explicitly and efficiently incorporate inter-voxel relationships for improved performance. The CSL promotes consistency in predicted object categories for each image sub-region compared to ground truth. The SSL enforces compatibility between the predictions of voxel pairs by computing pair-wise distances between them, ensuring that voxels of the same class are close together whereas those from different classes are separated by a wide margin in the distribution space. The effectiveness of the CSL and SSL is evaluated using a clinical cone-beam computed tomography (CBCT) dataset of patients with various craniomaxillofacial (CMF) deformities and a public pancreas dataset. Experimental results show that the CSL and SSL outperform state-of-the-art regional loss functions in preserving segmentation semantics.
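The abstract describes the SSL only in words: pair-wise distances between voxel predictions, with same-class pairs pulled together and different-class pairs pushed apart by a margin. The sketch below is one possible reading of that idea, written as a generic contrastive-style margin loss over per-voxel class-probability vectors. It is not the authors' formulation. The function names, the Euclidean distance, the squared penalties, and the default margin are all assumptions made for illustration.

```python
import math
from itertools import combinations


def l2_distance(p, q):
    """Euclidean distance between two class-probability vectors."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(p, q)))


def pairwise_margin_loss(probs, labels, margin=1.0):
    """Illustrative pairwise loss (hypothetical, not the paper's SSL).

    probs  : list of per-voxel class-probability vectors
    labels : list of ground-truth class indices, one per voxel
    margin : assumed minimum distance wanted between different-class voxels
    """
    total = 0.0
    n_pairs = 0
    for i, j in combinations(range(len(probs)), 2):
        d = l2_distance(probs[i], probs[j])
        if labels[i] == labels[j]:
            # Same class: penalize any distance between the predictions.
            total += d ** 2
        else:
            # Different classes: penalize only if closer than the margin.
            total += max(0.0, margin - d) ** 2
        n_pairs += 1
    # Average over all pairs; an empty or single-voxel input gives 0.
    return total / n_pairs if n_pairs else 0.0


if __name__ == "__main__":
    labels = [0, 0, 1]
    confident = [[1.0, 0.0], [1.0, 0.0], [0.0, 1.0]]
    uncertain = [[0.5, 0.5], [0.5, 0.5], [0.5, 0.5]]
    print(pairwise_margin_loss(confident, labels))
    print(pairwise_margin_loss(uncertain, labels))
```

Enumerating all voxel pairs is quadratic in the number of voxels, so a practical version would presumably sample pairs or restrict them to local neighborhoods. The abstract does not say how the authors handle this.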

Original language	English
Article number	110489
Journal	Pattern Recognition
Volume	152
DOIs	https://doi.org/10.1016/j.patcog.2024.110489
Publication status	Published - Aug 2024
Externally published	Yes

Keywords

Cone-beam computed tomography
Image segmentation
Inter-voxel relationships
Pancreas segmentation

Cite this

Chen, X., Liu, Q., Deng, H. H., Kuang, T., Lin, H. H. Y., Xiao, D., Gateno, J., Xia, J. J., & Yap, P. T. (2024). Improving image segmentation with contextual and structural similarity. Pattern Recognition, 152, Article 110489. https://doi.org/10.1016/j.patcog.2024.110489

@article{b1f34a38774a47e98489cb9536ea037f,

title = "Improving image segmentation with contextual and structural similarity",

abstract = "Deep learning models for medical image segmentation are usually trained with voxel-wise losses, e.g., cross-entropy loss, focusing on unary supervision without considering inter-voxel relationships. This oversight potentially leads to semantically inconsistent predictions. Here, we propose a contextual similarity loss (CSL) and a structural similarity loss (SSL) to explicitly and efficiently incorporate inter-voxel relationships for improved performance. The CSL promotes consistency in predicted object categories for each image sub-region compared to ground truth. The SSL enforces compatibility between the predictions of voxel pairs by computing pair-wise distances between them, ensuring that voxels of the same class are close together whereas those from different classes are separated by a wide margin in the distribution space. The effectiveness of the CSL and SSL is evaluated using a clinical cone-beam computed tomography (CBCT) dataset of patients with various craniomaxillofacial (CMF) deformities and a public pancreas dataset. Experimental results show that the CSL and SSL outperform state-of-the-art regional loss functions in preserving segmentation semantics.",

keywords = "Cone-beam computed tomography, Image segmentation, Inter-voxel relationships, Pancreas segmentation",

author = "Xiaoyang Chen and Qin Liu and Deng, {Hannah H.} and Tianshu Kuang and Lin, {Henry Hung Ying} and Deqiang Xiao and Jaime Gateno and Xia, {James J.} and Yap, {Pew Thian}",

note = "Publisher Copyright: {\textcopyright} 2024 Elsevier Ltd",

year = "2024",

month = aug,

doi = "10.1016/j.patcog.2024.110489",

language = "English",

volume = "152",

journal = "Pattern Recognition",

issn = "0031-3203",

publisher = "Elsevier Ltd.",

}

TY - JOUR

T1 - Improving image segmentation with contextual and structural similarity

AU - Chen, Xiaoyang

AU - Liu, Qin

AU - Deng, Hannah H.

AU - Kuang, Tianshu

AU - Lin, Henry Hung Ying

AU - Xiao, Deqiang

AU - Gateno, Jaime

AU - Xia, James J.

AU - Yap, Pew Thian

PY - 2024/8

Y1 - 2024/8

AB - Deep learning models for medical image segmentation are usually trained with voxel-wise losses, e.g., cross-entropy loss, focusing on unary supervision without considering inter-voxel relationships. This oversight potentially leads to semantically inconsistent predictions. Here, we propose a contextual similarity loss (CSL) and a structural similarity loss (SSL) to explicitly and efficiently incorporate inter-voxel relationships for improved performance. The CSL promotes consistency in predicted object categories for each image sub-region compared to ground truth. The SSL enforces compatibility between the predictions of voxel pairs by computing pair-wise distances between them, ensuring that voxels of the same class are close together whereas those from different classes are separated by a wide margin in the distribution space. The effectiveness of the CSL and SSL is evaluated using a clinical cone-beam computed tomography (CBCT) dataset of patients with various craniomaxillofacial (CMF) deformities and a public pancreas dataset. Experimental results show that the CSL and SSL outperform state-of-the-art regional loss functions in preserving segmentation semantics.

KW - Cone-beam computed tomography

KW - Image segmentation

KW - Inter-voxel relationships

KW - Pancreas segmentation

UR - http://www.scopus.com/inward/record.url?scp=85189936261&partnerID=8YFLogxK

U2 - 10.1016/j.patcog.2024.110489

DO - 10.1016/j.patcog.2024.110489

M3 - Article

AN - SCOPUS:85189936261

SN - 0031-3203

VL - 152

JO - Pattern Recognition

JF - Pattern Recognition

M1 - 110489

ER -