TY - GEN
T1 - A Cascaded Semantic Enhancement Network Based on Attention Mechanism for Blurred Small Polyp Segmentation
AU - Lin, Mianduan
AU - Hirota, Kaoru
AU - Dai, Yaping
AU - Ji, Ye
AU - Shao, Shuai
N1 - Publisher Copyright:
© 2023 Technical Committee on Control Theory, Chinese Association of Automation.
PY - 2023
Y1 - 2023
N2 - In some polyp feature extraction algorithms based on deep learning, there is a problem of polyp semantic information degradation caused by frequent operations of feature down-sampling, which reduces the accuracy of polyp segmentation. To solve the problem of polyp semantic information degradation, a Cascaded Semantic Enhancement Network (CSENet) based on attention mechanism is proposed. There are two main parts in CSENet, Cascaded Partial Decoder (CPD) and Polyp Semantic Enhancement (PSE) module. The CPD is used to aggregate multiscale features of polyps. The PSE module is constructed based on channel attention and spatial attention to enhance the degraded semantic information of polyps. The PSE module improves CSENet's ability to segment polyps in both cases of small polyp targets and blurred polyp edges, thus improving CSENet's polyp segmentation accuracy. Experiment results show that the CSENet has best performance compare with five methods (U-Net, UNet++, SFA, PraNet and SANet), under four benchmark polyp segmentation datasets (the Kvasir dataset, the CVC-ClinicDpB dataset, the CVC-ColonDB dataset and the CVC-T dataset). In particular, compared with the SANet, CSENet improves mIoU and Fβw by 2.3% and 1.6% on the CVC-ClinicDB dataset.
AB - In some polyp feature extraction algorithms based on deep learning, there is a problem of polyp semantic information degradation caused by frequent operations of feature down-sampling, which reduces the accuracy of polyp segmentation. To solve the problem of polyp semantic information degradation, a Cascaded Semantic Enhancement Network (CSENet) based on attention mechanism is proposed. There are two main parts in CSENet, Cascaded Partial Decoder (CPD) and Polyp Semantic Enhancement (PSE) module. The CPD is used to aggregate multiscale features of polyps. The PSE module is constructed based on channel attention and spatial attention to enhance the degraded semantic information of polyps. The PSE module improves CSENet's ability to segment polyps in both cases of small polyp targets and blurred polyp edges, thus improving CSENet's polyp segmentation accuracy. Experiment results show that the CSENet has best performance compare with five methods (U-Net, UNet++, SFA, PraNet and SANet), under four benchmark polyp segmentation datasets (the Kvasir dataset, the CVC-ClinicDpB dataset, the CVC-ColonDB dataset and the CVC-T dataset). In particular, compared with the SANet, CSENet improves mIoU and Fβw by 2.3% and 1.6% on the CVC-ClinicDB dataset.
KW - Attention mechanism
KW - Deep learning
KW - Polyp image segmentation
KW - Semantic enhancement
UR - http://www.scopus.com/inward/record.url?scp=85175571568&partnerID=8YFLogxK
U2 - 10.23919/CCC58697.2023.10240309
DO - 10.23919/CCC58697.2023.10240309
M3 - Conference contribution
AN - SCOPUS:85175571568
T3 - Chinese Control Conference, CCC
SP - 8240
EP - 8245
BT - 2023 42nd Chinese Control Conference, CCC 2023
PB - IEEE Computer Society
T2 - 42nd Chinese Control Conference, CCC 2023
Y2 - 24 July 2023 through 26 July 2023
ER -