Unbalanced Class Learning Network With Scale-Adaptive Perception for Complicated Scene in Remote Sensing Images Segmentation

He Wang; Mengmeng Zhang; Wei Li; Yunhao Gao; Yuanyuan Gui; Yuxiang Zhang

doi:10.1109/TGRS.2024.3388528

Unbalanced Class Learning Network With Scale-Adaptive Perception for Complicated Scene in Remote Sensing Images Segmentation

He Wang, Mengmeng Zhang^*, Wei Li, Yunhao Gao, Yuanyuan Gui, Yuxiang Zhang

^*此作品的通讯作者

信息与电子学院

Beijing Institute of Technology

科研成果: 期刊稿件 › 文章 › 同行评审

3 引用（Scopus）

摘要

The semantic segmentation of wide-field remote sensing images (RSIs) plays a significant role in many fields. However, due to the complexity of the content of RSIs, the dataset often has an uneven distribution of land type between different classes and large gaps in the scales of different objects. This often creates great problems for fine segmentation. To solve the issues, an unbalanced class learning network with scale-adaptive perception (UCSANet) is proposed, which can adaptively cope with multiscale objects and unbalanced classes. The design can be inserted in any convolution network easily and can enrich features without increasing too many parameters. The network groups feature and use atrous convolutions with different dilated rates on different groups to extract multiscale features while separable convolutions reduce the amount of network parameters. Then, the fusion of features between different scales is achieved through the self-attention mechanism. Furthermore, a weight map is designed to adaptively combine the predictions of two segmentation heads with cross-entropy loss and Lovasz-Softmax loss, respectively, which enable the network to focus on learning low-frequency classes without affecting high-frequency classes. Experimental results on GF-6 MSI datasets demonstrate that the proposed UCSANet performs significantly better than others and achieves multiclass segmentation more accurately.

源语言	英语
文章编号	4406712
页（从-至）	1-12
页数	12
期刊	IEEE Transactions on Geoscience and Remote Sensing
卷	62
DOI	https://doi.org/10.1109/TGRS.2024.3388528
出版状态	已出版 - 2024

访问文件

10.1109/TGRS.2024.3388528

其它文件与链接

链接到 Scopus 的出版物

引用此

@article{acf3c52b46a84a45aefb644701c7afd8,

title = "Unbalanced Class Learning Network With Scale-Adaptive Perception for Complicated Scene in Remote Sensing Images Segmentation",

abstract = "The semantic segmentation of wide-field remote sensing images (RSIs) plays a significant role in many fields. However, due to the complexity of the content of RSIs, the dataset often has an uneven distribution of land type between different classes and large gaps in the scales of different objects. This often creates great problems for fine segmentation. To solve the issues, an unbalanced class learning network with scale-adaptive perception (UCSANet) is proposed, which can adaptively cope with multiscale objects and unbalanced classes. The design can be inserted in any convolution network easily and can enrich features without increasing too many parameters. The network groups feature and use atrous convolutions with different dilated rates on different groups to extract multiscale features while separable convolutions reduce the amount of network parameters. Then, the fusion of features between different scales is achieved through the self-attention mechanism. Furthermore, a weight map is designed to adaptively combine the predictions of two segmentation heads with cross-entropy loss and Lovasz-Softmax loss, respectively, which enable the network to focus on learning low-frequency classes without affecting high-frequency classes. Experimental results on GF-6 MSI datasets demonstrate that the proposed UCSANet performs significantly better than others and achieves multiclass segmentation more accurately.",

keywords = "Deep learning, scale-adaptive, semantic segmentation, unbalanced data",

author = "He Wang and Mengmeng Zhang and Wei Li and Yunhao Gao and Yuanyuan Gui and Yuxiang Zhang",

note = "Publisher Copyright: {\textcopyright} 1980-2012 IEEE.",

year = "2024",

doi = "10.1109/TGRS.2024.3388528",

language = "English",

volume = "62",

pages = "1--12",

journal = "IEEE Transactions on Geoscience and Remote Sensing",

issn = "0196-2892",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

}

TY - JOUR

T1 - Unbalanced Class Learning Network With Scale-Adaptive Perception for Complicated Scene in Remote Sensing Images Segmentation

AU - Wang, He

AU - Zhang, Mengmeng

AU - Li, Wei

AU - Gao, Yunhao

AU - Gui, Yuanyuan

AU - Zhang, Yuxiang

PY - 2024

Y1 - 2024

N2 - The semantic segmentation of wide-field remote sensing images (RSIs) plays a significant role in many fields. However, due to the complexity of the content of RSIs, the dataset often has an uneven distribution of land type between different classes and large gaps in the scales of different objects. This often creates great problems for fine segmentation. To solve the issues, an unbalanced class learning network with scale-adaptive perception (UCSANet) is proposed, which can adaptively cope with multiscale objects and unbalanced classes. The design can be inserted in any convolution network easily and can enrich features without increasing too many parameters. The network groups feature and use atrous convolutions with different dilated rates on different groups to extract multiscale features while separable convolutions reduce the amount of network parameters. Then, the fusion of features between different scales is achieved through the self-attention mechanism. Furthermore, a weight map is designed to adaptively combine the predictions of two segmentation heads with cross-entropy loss and Lovasz-Softmax loss, respectively, which enable the network to focus on learning low-frequency classes without affecting high-frequency classes. Experimental results on GF-6 MSI datasets demonstrate that the proposed UCSANet performs significantly better than others and achieves multiclass segmentation more accurately.

AB - The semantic segmentation of wide-field remote sensing images (RSIs) plays a significant role in many fields. However, due to the complexity of the content of RSIs, the dataset often has an uneven distribution of land type between different classes and large gaps in the scales of different objects. This often creates great problems for fine segmentation. To solve the issues, an unbalanced class learning network with scale-adaptive perception (UCSANet) is proposed, which can adaptively cope with multiscale objects and unbalanced classes. The design can be inserted in any convolution network easily and can enrich features without increasing too many parameters. The network groups feature and use atrous convolutions with different dilated rates on different groups to extract multiscale features while separable convolutions reduce the amount of network parameters. Then, the fusion of features between different scales is achieved through the self-attention mechanism. Furthermore, a weight map is designed to adaptively combine the predictions of two segmentation heads with cross-entropy loss and Lovasz-Softmax loss, respectively, which enable the network to focus on learning low-frequency classes without affecting high-frequency classes. Experimental results on GF-6 MSI datasets demonstrate that the proposed UCSANet performs significantly better than others and achieves multiclass segmentation more accurately.

KW - Deep learning

KW - scale-adaptive

KW - semantic segmentation

KW - unbalanced data

UR - http://www.scopus.com/inward/record.url?scp=85190730771&partnerID=8YFLogxK

U2 - 10.1109/TGRS.2024.3388528

DO - 10.1109/TGRS.2024.3388528

M3 - Article

AN - SCOPUS:85190730771

SN - 0196-2892

VL - 62

SP - 1

EP - 12

JO - IEEE Transactions on Geoscience and Remote Sensing

JF - IEEE Transactions on Geoscience and Remote Sensing

M1 - 4406712

ER -

Unbalanced Class Learning Network With Scale-Adaptive Perception for Complicated Scene in Remote Sensing Images Segmentation

摘要

访问文件

其它文件与链接

指纹

引用此