TY - JOUR
T1 - Wide-Context Attention Network for Remote Sensing Image Retrieval
AU - Wang, Honghu
AU - Zhou, Zhiqiang
AU - Zong, Hua
AU - Miao, Lingjuan
N1 - Publisher Copyright:
© 2004-2012 IEEE.
PY - 2021/12/1
Y1 - 2021/12/1
N2 - Remote sensing image retrieval (RSIR) has broad application prospects, but related challenges still exist. One of the most important challenges is how to obtain discriminative features. In recent years, although the powerful feature learning ability of convolutional neural networks (CNNs) has significantly improved RSIR, their performance can be restricted by the complexity of remote sensing (RS) images, such as small objects, varying scales, and wide scope. To address these problems, we propose a novel wide-context attention network (W-CAN). It leverages two attention modules to adaptively learn local features correlated in the spatial and channel dimensions, respectively, which can obtain discriminative features with extensive context information. During training, a hybrid loss is introduced to enhance the intraclass compactness and interclass separability of the features. Moreover, we add a branch to learn binary descriptors and realize the end-to-end descriptor aggregation. Experiments on four RS benchmark data sets demonstrate that the proposed method can outperform some state-of-the-art RSIR methods.
AB - Remote sensing image retrieval (RSIR) has broad application prospects, but related challenges still exist. One of the most important challenges is how to obtain discriminative features. In recent years, although the powerful feature learning ability of convolutional neural networks (CNNs) has significantly improved RSIR, their performance can be restricted by the complexity of remote sensing (RS) images, such as small objects, varying scales, and wide scope. To address these problems, we propose a novel wide-context attention network (W-CAN). It leverages two attention modules to adaptively learn local features correlated in the spatial and channel dimensions, respectively, which can obtain discriminative features with extensive context information. During training, a hybrid loss is introduced to enhance the intraclass compactness and interclass separability of the features. Moreover, we add a branch to learn binary descriptors and realize the end-to-end descriptor aggregation. Experiments on four RS benchmark data sets demonstrate that the proposed method can outperform some state-of-the-art RSIR methods.
KW - Attention network
KW - convolutional neural networks (CNNs)
KW - remote sensing image retrieval (RSIR)
UR - http://www.scopus.com/inward/record.url?scp=85120425297&partnerID=8YFLogxK
U2 - 10.1109/LGRS.2020.3015951
DO - 10.1109/LGRS.2020.3015951
M3 - Article
AN - SCOPUS:85120425297
SN - 1545-598X
VL - 18
SP - 2082
EP - 2086
JO - IEEE Geoscience and Remote Sensing Letters
JF - IEEE Geoscience and Remote Sensing Letters
IS - 12
ER -