TY - JOUR
T1 - MMHCA
T2 - Multi-feature representations based on multi-scale hierarchical contextual aggregation for UAV-view geo-localization
AU - CHEN, Nanhua
AU - LOU, Tai shan
AU - ZHAO, Liangyu
N1 - Publisher Copyright:
© 2024
PY - 2025/6
Y1 - 2025/6
N2 - In global navigation satellite system denial environment, cross-view geo-localization based on image retrieval presents an exceedingly critical visual localization solution for Unmanned Aerial Vehicle (UAV) systems. The essence of cross-view geo-localization resides in matching images containing the same geographical targets from disparate platforms, such as UAV-view and satellite-view images. However, images of the same geographical targets may suffer from occlusions and geometric distortions due to variations in the capturing platform, view, and timing. The existing methods predominantly extract features by segmenting feature maps, which overlook the holistic semantic distribution and structural information of objects, resulting in loss of image information. To address these challenges, dilated neighborhood attention Transformer is employed as the feature extraction backbone, and Multi-feature representations based on Multi-scale Hierarchical Contextual Aggregation (MMHCA) is proposed. In the proposed MMHCA method, the multi-scale hierarchical contextual aggregation method is utilized to extract contextual information from local to global across various granularity levels, establishing feature associations of contextual information with global and local information in the image. Subsequently, the multi-feature representations method is utilized to obtain rich discriminative feature information, bolstering the robustness of model in scenarios characterized by positional shifts, varying distances, and scale ambiguities. Comprehensive experiments conducted on the extensively utilized University-1652 and SUES-200 benchmarks indicate that the MMHCA method surpasses the existing techniques, showing outstanding results in UAV localization and navigation.
AB - In global navigation satellite system denial environment, cross-view geo-localization based on image retrieval presents an exceedingly critical visual localization solution for Unmanned Aerial Vehicle (UAV) systems. The essence of cross-view geo-localization resides in matching images containing the same geographical targets from disparate platforms, such as UAV-view and satellite-view images. However, images of the same geographical targets may suffer from occlusions and geometric distortions due to variations in the capturing platform, view, and timing. The existing methods predominantly extract features by segmenting feature maps, which overlook the holistic semantic distribution and structural information of objects, resulting in loss of image information. To address these challenges, dilated neighborhood attention Transformer is employed as the feature extraction backbone, and Multi-feature representations based on Multi-scale Hierarchical Contextual Aggregation (MMHCA) is proposed. In the proposed MMHCA method, the multi-scale hierarchical contextual aggregation method is utilized to extract contextual information from local to global across various granularity levels, establishing feature associations of contextual information with global and local information in the image. Subsequently, the multi-feature representations method is utilized to obtain rich discriminative feature information, bolstering the robustness of model in scenarios characterized by positional shifts, varying distances, and scale ambiguities. Comprehensive experiments conducted on the extensively utilized University-1652 and SUES-200 benchmarks indicate that the MMHCA method surpasses the existing techniques, showing outstanding results in UAV localization and navigation.
KW - Geo-localization
KW - Hierarchical contextual aggregation
KW - Image retrieval
KW - Multi-feature representations
KW - UAV
UR - http://www.scopus.com/inward/record.url?scp=105004798738&partnerID=8YFLogxK
U2 - 10.1016/j.cja.2024.09.018
DO - 10.1016/j.cja.2024.09.018
M3 - Article
AN - SCOPUS:105004798738
SN - 1000-9361
VL - 38
JO - Chinese Journal of Aeronautics
JF - Chinese Journal of Aeronautics
IS - 6
M1 - 103242
ER -