跳到主要导航 跳到搜索 跳到主要内容

Bring KANs to Tracker: A Nonlinear Fusion Strategy Over Frequency Decoupling for Temporal Hyperspectral Object Tracking

  • Hanzheng Wang
  • , Wei Li*
  • , Xiang Gen Xia
  • , Bolun Cui
  • , Zhicheng Shi
  • , Hongyang Lin
  • *此作品的通讯作者
  • Beijing Institute of Technology
  • National Key Laboratory of Science and Technology on Space-Born Intelligent Information Processing
  • University of Delaware
  • China Aerospace Science and Technology Corporation
  • Changchun Champion Optics Company Ltd.

科研成果: 期刊稿件文章同行评审

摘要

Hyperspectral (HS) cameras have great potential in extracting spectral, textual, and temporal information from objects. Many existing works leverage HS data for object tracking, as it provides unique spectral features that can help address challenges like background clutter (BC) or camouflage. However, most of these methods overlook the rich temporal information available in video sequences, and many spectral–visual fusion approaches fail to extract contextual information from a global perspective, causing an insufficient understanding of an entire object by the model. To address the above issues, a spectral–temporal tracking Transformer based on a frequency-domain fusion strategy (S3T-FFS) is proposed. First, a spectral–temporal token is proposed to capture an object’s spectral information that remains unchanged in video clips, providing additional tracking cues. Second, to extract spectral semantic information, we propose a frequency-domain fusion strategy (FFS), including a frequency attention network (FAN) and a Kolmogorov–Arnold network-based convolutional unit (CuKAN), to provide spectral information for the tracking model. Specifically, FAN is designed for the simultaneous extraction and fusion of spectral features. This synchronous modeling approach decouples low-frequency and high-frequency information, adjusting their balance through frequency-domain prior knowledge and self-adaptive weights. This allows the fusion network to focus more on global information. To further extract global patterns from the low-frequency features and improve the network’s interpretability, we introduce CuKAN to extract nonlinear relationships within the decoupled frequency components, and its learnable activation function helps the model learn global patterns, thus avoiding the local overfitting in convolutional neural networks. Extensive experiments on multiple large-scale datasets illustrate the effectiveness of our proposed methods.

源语言英语
文章编号5502714
期刊IEEE Transactions on Geoscience and Remote Sensing
64
DOI
出版状态已出版 - 2026
已对外发布

指纹

探究 'Bring KANs to Tracker: A Nonlinear Fusion Strategy Over Frequency Decoupling for Temporal Hyperspectral Object Tracking' 的科研主题。它们共同构成独一无二的指纹。

引用此