摘要
For current RGB-thermal-infrared (RGB-T) video tracking methods, the bounding box can not properly describe the target shape, which induces the parameter training not fully focus on the target area. In the aspect of feature representation, the single-layer deep learning features have difficulty in balancing both category semantic information and spatial structure information. Therefore, an RGB-T tracking algorithm with salient content perception and deep feature fusion is proposed in this article. Firstly, for the two modalities visible spectrum and thermal-infrared spectrum, the salient maps of the target are extracted and fused. Secondly, the fused salient map is used to optimize the weighting coefficient map of the spatial regularization term to highlight the influence of the training samples in the salient content region on the classifier training. Finally, the pre-trained convolution neural network is used to extract the multi-layer features of the two modalities. These features contain abundant information of sematic category and spatial structure, which are fused at the response level. Compared to the existing tracking algorithms, experimental results on the two RGB-T tracking datasets GTOT and RGBT210 demonstrate the effectiveness of the proposed algorithm. The proposed algorithm achieves the precision rates of 88.4% and 72.7%, respectively, while obtains the success rates of 71.9% and 51.0%.
| 投稿的翻译标题 | RGB-T Target Tracking Algorithm with Salient Content Perception and Deep Feature Fusion |
|---|---|
| 源语言 | 繁体中文 |
| 页(从-至) | 1999-2009 |
| 页数 | 11 |
| 期刊 | Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics |
| 卷 | 36 |
| 期 | 12 |
| DOI | |
| 出版状态 | 已出版 - 12月 2024 |
| 已对外发布 | 是 |
关键词
- RGB-T tracking
- correlation filter
- deep features
- feature fusion
- salient content perception
指纹
探究 '显著内容感知的深度特征融合 RGB-T 目标跟踪算法' 的科研主题。它们共同构成独一无二的指纹。引用此
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver