SD2-ReID: A semantic-stylistic decoupled distillation framework for robust multi-modal object re-identification

  • Yonghao Yan
  • Meijing Gao*
  • Yang Bai
  • Xu Chen
  • Bingzhou Sun
  • Huanyu Sun
  • Sibo Chen
  • *Corresponding author for this work
  • Beijing Institute of Technology
  • Yanshan University
  • Beijing Institute of Aerospace Information

Research output: Contribution to journal › Article › peer-review

Abstract

The core challenge of multi-modal object re-identification (ReID) lies in reconciling the style discrepancies across modalities with the semantic consistency of identity. However, existing methods struggle to separate semantic features from modality-specific styles, so semantic representations are contaminated by noise and recognition performance suffers. To address these issues, we propose a multi-modal re-identification framework based on semantic-stylistic decoupled distillation, named SD2-ReID (Semantic-Stylistic Decoupled Distillation for ReID), which aims to improve modal consistency and cross-modal semantic discrimination. First, we design a Hybrid Multi-modal Feature Extractor (HMFE) that employs a shared shallow structure and modality-specific deep branches to achieve fine-grained feature extraction, improving learning efficiency while preserving modality-specific characteristics. Second, we design a Decoupled Distillation Module (DDM) that explicitly separates semantic and stylistic features through the dual constraints of semantic and style distillation, improving cross-modal semantic consistency and discriminative ability. Finally, we propose an attention-guided masking strategy and integrate intra-modal and cross-modal contrastive learning to construct a Hierarchical Self-supervised Learning Module (HSLM), enhancing the model's robustness to local occlusions and style variations. Together, these components enhance semantic consistency, modal invariance, and feature robustness. Unlike existing methods, SD2-ReID does not require a dedicated multi-modal fusion module and introduces no additional overhead at inference, balancing recognition performance and inference efficiency. Experiments on three multi-modal object ReID benchmarks validate the effectiveness of our method.
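The abstract does not give the masking rule or the loss formulation used in HSLM, so the following is only a minimal sketch of the two ideas it names: attention-guided masking and cross-modal contrastive learning. It assumes that the most-attended patch tokens are the ones masked and that the contrastive term takes the common InfoNCE form; both are illustrative assumptions, and the function names `attention_guided_mask` and `info_nce` are hypothetical, not taken from the paper.

```python
import math

def attention_guided_mask(tokens, attn, mask_ratio=0.5):
    """Zero out the most-attended patch tokens (assumed masking rule)."""
    n_mask = int(len(tokens) * mask_ratio)
    # Indices of the n_mask patches with the highest attention scores.
    order = sorted(range(len(attn)), key=lambda i: attn[i], reverse=True)
    top = set(order[:n_mask])
    return [[0.0] * len(t) if i in top else list(t) for i, t in enumerate(tokens)]

def _normalize(v):
    """Scale a vector to unit L2 norm (zero vectors are left unchanged)."""
    norm = math.sqrt(sum(x * x for x in v)) or 1.0
    return [x / norm for x in v]

def info_nce(anchors, positives, temperature=0.1):
    """InfoNCE loss (assumed form): anchors[i] should match positives[i] only."""
    a = [_normalize(v) for v in anchors]
    p = [_normalize(v) for v in positives]
    total = 0.0
    for i, ai in enumerate(a):
        # Cosine similarity of anchor i to every candidate, scaled by temperature.
        logits = [sum(x * y for x, y in zip(ai, pj)) / temperature for pj in p]
        # Numerically stable log-sum-exp over all candidates.
        m = max(logits)
        log_denom = m + math.log(sum(math.exp(l - m) for l in logits))
        total += log_denom - logits[i]
    return total / len(a)
```

For the cross-modal term, `anchors` and `positives` would hold embeddings of the same identities from two different modalities, so the loss falls as same-identity pairs align across modalities. An intra-modal term could reuse `info_nce` with an unmasked view and a masked view produced by `attention_guided_mask`.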

Original language: English
Article number: 108719
Journal: Neural Networks
Volume: 198
DOI
Publication status: Published - Jun 2026
