跳到主要导航 跳到搜索 跳到主要内容

Superpixel-based Visual Feature Enhancement for Compositional Zero-Shot Learning

  • Wenlong Du
  • , Xianglin Bao
  • , Xiaofeng Xu*
  • , Xingyu Lu
  • , Ruiheng Zhang
  • *此作品的通讯作者
  • Anhui Polytechnic University
  • Nanjing University of Science and Technology
  • Beijing Institute of Technology

科研成果: 期刊稿件文章同行评审

摘要

Compositional Zero-Shot Learning (CZSL) is a challenging machine learning task that recognizes new compositional concepts by leveraging learned concepts such as attribute-object combinations. Previous research depended on visual attributes derived from networks pre-trained in object categorization. These approaches are limited in capturing the subtleties of attribute distinctions and fail to account for the critical contextual interactions between attributes and visual objects. To address this problem, in this work, we draw inspiration from superpixels and introduce the Superpixel-based Visual Feature Enhancement (SVFE) model for the compositional zero-shot learning task. In the proposed approach, an innovative superpixel integration strategy is designed to meticulously disentangle and represent the visual concepts of states and objects with finer granularity. Then, we introduce a novel Fourier spectral layer that harnesses the frequency domain to capture global image features and dynamically adjusts component contributions to enhance the local detail representation. Furthermore, we propose a long-range fusion module to optimize the synergy between the local and global features, thereby fortifying the model’s acuity in discerning intricate compositional relationships. Through rigorous experiments on standard CZSL benchmark datasets, the proposed SVFE model demonstrates significant improvement over other state-of-the-art methods in both open-world and closed-world CZSL scenarios.

源语言英语
文章编号104414
期刊Information Processing and Management
63
2
DOI
出版状态已出版 - 3月 2026
已对外发布

指纹

探究 'Superpixel-based Visual Feature Enhancement for Compositional Zero-Shot Learning' 的科研主题。它们共同构成独一无二的指纹。

引用此