ALMA: Adjustable Location and Multi-Angle Attention for Fine-Grained Visual Classification

Boyu Ding, Xiaofeng Xu*, Xianglin Bao, Nan Yan, Ruiheng Zhang

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Fine-grained visual classification (FGVC) is a challenging but realistic problem that recognizes objects from common categories with subtle differences. Most previous work focused on identifying more regional features while neglecting the fact that these regions still contain a large amount of secondary information. To alleviate the interference of the secondary information, in this paper, we propose a novel Adjustable Location and Multi-angle Attention (ALMA) network to solve the FGVC problem. ALMA consists of two branches, i.e. the adjustable location module and the multi-angle attention module. Specifically, in the adjustable localization module, we first locate the interested area of the object and obtain the adjusted cropped area by adjusting the interested area through the background masking. Then, the adjusted regions will be gathered to locate objects with better prediction performance. Furthermore, we design the multi-angle attention module to gradually maximize the difference between the original attention map and the randomly selected attention map. Consequently, the model can focus on the main information which represents the entire object. To evaluate the effectiveness of the proposed model, we conduct extensive experiments on three public fine-grained benchmark datasets. Experimental results demonstrate that the proposed ALMA model has significant superiority over other FGVC methods.

Original languageEnglish
Title of host publicationProceedings of the 2024 27th International Conference on Computer Supported Cooperative Work in Design, CSCWD 2024
EditorsWeiming Shen, Weiming Shen, Jean-Paul Barthes, Junzhou Luo, Tie Qiu, Xiaobo Zhou, Jinghui Zhang, Haibin Zhu, Kunkun Peng, Tianyi Xu, Ning Chen
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages2967-2972
Number of pages6
ISBN (Electronic)9798350349184
DOIs
Publication statusPublished - 2024
Event27th International Conference on Computer Supported Cooperative Work in Design, CSCWD 2024 - Tianjin, China
Duration: 8 May 202410 May 2024

Publication series

NameProceedings of the 2024 27th International Conference on Computer Supported Cooperative Work in Design, CSCWD 2024

Conference

Conference27th International Conference on Computer Supported Cooperative Work in Design, CSCWD 2024
Country/TerritoryChina
CityTianjin
Period8/05/2410/05/24

Keywords

  • Adjustable location
  • Background masking
  • Fine-grained visual classification
  • Image cropping
  • Multi-angle attention

Fingerprint

Dive into the research topics of 'ALMA: Adjustable Location and Multi-Angle Attention for Fine-Grained Visual Classification'. Together they form a unique fingerprint.

Cite this