Region-adaptive Concept Aggregation for Few-shot Visual Recognition

Mengya Han; Yibing Zhan; Baosheng Yu; Yong Luo; Han Hu; Bo Du; Yonggang Wen; Dacheng Tao

doi:10.1007/s11633-022-1358-8

Region-adaptive Concept Aggregation for Few-shot Visual Recognition

Mengya Han, Yibing Zhan, Baosheng Yu, Yong Luo^*, Han Hu, Bo Du, Yonggang Wen, Dacheng Tao

^*此作品的通讯作者

信息与电子学院

科研成果: 期刊稿件 › 文章 › 同行评审

4 引用（Scopus）

摘要

Few-shot learning (FSL) aims to learn novel concepts from very limited examples. However, most FSL methods suffer from the issue of lacking robustness in concept learning. Specifically, existing FSL methods usually ignore the diversity of region contents that may contain concept-irrelevant information such as the background, which would introduce bias/noise and degrade the performance of conceptual representation learning. To address the above-mentioned issue, we propose a novel metric-based FSL method termed region-adaptive concept aggregation network or RCA-Net. Specifically, we devise a region-adaptive concept aggregator (RCA) to model the relationships of different regions and capture the conceptual information in different regions, which are then integrated in a weighted average manner to obtain the conceptual representation. Consequently, robust concept learning can be achieved by focusing more on the concept-relevant information and less on the conceptual-irrelevant information. We perform extensive experiments on three popular visual recognition benchmarks to demonstrate the superiority of RCA-Net for robust few-shot learning. In particular, on the Caltech-UCSD Birds-200-2011 (CUB200) dataset, the proposed RCA-Net significantly improves 1-shot accuracy from 74.76% to 78.03% and 5-shot accuracy from 86.84% to 89.83% compared with the most competitive counterpart.

源语言	英语
页（从-至）	554-568
页数	15
期刊	Machine Intelligence Research
卷	20
期	4
DOI	https://doi.org/10.1007/s11633-022-1358-8
出版状态	已出版 - 8月 2023

访问文件

10.1007/s11633-022-1358-8

其它文件与链接

链接到 Scopus 的出版物

引用此

@article{9a5a6c7ca5d943c8acd09c1e712961fa,

title = "Region-adaptive Concept Aggregation for Few-shot Visual Recognition",

abstract = "Few-shot learning (FSL) aims to learn novel concepts from very limited examples. However, most FSL methods suffer from the issue of lacking robustness in concept learning. Specifically, existing FSL methods usually ignore the diversity of region contents that may contain concept-irrelevant information such as the background, which would introduce bias/noise and degrade the performance of conceptual representation learning. To address the above-mentioned issue, we propose a novel metric-based FSL method termed region-adaptive concept aggregation network or RCA-Net. Specifically, we devise a region-adaptive concept aggregator (RCA) to model the relationships of different regions and capture the conceptual information in different regions, which are then integrated in a weighted average manner to obtain the conceptual representation. Consequently, robust concept learning can be achieved by focusing more on the concept-relevant information and less on the conceptual-irrelevant information. We perform extensive experiments on three popular visual recognition benchmarks to demonstrate the superiority of RCA-Net for robust few-shot learning. In particular, on the Caltech-UCSD Birds-200-2011 (CUB200) dataset, the proposed RCA-Net significantly improves 1-shot accuracy from 74.76% to 78.03% and 5-shot accuracy from 86.84% to 89.83% compared with the most competitive counterpart.",

keywords = "Few-shot learning, concept learning, concept-aggregation, metric-based meta learning, region-adaptive",

author = "Mengya Han and Yibing Zhan and Baosheng Yu and Yong Luo and Han Hu and Bo Du and Yonggang Wen and Dacheng Tao",

note = "Publisher Copyright: {\textcopyright} 2023, Institute of Automation, Chinese Academy of Sciences and Springer-Verlag GmbH Germany, part of Springer Nature.",

year = "2023",

month = aug,

doi = "10.1007/s11633-022-1358-8",

language = "English",

volume = "20",

pages = "554--568",

journal = "Machine Intelligence Research",

issn = "2731-538X",

publisher = "Chinese Academy of Sciences",

number = "4",

}

TY - JOUR

T1 - Region-adaptive Concept Aggregation for Few-shot Visual Recognition

AU - Han, Mengya

AU - Zhan, Yibing

AU - Yu, Baosheng

AU - Luo, Yong

AU - Hu, Han

AU - Du, Bo

AU - Wen, Yonggang

AU - Tao, Dacheng

PY - 2023/8

Y1 - 2023/8

N2 - Few-shot learning (FSL) aims to learn novel concepts from very limited examples. However, most FSL methods suffer from the issue of lacking robustness in concept learning. Specifically, existing FSL methods usually ignore the diversity of region contents that may contain concept-irrelevant information such as the background, which would introduce bias/noise and degrade the performance of conceptual representation learning. To address the above-mentioned issue, we propose a novel metric-based FSL method termed region-adaptive concept aggregation network or RCA-Net. Specifically, we devise a region-adaptive concept aggregator (RCA) to model the relationships of different regions and capture the conceptual information in different regions, which are then integrated in a weighted average manner to obtain the conceptual representation. Consequently, robust concept learning can be achieved by focusing more on the concept-relevant information and less on the conceptual-irrelevant information. We perform extensive experiments on three popular visual recognition benchmarks to demonstrate the superiority of RCA-Net for robust few-shot learning. In particular, on the Caltech-UCSD Birds-200-2011 (CUB200) dataset, the proposed RCA-Net significantly improves 1-shot accuracy from 74.76% to 78.03% and 5-shot accuracy from 86.84% to 89.83% compared with the most competitive counterpart.

AB - Few-shot learning (FSL) aims to learn novel concepts from very limited examples. However, most FSL methods suffer from the issue of lacking robustness in concept learning. Specifically, existing FSL methods usually ignore the diversity of region contents that may contain concept-irrelevant information such as the background, which would introduce bias/noise and degrade the performance of conceptual representation learning. To address the above-mentioned issue, we propose a novel metric-based FSL method termed region-adaptive concept aggregation network or RCA-Net. Specifically, we devise a region-adaptive concept aggregator (RCA) to model the relationships of different regions and capture the conceptual information in different regions, which are then integrated in a weighted average manner to obtain the conceptual representation. Consequently, robust concept learning can be achieved by focusing more on the concept-relevant information and less on the conceptual-irrelevant information. We perform extensive experiments on three popular visual recognition benchmarks to demonstrate the superiority of RCA-Net for robust few-shot learning. In particular, on the Caltech-UCSD Birds-200-2011 (CUB200) dataset, the proposed RCA-Net significantly improves 1-shot accuracy from 74.76% to 78.03% and 5-shot accuracy from 86.84% to 89.83% compared with the most competitive counterpart.

KW - Few-shot learning

KW - concept learning

KW - concept-aggregation

KW - metric-based meta learning

KW - region-adaptive

UR - http://www.scopus.com/inward/record.url?scp=85149140622&partnerID=8YFLogxK

U2 - 10.1007/s11633-022-1358-8

DO - 10.1007/s11633-022-1358-8

M3 - Article

AN - SCOPUS:85149140622

SN - 2731-538X

VL - 20

SP - 554

EP - 568

JO - Machine Intelligence Research

JF - Machine Intelligence Research

IS - 4

ER -

Region-adaptive Concept Aggregation for Few-shot Visual Recognition

摘要

访问文件

其它文件与链接

指纹

引用此