Rega-Net: Retina Gabor Attention for Deep Convolutional Neural Networks

Chun Bao; Jie Cao; Yaqian Ning; Yang Cheng; Qun Hao

doi:10.1109/LGRS.2023.3270186

Rega-Net: Retina Gabor Attention for Deep Convolutional Neural Networks

Chun Bao, Jie Cao^*, Yaqian Ning, Yang Cheng, Qun Hao

^*Corresponding author for this work

School of Optics and Photonics

Beijing Institute of Technology

Research output: Contribution to journal › Article › peer-review

Abstract

Extensive research works demonstrate that the attention mechanism in convolutional neural networks (CNNs) effectively improves accuracy. Nevertheless, few works design attention mechanisms using large receptive fields. In this work, we propose a novel attention method named Rega-Net to increase CNN accuracy by enlarging the receptive field. To the best of our knowledge, increasing the receptive field of the CNN requires increasing the size of the convolution kernel, which also increases the number of parameters. For solving this problem, we design convolutional kernels to resemble the nonuniformly distributed structure inspired by the mechanism of the human retina. Then, we sample variable-resolution values in the Gabor function distribution and fill these values in retina-like kernels. This distribution allows essential features to be more visible in the center position of the receptive field. We further design an attention module including these retina-like kernels. Experiments demonstrate that our Rega-Net achieves 79.96% Top-1 accuracy on ImageNet-1k for classification and 43.1% mAP on COCO2017 for object detection. The mAP of the Rega-Net increased by up to 3.5% compared to baseline networks.

Original language	English
Article number	6004905
Journal	IEEE Geoscience and Remote Sensing Letters
Volume	20
DOIs	https://doi.org/10.1109/LGRS.2023.3270186
Publication status	Published - 2023

Keywords

Attention mechanism
Gabor
retina-like kernels

Access to Document

10.1109/LGRS.2023.3270186

Cite this

@article{97ec5dfd73c0477a8996964f585cc7e0,

title = "Rega-Net: Retina Gabor Attention for Deep Convolutional Neural Networks",

abstract = "Extensive research works demonstrate that the attention mechanism in convolutional neural networks (CNNs) effectively improves accuracy. Nevertheless, few works design attention mechanisms using large receptive fields. In this work, we propose a novel attention method named Rega-Net to increase CNN accuracy by enlarging the receptive field. To the best of our knowledge, increasing the receptive field of the CNN requires increasing the size of the convolution kernel, which also increases the number of parameters. For solving this problem, we design convolutional kernels to resemble the nonuniformly distributed structure inspired by the mechanism of the human retina. Then, we sample variable-resolution values in the Gabor function distribution and fill these values in retina-like kernels. This distribution allows essential features to be more visible in the center position of the receptive field. We further design an attention module including these retina-like kernels. Experiments demonstrate that our Rega-Net achieves 79.96% Top-1 accuracy on ImageNet-1k for classification and 43.1% mAP on COCO2017 for object detection. The mAP of the Rega-Net increased by up to 3.5% compared to baseline networks.",

keywords = "Attention mechanism, Gabor, retina-like kernels",

author = "Chun Bao and Jie Cao and Yaqian Ning and Yang Cheng and Qun Hao",

note = "Publisher Copyright: {\textcopyright} 2004-2012 IEEE.",

year = "2023",

doi = "10.1109/LGRS.2023.3270186",

language = "English",

volume = "20",

journal = "IEEE Geoscience and Remote Sensing Letters",

issn = "1545-598X",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

}

TY - JOUR

T1 - Rega-Net

T2 - Retina Gabor Attention for Deep Convolutional Neural Networks

AU - Bao, Chun

AU - Cao, Jie

AU - Ning, Yaqian

AU - Cheng, Yang

AU - Hao, Qun

PY - 2023

Y1 - 2023

N2 - Extensive research works demonstrate that the attention mechanism in convolutional neural networks (CNNs) effectively improves accuracy. Nevertheless, few works design attention mechanisms using large receptive fields. In this work, we propose a novel attention method named Rega-Net to increase CNN accuracy by enlarging the receptive field. To the best of our knowledge, increasing the receptive field of the CNN requires increasing the size of the convolution kernel, which also increases the number of parameters. For solving this problem, we design convolutional kernels to resemble the nonuniformly distributed structure inspired by the mechanism of the human retina. Then, we sample variable-resolution values in the Gabor function distribution and fill these values in retina-like kernels. This distribution allows essential features to be more visible in the center position of the receptive field. We further design an attention module including these retina-like kernels. Experiments demonstrate that our Rega-Net achieves 79.96% Top-1 accuracy on ImageNet-1k for classification and 43.1% mAP on COCO2017 for object detection. The mAP of the Rega-Net increased by up to 3.5% compared to baseline networks.

AB - Extensive research works demonstrate that the attention mechanism in convolutional neural networks (CNNs) effectively improves accuracy. Nevertheless, few works design attention mechanisms using large receptive fields. In this work, we propose a novel attention method named Rega-Net to increase CNN accuracy by enlarging the receptive field. To the best of our knowledge, increasing the receptive field of the CNN requires increasing the size of the convolution kernel, which also increases the number of parameters. For solving this problem, we design convolutional kernels to resemble the nonuniformly distributed structure inspired by the mechanism of the human retina. Then, we sample variable-resolution values in the Gabor function distribution and fill these values in retina-like kernels. This distribution allows essential features to be more visible in the center position of the receptive field. We further design an attention module including these retina-like kernels. Experiments demonstrate that our Rega-Net achieves 79.96% Top-1 accuracy on ImageNet-1k for classification and 43.1% mAP on COCO2017 for object detection. The mAP of the Rega-Net increased by up to 3.5% compared to baseline networks.

KW - Attention mechanism

KW - Gabor

KW - retina-like kernels

UR - http://www.scopus.com/inward/record.url?scp=85159721586&partnerID=8YFLogxK

U2 - 10.1109/LGRS.2023.3270186

DO - 10.1109/LGRS.2023.3270186

M3 - Article

AN - SCOPUS:85159721586

SN - 1545-598X

VL - 20

JO - IEEE Geoscience and Remote Sensing Letters

JF - IEEE Geoscience and Remote Sensing Letters

M1 - 6004905

ER -

Rega-Net: Retina Gabor Attention for Deep Convolutional Neural Networks

Abstract

Keywords

Access to Document

Other files and links

Fingerprint

Cite this