Feature Alignment in Anchor-Free Object Detection

Feng Gao; Yeyun Cai; Fang Deng; Chengpu Yu; Jie Chen

doi:10.1109/TCSVT.2023.3241993

Feng Gao, Yeyun Cai, Fang Deng^*, Chengpu Yu, Jie Chen

^*Corresponding author for this work

School of Automation

Research output: Contribution to journal › Article › peer-review

13 Citations (Scopus)

Abstract

Most anchor-free methods perform object detection using dense recommendation, which assumes that one point can simultaneously conduct accurate category prediction and regression estimation. However, due to different task drivers, valid features for classification and regression may locate at distinct areas in the training phase. This problem is called feature misalignment. To solve it, we propose a new feature alignment method based on anchor-free object detector. Firstly, a global receptive field adaptor (G-RFA) is designed by incorporating the feature pyramid networks (FPN) with the global attention mechanism, and forward features are further fine-tuned with a deformable-subnet (De-Subnet) to remove the influence of redundant contextual information. Then, a new feature filter strategy with a misalignment score is proposed to guide the network to focus on sampling points with aligned features. In addition, we establish mutually independent multi-layer quality distributions to model the priori information of an object on different FPN levels. Equipped with our method, the classification and regression features are aligned, and the generated foreground weight map converges to the centers of classification and regression heatmaps. Experimental results show that without bells and whistles, our method achieves 49.3% AP on MS COCO test-dev under the default 2× training schedule, outperforming related methods. Besides, experiments on PASCAL VOC demonstrate the generalization ability of our method. Code is available at https://github.com/GFENGG/featurealign.

Original language	English
Pages (from-to)	3799-3810
Number of pages	12
Journal	IEEE Transactions on Circuits and Systems for Video Technology
Volume	33
Issue number	8
DOI	https://doi.org/10.1109/TCSVT.2023.3241993
Publication status	Published - 1 Aug 2023

Cite this

Gao, F., Cai, Y., Deng, F., Yu, C., & Chen, J. (2023). Feature Alignment in Anchor-Free Object Detection. IEEE Transactions on Circuits and Systems for Video Technology, 33(8), 3799-3810. https://doi.org/10.1109/TCSVT.2023.3241993

@article{fefa05d0b4fa427ea6d166d25bada138,

title = "Feature Alignment in Anchor-Free Object Detection",

abstract = "Most anchor-free methods perform object detection using dense recommendation, which assumes that one point can simultaneously conduct accurate category prediction and regression estimation. However, due to different task drivers, valid features for classification and regression may locate at distinct areas in the training phase. This problem is called feature misalignment. To solve it, we propose a new feature alignment method based on anchor-free object detector. Firstly, a global receptive field adaptor (G-RFA) is designed by incorporating the feature pyramid networks (FPN) with the global attention mechanism, and forward features are further fine-tuned with a deformable-subnet (De-Subnet) to remove the influence of redundant contextual information. Then, a new feature filter strategy with a misalignment score is proposed to guide the network to focus on sampling points with aligned features. In addition, we establish mutually independent multi-layer quality distributions to model the priori information of an object on different FPN levels. Equipped with our method, the classification and regression features are aligned, and the generated foreground weight map converges to the centers of classification and regression heatmaps. Experimental results show that without bells and whistles, our method achieves 49.3% AP on MS COCO test-dev under the default 2× training schedule, outperforming related methods. Besides, experiments on PASCAL VOC demonstrate the generalization ability of our method. Code is available at https://github.com/GFENGG/featurealign.",

keywords = "Object detection, anchor-free models, feature alignment",

author = "Feng Gao and Yeyun Cai and Fang Deng and Chengpu Yu and Jie Chen",

note = "Publisher Copyright: {\textcopyright} 1991-2012 IEEE.",

year = "2023",

month = aug,

day = "1",

doi = "10.1109/TCSVT.2023.3241993",

language = "English",

volume = "33",

pages = "3799--3810",

journal = "IEEE Transactions on Circuits and Systems for Video Technology",

issn = "1051-8215",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

number = "8",

}

TY - JOUR

T1 - Feature Alignment in Anchor-Free Object Detection

AU - Gao, Feng

AU - Cai, Yeyun

AU - Deng, Fang

AU - Yu, Chengpu

AU - Chen, Jie

PY - 2023/8/1

Y1 - 2023/8/1

N2 - Most anchor-free methods perform object detection using dense recommendation, which assumes that one point can simultaneously conduct accurate category prediction and regression estimation. However, due to different task drivers, valid features for classification and regression may locate at distinct areas in the training phase. This problem is called feature misalignment. To solve it, we propose a new feature alignment method based on anchor-free object detector. Firstly, a global receptive field adaptor (G-RFA) is designed by incorporating the feature pyramid networks (FPN) with the global attention mechanism, and forward features are further fine-tuned with a deformable-subnet (De-Subnet) to remove the influence of redundant contextual information. Then, a new feature filter strategy with a misalignment score is proposed to guide the network to focus on sampling points with aligned features. In addition, we establish mutually independent multi-layer quality distributions to model the priori information of an object on different FPN levels. Equipped with our method, the classification and regression features are aligned, and the generated foreground weight map converges to the centers of classification and regression heatmaps. Experimental results show that without bells and whistles, our method achieves 49.3% AP on MS COCO test-dev under the default 2× training schedule, outperforming related methods. Besides, experiments on PASCAL VOC demonstrate the generalization ability of our method. Code is available at https://github.com/GFENGG/featurealign.

AB - Most anchor-free methods perform object detection using dense recommendation, which assumes that one point can simultaneously conduct accurate category prediction and regression estimation. However, due to different task drivers, valid features for classification and regression may locate at distinct areas in the training phase. This problem is called feature misalignment. To solve it, we propose a new feature alignment method based on anchor-free object detector. Firstly, a global receptive field adaptor (G-RFA) is designed by incorporating the feature pyramid networks (FPN) with the global attention mechanism, and forward features are further fine-tuned with a deformable-subnet (De-Subnet) to remove the influence of redundant contextual information. Then, a new feature filter strategy with a misalignment score is proposed to guide the network to focus on sampling points with aligned features. In addition, we establish mutually independent multi-layer quality distributions to model the priori information of an object on different FPN levels. Equipped with our method, the classification and regression features are aligned, and the generated foreground weight map converges to the centers of classification and regression heatmaps. Experimental results show that without bells and whistles, our method achieves 49.3% AP on MS COCO test-dev under the default 2× training schedule, outperforming related methods. Besides, experiments on PASCAL VOC demonstrate the generalization ability of our method. Code is available at https://github.com/GFENGG/featurealign.

KW - Object detection

KW - anchor-free models

KW - feature alignment

UR - http://www.scopus.com/inward/record.url?scp=85148440959&partnerID=8YFLogxK

U2 - 10.1109/TCSVT.2023.3241993

DO - 10.1109/TCSVT.2023.3241993

M3 - Article

AN - SCOPUS:85148440959

SN - 1051-8215

VL - 33

SP - 3799

EP - 3810

JO - IEEE Transactions on Circuits and Systems for Video Technology

JF - IEEE Transactions on Circuits and Systems for Video Technology

IS - 8

ER -
