TY - GEN
T1 - Attention-based Dynamic Filters for Anchor-free Instance Segmentation
AU - Zhang, Tong
AU - Zhang, Guoshan
AU - Yan, Min
AU - Zhang, Yueming
N1 - Publisher Copyright:
© 2021 Technical Committee on Control Theory, Chinese Association of Automation.
PY - 2021/7/26
Y1 - 2021/7/26
N2 - The convolution operation is the core of convolutional neural networks (CNNs). To make CNNs more efficient, existing works construct multi-scale representation by utilizing different filter sizes or expanding filter sizes with dilated convolutions. However, these filters have fixed parameters after training so that they are not adaptive to the input image during inference. To address this issue, we propose an attention-based dynamic filter, which is a novel design that adaptively generates filters based on image contents. We apply the proposed dynamic filter to the mask branch, named Attention-based Adaptive Con-guided mask (ACG-Mask) branch, which is added to anchor-free one-stage object detector (FCOS). Besides, we design a multi-scale head, which contains an improved Receptive Field Block (iRFB) to enhance the discriminability and robustness of the feature. We name our model as Attention-based Dynamic Filters for anchor-free Instance Segmentation (ADFInst). Extensive experiments on the fine-annotation Cityscapes and COCO datasets reveal the effectiveness of the proposed method. ADFInst achieves a new record 37.9% AP and 63.3% AP50 on the fine-annotation Cityscapes dataset and achieves 37.8% AP, 58.7% AP50, and 40.5% AP75 on COCO dataset.
AB - The convolution operation is the core of convolutional neural networks (CNNs). To make CNNs more efficient, existing works construct multi-scale representation by utilizing different filter sizes or expanding filter sizes with dilated convolutions. However, these filters have fixed parameters after training so that they are not adaptive to the input image during inference. To address this issue, we propose an attention-based dynamic filter, which is a novel design that adaptively generates filters based on image contents. We apply the proposed dynamic filter to the mask branch, named Attention-based Adaptive Con-guided mask (ACG-Mask) branch, which is added to anchor-free one-stage object detector (FCOS). Besides, we design a multi-scale head, which contains an improved Receptive Field Block (iRFB) to enhance the discriminability and robustness of the feature. We name our model as Attention-based Dynamic Filters for anchor-free Instance Segmentation (ADFInst). Extensive experiments on the fine-annotation Cityscapes and COCO datasets reveal the effectiveness of the proposed method. ADFInst achieves a new record 37.9% AP and 63.3% AP50 on the fine-annotation Cityscapes dataset and achieves 37.8% AP, 58.7% AP50, and 40.5% AP75 on COCO dataset.
KW - ADFInst
KW - Anchor-free
KW - Dynamic Filters
KW - Instance Segmentation
UR - http://www.scopus.com/inward/record.url?scp=85117278925&partnerID=8YFLogxK
U2 - 10.23919/CCC52363.2021.9550216
DO - 10.23919/CCC52363.2021.9550216
M3 - Conference contribution
AN - SCOPUS:85117278925
T3 - Chinese Control Conference, CCC
SP - 7156
EP - 7161
BT - Proceedings of the 40th Chinese Control Conference, CCC 2021
A2 - Peng, Chen
A2 - Sun, Jian
PB - IEEE Computer Society
T2 - 40th Chinese Control Conference, CCC 2021
Y2 - 26 July 2021 through 28 July 2021
ER -