ReAGFormer: Reaggregation Transformer with Affine Group Features for 3D Object Detection

Chenguang Lu, Kang Yue, Yue Liu*

*此作品的通讯作者

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Direct detection of 3D objects from point clouds is a challenging task due to sparsity and irregularity of point clouds. To capture point features from the raw point clouds for 3D object detection, most previous researches utilize PointNet and its variants as the feature learning backbone and have seen encouraging results. However, these methods capture point features independently without modeling the interaction between points, and simple symmetric functions cannot adequately aggregate local contextual features, which are vital for 3D object recognition. To address such limitations, we propose ReAGFormer, a reaggregation Transformer backbone with affine group features for point feature learning in 3D object detection, which can capture the dependencies between points on the aligned group feature space while retaining the flexible receptive fields. The key idea of ReAGFormer is to alleviate the perturbation of the point feature space by affine transformation and extract the dependencies between points using self-attention, while reaggregating the local point set features with the learned attention. Moreover, we also design multi-scale connections in the feature propagation layer to reduce the geometric information loss caused by point sampling and interpolation. Experimental results show that by equipping our method as the backbone for existing 3D object detectors, significant improvements and state-of-the-art performance are achieved over original models on SUN RGB-D and ScanNet V2 benchmarks.

源语言英语
主期刊名Computer Vision – ACCV 2022 - 16th Asian Conference on Computer Vision, 2022, Proceedings
编辑Lei Wang, Juergen Gall, Tat-Jun Chin, Imari Sato, Rama Chellappa
出版商Springer Science and Business Media Deutschland GmbH
262-279
页数18
ISBN(印刷版)9783031263187
DOI
出版状态已出版 - 2023
活动16th Asian Conference on Computer Vision, ACCV 2022 - Macao, 中国
期限: 4 12月 20228 12月 2022

出版系列

姓名Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
13841 LNCS
ISSN(印刷版)0302-9743
ISSN(电子版)1611-3349

会议

会议16th Asian Conference on Computer Vision, ACCV 2022
国家/地区中国
Macao
时期4/12/228/12/22

指纹

探究 'ReAGFormer: Reaggregation Transformer with Affine Group Features for 3D Object Detection' 的科研主题。它们共同构成独一无二的指纹。

引用此