HPV-RCNN: Hybrid Point-Voxel Two-Stage Network for LiDAR-Based 3-D Object Detection

Chen Feng, Chao Xiang, Xiaopo Xie, Yuan Zhang, Mingchuan Yang, Xuesong Li*

*此作品的通讯作者

科研成果: 期刊稿件文章同行评审

4 引用 (Scopus)

摘要

The current two-stage detectors remarkably benefit from hybrid representation of points and 3-D voxels, but they have high time cost and leave room for improving the accuracy of small objects. On the contrary, 2-D voxel-based methods tend to have good efficiency and better performance for small objects. An intuitive idea of optimizing a two-stage algorithm is to use a 2-D voxel-based backbone. However, naive representation substitution cannot achieve optimal joint learning of each representation and may cause a decrease in accuracy. In this article, we propose hybrid point-voxel RCNN (HPV-RCNN), a novel point cloud detection network which combines the merits of points and 2-D voxels. First, we propose a multiattentive voxel feature encoding module (MAVFE) to exploit multilevel attention of multiscale voxels. We also present a partial fusion pyramid network (PFPN) to effectively integrate multiresolution features and generate high-quality proposals. Then, a multiscale region of interest (RoI)-grid pooling (MSRGP) module is proposed to adaptively abstract proposal-specific features from sampled keypoints in multiple receptive fields. In addition, a cascade attentive module (CAM) is adopted to achieve incrementally proposal refinement by subsequent multiple subnetworks. Our method reaches top performance among two-stage methods in Cyclist and Pedestrian categories on the KITTI dataset while achieving real-time inference speed. Extensive experiments on challenging roadside DAIR-V2X-I dataset also demonstrate that our method achieves superior detection performance.

源语言英语
页(从-至)3066-3076
页数11
期刊IEEE Transactions on Computational Social Systems
10
6
DOI
出版状态已出版 - 1 12月 2023

指纹

探究 'HPV-RCNN: Hybrid Point-Voxel Two-Stage Network for LiDAR-Based 3-D Object Detection' 的科研主题。它们共同构成独一无二的指纹。

引用此