One-stage anchor-free 3D vehicle detection from LiDAR sensors

Hao Li; Sanyuan Zhao; Wenjun Zhao; Libin Zhang; Jianbing Shen

doi:10.3390/s21082651

Hao Li, Sanyuan Zhao^*, Wenjun Zhao, Libin Zhang, Jianbing Shen

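^*Corresponding author for this work

School of Computer Science

Research output: Contribution to journal › Article › peer-review

19 Citations (Scopus)

Abstract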

Recent one-stage 3D detection methods generate anchor boxes with various sizes and orientations in the ground plane, then determine whether these anchor boxes contain any region of interest and adjust their edges to obtain accurate object bounding boxes. Anchor-based algorithms calculate the classification and regression labels for each anchor box during training, which is inefficient and complicated. We propose a one-stage, anchor-free 3D vehicle detection algorithm based on LiDAR point clouds. The object position is encoded as a set of keypoints in the bird’s-eye view (BEV) of point clouds. We apply a voxel/pillar feature extractor and convolutional blocks to map an unstructured point cloud to a single-channel 2D heatmap. The vehicle’s Z-axis position, dimensions, and orientation angle are regressed as additional attributes of the keypoints. Our method combines SmoothL1 loss and IoU (Intersection over Union) loss, and we apply (cos θ, sin θ) as angle regression labels, which achieves high average orientation similarity (AOS) without any direction classification tricks. During target assignment and bounding box decoding, our framework completely avoids any calculations related to anchor boxes. Our framework is trained end-to-end and performs at the same level as other one-stage anchor-based detectors.
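The abstract's (cos θ, sin θ) angle labels can be illustrated with a small sketch. This is not the authors' code: the function names `encode_angle`, `decode_angle`, and `target_distance` are hypothetical, and decoding with `atan2` is an assumption about how such a pair would typically be turned back into a heading. The sketch shows why the encoding is attractive: headings on either side of the ±π wrap-around get nearly identical targets, while opposite headings get clearly distinct ones.

```python
import math


def encode_angle(theta):
    """Map a yaw angle in radians to the regression target (cos θ, sin θ)."""
    return math.cos(theta), math.sin(theta)


def decode_angle(cos_pred, sin_pred):
    """Recover a yaw angle in [-pi, pi] from a predicted (cos, sin) pair.

    atan2 only uses the direction of the vector, so the prediction
    does not need to be exactly unit length.
    """
    return math.atan2(sin_pred, cos_pred)


def target_distance(theta_a, theta_b):
    """Euclidean distance between the encoded targets of two angles."""
    cos_a, sin_a = encode_angle(theta_a)
    cos_b, sin_b = encode_angle(theta_b)
    return math.hypot(cos_a - cos_b, sin_a - sin_b)


# Two headings just either side of the wrap-around point:
# physically 0.02 rad apart, but almost 2*pi apart as raw numbers.
a = math.pi - 0.01
b = -math.pi + 0.01
raw_gap = abs(a - b)
encoded_gap = target_distance(a, b)

# Opposite headings (front and back swapped) stay far apart after encoding.
opposite_gap = target_distance(0.3, 0.3 + math.pi)

print(f"raw gap: {raw_gap:.3f}, encoded gap: {encoded_gap:.3f}")
print(f"encoded gap for opposite headings: {opposite_gap:.3f}")
```

Regressing the raw angle would penalise the first pair heavily even though the boxes nearly coincide; with the two-component target the regression loss stays small there, and the full heading is still recoverable from a single prediction.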

Original language	English
Article number	2651
Journal	Sensors
Volume	21
Issue number	8
DOI	https://doi.org/10.3390/s21082651
Publication status	Published - 2 Apr 2021

Cite this

Li, H., Zhao, S., Zhao, W., Zhang, L., & Shen, J. (2021). One-stage anchor-free 3D vehicle detection from LiDAR sensors. Sensors, 21(8), Article 2651. https://doi.org/10.3390/s21082651

@article{600bd2e7b2e94c9882712c7f5de5825f,

title = "One-stage anchor-free {3D} vehicle detection from {LiDAR} sensors",

abstract = "Recent one-stage 3D detection methods generate anchor boxes with various sizes and orientations in the ground plane, then determine whether these anchor boxes contain any region of interest and adjust the edges of them for accurate object bounding boxes. The anchor-based algorithm calculates the classification and regression label for each anchor box during the training process, which is inefficient and complicated. We propose a one-stage, anchor-free 3D vehicle detection algorithm based on LiDAR point clouds. The object position is encoded as a set of keypoints in the bird{\textquoteright}s-eye view (BEV) of point clouds. We apply the voxel/pillar feature extractor and convolutional blocks to map an unstructured point cloud to a single-channel 2D heatmap. The vehicle{\textquoteright}s Z-axis position, dimension, and orientation angle are regressed as additional attributes of the keypoints. Our method combines SmoothL1 loss and IoU (Intersection over Union) loss, and we apply (cos θ, sin θ) as angle regression labels, which achieve high average orientation similarity (AOS) without any direction classification tricks. During the target assignment and bounding box decoding process, our framework completely avoids any calculations related to anchor boxes. Our framework is end-to-end training and stands at the same performance level as the other one-stage anchor-based detectors.",

keywords = "3D detection, Anchor-free detector, One-stage detector",

author = "Hao Li and Sanyuan Zhao and Wenjun Zhao and Libin Zhang and Jianbing Shen",

note = "Publisher Copyright: {\textcopyright} 2021 by the authors. Licensee MDPI, Basel, Switzerland.",

year = "2021",

month = apr,

day = "2",

doi = "10.3390/s21082651",

language = "English",

volume = "21",

journal = "Sensors",

issn = "1424-8220",

publisher = "Multidisciplinary Digital Publishing Institute (MDPI)",

number = "8",

}

TY - JOUR

T1 - One-stage anchor-free 3D vehicle detection from LiDAR sensors

AU - Li, Hao

AU - Zhao, Sanyuan

AU - Zhao, Wenjun

AU - Zhang, Libin

AU - Shen, Jianbing

PY - 2021/4/2

Y1 - 2021/4/2

N2 - Recent one-stage 3D detection methods generate anchor boxes with various sizes and orientations in the ground plane, then determine whether these anchor boxes contain any region of interest and adjust the edges of them for accurate object bounding boxes. The anchor-based algorithm calculates the classification and regression label for each anchor box during the training process, which is inefficient and complicated. We propose a one-stage, anchor-free 3D vehicle detection algorithm based on LiDAR point clouds. The object position is encoded as a set of keypoints in the bird’s-eye view (BEV) of point clouds. We apply the voxel/pillar feature extractor and convolutional blocks to map an unstructured point cloud to a single-channel 2D heatmap. The vehicle’s Z-axis position, dimension, and orientation angle are regressed as additional attributes of the keypoints. Our method combines SmoothL1 loss and IoU (Intersection over Union) loss, and we apply (cos θ, sin θ) as angle regression labels, which achieve high average orientation similarity (AOS) without any direction classification tricks. During the target assignment and bounding box decoding process, our framework completely avoids any calculations related to anchor boxes. Our framework is end-to-end training and stands at the same performance level as the other one-stage anchor-based detectors.

AB - Recent one-stage 3D detection methods generate anchor boxes with various sizes and orientations in the ground plane, then determine whether these anchor boxes contain any region of interest and adjust the edges of them for accurate object bounding boxes. The anchor-based algorithm calculates the classification and regression label for each anchor box during the training process, which is inefficient and complicated. We propose a one-stage, anchor-free 3D vehicle detection algorithm based on LiDAR point clouds. The object position is encoded as a set of keypoints in the bird’s-eye view (BEV) of point clouds. We apply the voxel/pillar feature extractor and convolutional blocks to map an unstructured point cloud to a single-channel 2D heatmap. The vehicle’s Z-axis position, dimension, and orientation angle are regressed as additional attributes of the keypoints. Our method combines SmoothL1 loss and IoU (Intersection over Union) loss, and we apply (cos θ, sin θ) as angle regression labels, which achieve high average orientation similarity (AOS) without any direction classification tricks. During the target assignment and bounding box decoding process, our framework completely avoids any calculations related to anchor boxes. Our framework is end-to-end training and stands at the same performance level as the other one-stage anchor-based detectors.

KW - 3D detection

KW - Anchor-free detector

KW - One-stage detector

UR - http://www.scopus.com/inward/record.url?scp=85103824853&partnerID=8YFLogxK

U2 - 10.3390/s21082651

DO - 10.3390/s21082651

M3 - Article

C2 - 33918952

AN - SCOPUS:85103824853

SN - 1424-8220

VL - 21

JO - Sensors

JF - Sensors

IS - 8

M1 - 2651

ER -
