PA3DNet: 3-D Vehicle Detection with Pseudo Shape Segmentation and Adaptive Camera-LiDAR Fusion

Meiling Wang; Lin Zhao; Yufeng Yue

doi:10.1109/TII.2023.3241585

PA3DNet: 3-D Vehicle Detection with Pseudo Shape Segmentation and Adaptive Camera-LiDAR Fusion

Meiling Wang, Lin Zhao, Yufeng Yue^*

^*Corresponding author for this work

School of Automation

Beijing Institute of Technology

Research output: Contribution to journal › Article › peer-review

19 Citations (Scopus)

Abstract

3-D vehicle detection is a key perception technique in autonomous driving. In this article, a novel 3-D vehicle detection framework that fuses camera images and Light Detection and Ranging (LiDAR) point clouds is proposed, named PA3DNet. The key novelties of PA3DNet are the proposing of a pseudo shape segmentation (PSS) model and an adaptive camera-LiDAR fusion (ACLF) module. The PSS model leverages self-assembled vehicle prototypes to learn shape-aware vehicle features. In order to achieve the adaptive fusion between visual semantics and LiDAR point features, learnable weight parameters are developed in the ACLF module to formulate an implicit complementarity between the two modalities. Extensive experiments on the widely used autonomous driving KITTI dataset demonstrate that PA3DNet achieves competitive accuracy when compared to advanced methods. It achieves 5.37% higher average precision (AP) on easy difficulty of 30-50 m and 9.67% higher AP on moderate difficulty of >50 m.

Original language	English
Pages (from-to)	10693-10703
Number of pages	11
Journal	IEEE Transactions on Industrial Informatics
Volume	19
Issue number	11
DOIs	https://doi.org/10.1109/TII.2023.3241585
Publication status	Published - 1 Nov 2023

Keywords

3-D object detection
autonomous driving
multimodal fusion

Access to Document

10.1109/TII.2023.3241585

Cite this

Wang, M., Zhao, L., & Yue, Y. (2023). PA3DNet: 3-D Vehicle Detection with Pseudo Shape Segmentation and Adaptive Camera-LiDAR Fusion. IEEE Transactions on Industrial Informatics, 19(11), 10693-10703. https://doi.org/10.1109/TII.2023.3241585

@article{50322952871e4fc99cb6e0c5e17dfc6f,

title = "PA3DNet: 3-D Vehicle Detection with Pseudo Shape Segmentation and Adaptive Camera-LiDAR Fusion",

abstract = "3-D vehicle detection is a key perception technique in autonomous driving. In this article, a novel 3-D vehicle detection framework that fuses camera images and Light Detection and Ranging (LiDAR) point clouds is proposed, named PA3DNet. The key novelties of PA3DNet are the proposing of a pseudo shape segmentation (PSS) model and an adaptive camera-LiDAR fusion (ACLF) module. The PSS model leverages self-assembled vehicle prototypes to learn shape-aware vehicle features. In order to achieve the adaptive fusion between visual semantics and LiDAR point features, learnable weight parameters are developed in the ACLF module to formulate an implicit complementarity between the two modalities. Extensive experiments on the widely used autonomous driving KITTI dataset demonstrate that PA3DNet achieves competitive accuracy when compared to advanced methods. It achieves 5.37% higher average precision (AP) on easy difficulty of 30-50 m and 9.67% higher AP on moderate difficulty of >50 m.",

keywords = "3-D object detection, autonomous driving, multimodal fusion",

author = "Meiling Wang and Lin Zhao and Yufeng Yue",

note = "Publisher Copyright: {\textcopyright} 2005-2012 IEEE.",

year = "2023",

month = nov,

day = "1",

doi = "10.1109/TII.2023.3241585",

language = "English",

volume = "19",

pages = "10693--10703",

journal = "IEEE Transactions on Industrial Informatics",

issn = "1551-3203",

publisher = "IEEE Computer Society",

number = "11",

}

TY - JOUR

T1 - PA3DNet

T2 - 3-D Vehicle Detection with Pseudo Shape Segmentation and Adaptive Camera-LiDAR Fusion

AU - Wang, Meiling

AU - Zhao, Lin

AU - Yue, Yufeng

PY - 2023/11/1

Y1 - 2023/11/1

N2 - 3-D vehicle detection is a key perception technique in autonomous driving. In this article, a novel 3-D vehicle detection framework that fuses camera images and Light Detection and Ranging (LiDAR) point clouds is proposed, named PA3DNet. The key novelties of PA3DNet are the proposing of a pseudo shape segmentation (PSS) model and an adaptive camera-LiDAR fusion (ACLF) module. The PSS model leverages self-assembled vehicle prototypes to learn shape-aware vehicle features. In order to achieve the adaptive fusion between visual semantics and LiDAR point features, learnable weight parameters are developed in the ACLF module to formulate an implicit complementarity between the two modalities. Extensive experiments on the widely used autonomous driving KITTI dataset demonstrate that PA3DNet achieves competitive accuracy when compared to advanced methods. It achieves 5.37% higher average precision (AP) on easy difficulty of 30-50 m and 9.67% higher AP on moderate difficulty of >50 m.

AB - 3-D vehicle detection is a key perception technique in autonomous driving. In this article, a novel 3-D vehicle detection framework that fuses camera images and Light Detection and Ranging (LiDAR) point clouds is proposed, named PA3DNet. The key novelties of PA3DNet are the proposing of a pseudo shape segmentation (PSS) model and an adaptive camera-LiDAR fusion (ACLF) module. The PSS model leverages self-assembled vehicle prototypes to learn shape-aware vehicle features. In order to achieve the adaptive fusion between visual semantics and LiDAR point features, learnable weight parameters are developed in the ACLF module to formulate an implicit complementarity between the two modalities. Extensive experiments on the widely used autonomous driving KITTI dataset demonstrate that PA3DNet achieves competitive accuracy when compared to advanced methods. It achieves 5.37% higher average precision (AP) on easy difficulty of 30-50 m and 9.67% higher AP on moderate difficulty of >50 m.

KW - 3-D object detection

KW - autonomous driving

KW - multimodal fusion

UR - http://www.scopus.com/inward/record.url?scp=85148443868&partnerID=8YFLogxK

U2 - 10.1109/TII.2023.3241585

DO - 10.1109/TII.2023.3241585

M3 - Article

AN - SCOPUS:85148443868

SN - 1551-3203

VL - 19

SP - 10693

EP - 10703

JO - IEEE Transactions on Industrial Informatics

JF - IEEE Transactions on Industrial Informatics

IS - 11

ER -

PA3DNet: 3-D Vehicle Detection with Pseudo Shape Segmentation and Adaptive Camera-LiDAR Fusion

Abstract

Keywords

Access to Document

Other files and links

Fingerprint

Cite this