Dual-view 3D object recognition and detection via Lidar point cloud and camera image

Jing Li; Rui Li; Jiehao Li; Junzheng Wang; Qingbin Wu; Xu Liu

doi:10.1016/j.robot.2021.103999

Dual-view 3D object recognition and detection via Lidar point cloud and camera image

Jing Li, Rui Li, Jiehao Li^*, Junzheng Wang, Qingbin Wu, Xu Liu

^*Corresponding author for this work

School of Automation

Research output: Contribution to journal › Article › peer-review

44 Citations (Scopus)

Abstract

When it comes to the accuracy of autonomous motion, it is necessary to consider object detection and recognition, especially for the robot application of the complex environment. This paper investigates novel dual-view 3D object detection networks combined with the Lidar point cloud and RGB image in engineering scenarios. The developed system is applied for autonomous vehicles that the detected objects are cars, cyclists, and pedestrians. Firstly, a feature extraction network based on the residual module is presented, and the specific features are from the RGB image. The point cloud is transformed into Bird's Eye View (BEV), and the BEV feature extraction network is built based on sparse convolution. Besides, the feature maps are input into the region proposal network (RPN) to obtain the optimal proposal so that the object classification and the bounding box regression are obtained. Finally, to evaluate the flexibility of the developed framework, extensive data sets are generated through the CARLA simulator and verified on the KITTI data set and unmanned motion platform (BIT-NAZA robot), indicating that the proposed networks can achieve satisfactory performance in the real-world scenario.

Original language	English
Article number	103999
Journal	Robotics and Autonomous Systems
Volume	150
DOIs	https://doi.org/10.1016/j.robot.2021.103999
Publication status	Published - Apr 2022

Keywords

Autonomous system
Lidar point cloud
Object detection
RGB image
Sensor fusion

Access to Document

10.1016/j.robot.2021.103999

Cite this

@article{85e1ee78d1934b9f984ff0f5655d972e,

title = "Dual-view 3D object recognition and detection via Lidar point cloud and camera image",

abstract = "When it comes to the accuracy of autonomous motion, it is necessary to consider object detection and recognition, especially for the robot application of the complex environment. This paper investigates novel dual-view 3D object detection networks combined with the Lidar point cloud and RGB image in engineering scenarios. The developed system is applied for autonomous vehicles that the detected objects are cars, cyclists, and pedestrians. Firstly, a feature extraction network based on the residual module is presented, and the specific features are from the RGB image. The point cloud is transformed into Bird's Eye View (BEV), and the BEV feature extraction network is built based on sparse convolution. Besides, the feature maps are input into the region proposal network (RPN) to obtain the optimal proposal so that the object classification and the bounding box regression are obtained. Finally, to evaluate the flexibility of the developed framework, extensive data sets are generated through the CARLA simulator and verified on the KITTI data set and unmanned motion platform (BIT-NAZA robot), indicating that the proposed networks can achieve satisfactory performance in the real-world scenario.",

keywords = "Autonomous system, Lidar point cloud, Object detection, RGB image, Sensor fusion",

author = "Jing Li and Rui Li and Jiehao Li and Junzheng Wang and Qingbin Wu and Xu Liu",

note = "Publisher Copyright: {\textcopyright} 2021 Elsevier B.V.",

year = "2022",

month = apr,

doi = "10.1016/j.robot.2021.103999",

language = "English",

volume = "150",

journal = "Robotics and Autonomous Systems",

issn = "0921-8890",

publisher = "Elsevier B.V.",

}

TY - JOUR

T1 - Dual-view 3D object recognition and detection via Lidar point cloud and camera image

AU - Li, Jing

AU - Li, Rui

AU - Li, Jiehao

AU - Wang, Junzheng

AU - Wu, Qingbin

AU - Liu, Xu

PY - 2022/4

Y1 - 2022/4

N2 - When it comes to the accuracy of autonomous motion, it is necessary to consider object detection and recognition, especially for the robot application of the complex environment. This paper investigates novel dual-view 3D object detection networks combined with the Lidar point cloud and RGB image in engineering scenarios. The developed system is applied for autonomous vehicles that the detected objects are cars, cyclists, and pedestrians. Firstly, a feature extraction network based on the residual module is presented, and the specific features are from the RGB image. The point cloud is transformed into Bird's Eye View (BEV), and the BEV feature extraction network is built based on sparse convolution. Besides, the feature maps are input into the region proposal network (RPN) to obtain the optimal proposal so that the object classification and the bounding box regression are obtained. Finally, to evaluate the flexibility of the developed framework, extensive data sets are generated through the CARLA simulator and verified on the KITTI data set and unmanned motion platform (BIT-NAZA robot), indicating that the proposed networks can achieve satisfactory performance in the real-world scenario.

AB - When it comes to the accuracy of autonomous motion, it is necessary to consider object detection and recognition, especially for the robot application of the complex environment. This paper investigates novel dual-view 3D object detection networks combined with the Lidar point cloud and RGB image in engineering scenarios. The developed system is applied for autonomous vehicles that the detected objects are cars, cyclists, and pedestrians. Firstly, a feature extraction network based on the residual module is presented, and the specific features are from the RGB image. The point cloud is transformed into Bird's Eye View (BEV), and the BEV feature extraction network is built based on sparse convolution. Besides, the feature maps are input into the region proposal network (RPN) to obtain the optimal proposal so that the object classification and the bounding box regression are obtained. Finally, to evaluate the flexibility of the developed framework, extensive data sets are generated through the CARLA simulator and verified on the KITTI data set and unmanned motion platform (BIT-NAZA robot), indicating that the proposed networks can achieve satisfactory performance in the real-world scenario.

KW - Autonomous system

KW - Lidar point cloud

KW - Object detection

KW - RGB image

KW - Sensor fusion

UR - http://www.scopus.com/inward/record.url?scp=85122640498&partnerID=8YFLogxK

U2 - 10.1016/j.robot.2021.103999

DO - 10.1016/j.robot.2021.103999

M3 - Article

AN - SCOPUS:85122640498

SN - 0921-8890

VL - 150

JO - Robotics and Autonomous Systems

JF - Robotics and Autonomous Systems

M1 - 103999

ER -

Dual-view 3D object recognition and detection via Lidar point cloud and camera image

Abstract

Keywords

Access to Document

Other files and links

Fingerprint

Cite this