PillarID: Rethinking Backbone Network Designs for Pillar-Based 3D Object Detection in Infrastructure Point Cloud

  • Zhang Zhang
  • , Chao Sun*
  • , Bo Wang
  • , Da Wen
  • *Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract

In recent years, vehicle-centric point cloud 3D object detection has been widely explored and effectively developed. However, due to differences in the placement of sensors, infrastructure-centric point cloud 3D object detection, which is an important component of the Intelligent Transportation System (ITS), has not received sufficient attention as well as effective network architecture design. Based on the difference in perspective of the infrastructure point cloud, We discover that the roadside point cloud is denser and with a higher coverage compared to the vehicle-side in the pillar representation, resulting in a narrowing of the performance difference between dense pillar and sparse pillar backbone networks in roadside scenes. Inspired by this insight, a network based on the dense backbone is proposed, dubbed PillarID. It utilizes Single-stride Cross-stage Dense-backbone (SCD) to obtains efficient computation through channel degradation, split, and cross-stage connection, and benefits from the rich context of the roadside point cloud based on single-stride. Further, Hierarchical Receptive-field Expansion (HRE) are used to address the receptive field constraints of single-stride backbone. Extensive experiments reveal that our PillarID achieves effective designs in terms of architecture and renders the state-of-the-art performance on the popular large-scale roadside benchmark: DAIR-V2X-I and RCooper.

Original languageEnglish
Pages (from-to)232-240
Number of pages9
JournalIEEE Transactions on Intelligent Transportation Systems
Volume27
Issue number1
DOIs
Publication statusPublished - 2026

Keywords

  • Point cloud
  • infrastructure
  • object detection

Fingerprint

Dive into the research topics of 'PillarID: Rethinking Backbone Network Designs for Pillar-Based 3D Object Detection in Infrastructure Point Cloud'. Together they form a unique fingerprint.

Cite this