SGV3D: Toward Scenario Generalization for Vision-Based Roadside 3D Object Detection

  • Lei Yang
  • , Xinyu Zhang*
  • , Jun Li
  • , Li Wang*
  • , Chuang Zhang
  • , Li Ju
  • , Zhiwei Li
  • , Yang Shen
  • , Chen Lv
  • , Hong Wang
  • *Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

2 Citations (Scopus)

Abstract

Roadside perception can significantly enhance the safety of autonomous vehicles by extending their perceptual capabilities beyond the visual range and addressing occluded regions. However, current state-of-the-art vision-based roadside detection methods exhibit high accuracy on labeled scenes but perform poorly on new scenes. This limitation arises because roadside cameras remain stationary after installation and can only gather data from a single scene, leading the algorithm to overfit these roadside backgrounds and camera positions. To tackle this issue, we propose an innovative Scenario Generalization Framework for Vision-based Roadside 3D Object Detection, called SGV3D. Specifically, we utilize a Background-suppressed Module (BSM) to reduce background overfitting in vision-centric pipelines by diminishing background features during the 2D to bird's-eye-view projection. Furthermore, by introducing the Semi-supervised Data Generation Pipeline (SSDG) that employs unlabeled images from new scenes, we generate diverse foreground instances with varying camera poses, mitigating the risk of overfitting to specific camera positions. Experiments conducted on two large-scale roadside benchmarks demonstrate that SGV3D, with only a minimal increase in latency, effectively improves the scenario generalization capabilities of vision-based roadside 3D object detectors. The code is available here (https://github.com/yanglei18/SGV3D).

Original languageEnglish
Pages (from-to)11782-11793
Number of pages12
JournalIEEE Transactions on Intelligent Transportation Systems
Volume26
Issue number8
DOIs
Publication statusPublished - 2025
Externally publishedYes

Keywords

  • Scenario generalization
  • autonomous driving
  • roadside perception
  • vision-based 3D object detection

Fingerprint

Dive into the research topics of 'SGV3D: Toward Scenario Generalization for Vision-Based Roadside 3D Object Detection'. Together they form a unique fingerprint.

Cite this