Skip to main navigation Skip to search Skip to main content

Diffusion-Based Reconstruction of 3-D Occupancy Maps From 4-D Radar Tensors

  • Fan Yang
  • , Xueyuan Li*
  • , Minggang Du
  • , Yutong Jiang
  • , Fandong Qiao
  • , Zhi Niu
  • *Corresponding author for this work
  • Beijing Institute of Technology
  • Xinhuo Autonomous Vehicle Laboratory

Research output: Contribution to journalArticlepeer-review

Abstract

The 4-D radar sensing provides rich measurements across Doppler, range, azimuth, and elevation dimensions, offering strong resilience in adverse environmental conditions. However, reconstructing accurate 3-D occupancy maps from 4-D radar tensors (4DRT) remains challenging due to low spatial resolution, nonuniform sampling, and artifacts caused by sidelobes and multipath reflections. This article proposes a diffusion-based framework for 3-D occupancy reconstruction that directly leverages the encoded structure of 4DRT data. The pipeline consists of three components: a feature-preserving dimensionality reduction module that produces Doppler-aware sparse descriptors from raw 4DRT; a hierarchical pillar-based representation that encodes vertical geometry using normalized height segments within a compact and structured spatial format; and a conditional diffusion model that iteratively denoises latent occupancy predictions. To encode the sparse and irregular 4DRT inputs, we design a hybrid condition encoder that combines convolutional layers with self-attention to extract both local and global contextual features. These are injected into a U-Net-based denoising network via cross-attention to generate dense and spatially consistent 3-D occupancy volumes. Extensive experiments on the Coloradar benchmark and real-world driving data demonstrate that the proposed method consistently outperforms the state-of-the-art baseline, improving intersection over union (IoU) from at most 3.3% to over 30% across diverse scenes, while reducing the Chamfer distance (CD) by more than 4× and maintaining a single-frame inference latency of approximately 50 ms.

Original languageEnglish
Pages (from-to)19125-19140
Number of pages16
JournalIEEE Internet of Things Journal
Volume13
Issue number9
DOIs
Publication statusPublished - 1 May 2026
Externally publishedYes

Keywords

  • 3-D occupancy mapping
  • 4-D radar tensor (4DRT)
  • autonomous driving
  • diffusion models
  • robotics perception

Fingerprint

Dive into the research topics of 'Diffusion-Based Reconstruction of 3-D Occupancy Maps From 4-D Radar Tensors'. Together they form a unique fingerprint.

Cite this