Abstract
The 4-D radar sensing provides rich measurements across Doppler, range, azimuth, and elevation dimensions, offering strong resilience in adverse environmental conditions. However, reconstructing accurate 3-D occupancy maps from 4-D radar tensors (4DRT) remains challenging due to low spatial resolution, nonuniform sampling, and artifacts caused by sidelobes and multipath reflections. This article proposes a diffusion-based framework for 3-D occupancy reconstruction that directly leverages the encoded structure of 4DRT data. The pipeline consists of three components: a feature-preserving dimensionality reduction module that produces Doppler-aware sparse descriptors from raw 4DRT; a hierarchical pillar-based representation that encodes vertical geometry using normalized height segments within a compact and structured spatial format; and a conditional diffusion model that iteratively denoises latent occupancy predictions. To encode the sparse and irregular 4DRT inputs, we design a hybrid condition encoder that combines convolutional layers with self-attention to extract both local and global contextual features. These are injected into a U-Net-based denoising network via cross-attention to generate dense and spatially consistent 3-D occupancy volumes. Extensive experiments on the Coloradar benchmark and real-world driving data demonstrate that the proposed method consistently outperforms the state-of-the-art baseline, improving intersection over union (IoU) from at most 3.3% to over 30% across diverse scenes, while reducing the Chamfer distance (CD) by more than 4× and maintaining a single-frame inference latency of approximately 50 ms.
| Original language | English |
|---|---|
| Pages (from-to) | 19125-19140 |
| Number of pages | 16 |
| Journal | IEEE Internet of Things Journal |
| Volume | 13 |
| Issue number | 9 |
| DOIs | |
| Publication status | Published - 1 May 2026 |
| Externally published | Yes |
Keywords
- 3-D occupancy mapping
- 4-D radar tensor (4DRT)
- autonomous driving
- diffusion models
- robotics perception
Fingerprint
Dive into the research topics of 'Diffusion-Based Reconstruction of 3-D Occupancy Maps From 4-D Radar Tensors'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver