Abstract
Point cloud object detection is a fundamental task in various 3D perception applications such as autonomous driving and mobile robotics. However, most existing methods suffer from inaccurate 3D localization and intensive computational costs due to the inherent sparsity and irregular spatial distribution of point cloud data. This paper presents Point-HN, a point–voxel Hybrid Network that effectively balances accuracy and efficiency in 3D object detection on point clouds. Point-HN incorporates two designs that jointly model spatial structures and preserve fine-grained geometric details. The first is a Retentive Spatial Transformer (RST) that explicitly models spatial relations between points and voxels, thereby highlighting critical features around potential object regions. The second is a Geometric-Context Enhancer (GCE) that introduces a graph-based representation to capture geometric relations among key points, thereby effectively compensating for the loss of fine-grained information during voxelization with minimal overhead. Extensive experiments across multiple widely adopted autonomous driving benchmarks show that Point-HN achieves superior performance in 3D object detection consistently, underscoring its broad applicability in various real-world 3D perception tasks.
| Original language | English |
|---|---|
| Article number | 133448 |
| Journal | Neurocomputing |
| Volume | 683 |
| DOIs | |
| Publication status | Published - 28 Jun 2026 |
Keywords
- 3D object detection
- Graph learning
- Hybrid network
- Point-voxel fusion
- Spatial attention
Fingerprint
Dive into the research topics of 'Point-HN: Unified spatial-context modeling for fast and accurate 3D point cloud detection'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver