跳到主要导航 跳到搜索 跳到主要内容

Point-HN: Unified spatial-context modeling for fast and accurate 3D point cloud detection

  • Haowei Yang
  • , Yuantao Wang
  • , Yaqian Ning*
  • , Shijian Lu
  • , Yin Zhuang
  • , Xuerui Mao
  • , Wei Zhang*
  • *此作品的通讯作者
  • Beijing University of Posts and Telecommunications
  • North China University of Technology
  • Beijing Jianhua Experimental School
  • Nanyang Technological University
  • Beijing Institute of Technology

科研成果: 期刊稿件文章同行评审

摘要

Point cloud object detection is a fundamental task in various 3D perception applications such as autonomous driving and mobile robotics. However, most existing methods suffer from inaccurate 3D localization and intensive computational costs due to the inherent sparsity and irregular spatial distribution of point cloud data. This paper presents Point-HN, a point–voxel Hybrid Network that effectively balances accuracy and efficiency in 3D object detection on point clouds. Point-HN incorporates two designs that jointly model spatial structures and preserve fine-grained geometric details. The first is a Retentive Spatial Transformer (RST) that explicitly models spatial relations between points and voxels, thereby highlighting critical features around potential object regions. The second is a Geometric-Context Enhancer (GCE) that introduces a graph-based representation to capture geometric relations among key points, thereby effectively compensating for the loss of fine-grained information during voxelization with minimal overhead. Extensive experiments across multiple widely adopted autonomous driving benchmarks show that Point-HN achieves superior performance in 3D object detection consistently, underscoring its broad applicability in various real-world 3D perception tasks.

源语言英语
文章编号133448
期刊Neurocomputing
683
DOI
出版状态已出版 - 28 6月 2026

指纹

探究 'Point-HN: Unified spatial-context modeling for fast and accurate 3D point cloud detection' 的科研主题。它们共同构成独一无二的指纹。

引用此