Multi-agent trajectory prediction with Hierarchical Coordinate-Based Representation

Research output: Contribution to journalArticlepeer-review

Abstract

Trajectory prediction plays a crucial role in autonomous driving systems. Existing methods typically adopt agent-centric or scene-centric approaches to model driving scenarios. However, these approaches often lead to excessive redundant computations or pose estimation errors, which degrade both prediction efficiency and accuracy. To address these issues, we propose a novel multi-agent trajectory prediction model called Hierarchical Coordinate-Based Representation. This model decomposes the driving environment into two distinct components: global and local. In the global component, the interaction information is established and shared between all predicted agents, helping to reduce redundant computations. In the local component, each vehicle is assigned an individual reference frame, which mitigates the impact of pose variations and facilitates the extraction of temporal features. Furthermore, we introduce an adaptive anchor point generation method to tackle the challenge of capturing future trajectories across different driving scenarios. This method dynamically generates anchor points that are tailored to each specific scenario, guiding the prediction of trajectories for various modalities. The performance of the proposed model is evaluated on the Argoverse 1 and Argoverse 2 datasets. Experimental results demonstrate that Hierarchical Coordinate-Based Representation achieves competitive performance in terms of both efficiency and accuracy, outperforming state-of-the-art methods.

Original languageEnglish
Article number113335
JournalEngineering Applications of Artificial Intelligence
Volume165
DOIs
Publication statusPublished - 1 Feb 2026

Keywords

  • Attention mechanism
  • Online mapping
  • Spatio-temporal feature fusion
  • Trajectory prediction

Fingerprint

Dive into the research topics of 'Multi-agent trajectory prediction with Hierarchical Coordinate-Based Representation'. Together they form a unique fingerprint.

Cite this