TY - GEN
T1 - Multimodal Trajectory Prediction with Intention Recognition and Learnable Path Queries
AU - Yang, Jialong
AU - Sun, Zhongqi
AU - He, Zhen
AU - Du, Changkun
AU - Xia, Yuanqing
N1 - Publisher Copyright:
© 2025 Technical Committee on Control Theory, Chinese Association of Automation.
PY - 2025
Y1 - 2025
N2 - Trajectory prediction is a critical component of autonomous driving systems. However, it is a highly challenging task due to the complexity of environmental contexts and the multimodal nature of behavioral patterns. To address this issue, we propose an LSTM-based trajectory prediction framework that adopts an encoder-decoder structure. In the encoder, we employ feature-wise and agent-wise attention mechanisms to enhance the LSTM's ability to extract contextual information. Moreover, unlike traditional methods that decode multimodal trajectories using a one-to-many framework based on the features output by the encoder, we propose a novel many-to-many trajectory decoding framework which guides the model in generating diverse trajectories by combining different lane change intentions with learnable path queries. Compared to conventional approaches, our framework provides stronger interpretability, enabling a deeper understanding of the vehicle's complex multimodal behaviors. We conducted experimental validation on the highD dataset, and the results demonstrate that our multimodal trajectory prediction framework outperforms the baseline in terms of prediction accuracy. The open-source code can be found at https://github.com/Yang-JL-bit/trajectory-prediction-with-intention-and-path-queries.git.
AB - Trajectory prediction is a critical component of autonomous driving systems. However, it is a highly challenging task due to the complexity of environmental contexts and the multimodal nature of behavioral patterns. To address this issue, we propose an LSTM-based trajectory prediction framework that adopts an encoder-decoder structure. In the encoder, we employ feature-wise and agent-wise attention mechanisms to enhance the LSTM's ability to extract contextual information. Moreover, unlike traditional methods that decode multimodal trajectories using a one-to-many framework based on the features output by the encoder, we propose a novel many-to-many trajectory decoding framework which guides the model in generating diverse trajectories by combining different lane change intentions with learnable path queries. Compared to conventional approaches, our framework provides stronger interpretability, enabling a deeper understanding of the vehicle's complex multimodal behaviors. We conducted experimental validation on the highD dataset, and the results demonstrate that our multimodal trajectory prediction framework outperforms the baseline in terms of prediction accuracy. The open-source code can be found at https://github.com/Yang-JL-bit/trajectory-prediction-with-intention-and-path-queries.git.
KW - Intention Recognition
KW - Learnable Path Queries
KW - LSTM
KW - Multimodal Trajectory Prediction
UR - https://www.scopus.com/pages/publications/105020292798
U2 - 10.23919/CCC64809.2025.11179791
DO - 10.23919/CCC64809.2025.11179791
M3 - Conference contribution
AN - SCOPUS:105020292798
T3 - Chinese Control Conference, CCC
SP - 9137
EP - 9142
BT - Proceedings of the 44th Chinese Control Conference, CCC 2025
A2 - Sun, Jian
A2 - Yin, Hongpeng
PB - IEEE Computer Society
T2 - 44th Chinese Control Conference, CCC 2025
Y2 - 28 July 2025 through 30 July 2025
ER -