跳到主要导航 跳到搜索 跳到主要内容

Reinforcement Learning-Based Autonomous Collision Avoidance for Ships in Realistic Physical Environments

  • Ying Ding
  • , Weizhi Meng*
  • , Shaoming He
  • , Yu An Tan
  • , Wenjuan Li
  • *此作品的通讯作者
  • Technical University of Denmark
  • Beijing Institute of Technology
  • Hong Kong College of Technology
  • Lancaster University
  • The Education University of Hong Kong

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

The capacity of detecting and avoiding collisions among ships without human intervention is a key requirement for future cyber-ships. This work investigates a deep reinforcement learning (DRL)-based framework for multi-vessel collision avoidance, which enables autonomous collision avoidance and path planning for ships in complex maritime scenarios. The proposed method addresses key challenges such as dynamic encounter geometries, limited inter-ship communication, and adherence to COLREGs by modeling the decision process under partial observability. Three navigation strategies–velocity obstacle, deep Q-network, and recurrent LSTM-DQN–are evaluated within a unified simulation environment. Both a hexagon-based ship domain and a real-time distance-based safety metric are incorporated to assess collision risk and ensure rule compliance. The framework is validated on multiple Imazu-inspired test cases, covering diverse encounter types including head-on, overtaking, and starboard crossing. Experimental results show that the LSTM-DQN model consistently achieves safer trajectories, lower minimum distances, and higher compliance rates compared to baseline methods. These findings demonstrate the potential of our DRL-driven policies to support reliable, scalable, and regulation-aware decision-making for autonomous surface ships in a realistic environment.

源语言英语
主期刊名Algorithms and Architectures for Parallel Processing - 25th International Conference, ICA3PP 2025, Proceedings
编辑Huazhong Liu, Shadi Ibrahim, Thomas Rauber
出版商Springer Science and Business Media Deutschland GmbH
59-78
页数20
ISBN(印刷版)9789819584079
DOI
出版状态已出版 - 2026
已对外发布
活动25th International Conference on Algorithms and Architectures for Parallel Processing, ICA3PP 2025 - Zhengzhou, 中国
期限: 30 10月 20252 11月 2025

出版系列

姓名Lecture Notes in Computer Science
16384 LNCS
ISSN(印刷版)0302-9743
ISSN(电子版)1611-3349

会议

会议25th International Conference on Algorithms and Architectures for Parallel Processing, ICA3PP 2025
国家/地区中国
Zhengzhou
时期30/10/252/11/25

指纹

探究 'Reinforcement Learning-Based Autonomous Collision Avoidance for Ships in Realistic Physical Environments' 的科研主题。它们共同构成独一无二的指纹。

引用此