TY - GEN
T1 - Reinforcement Learning-Based Autonomous Collision Avoidance for Ships in Realistic Physical Environments
AU - Ding, Ying
AU - Meng, Weizhi
AU - He, Shaoming
AU - Tan, Yu An
AU - Li, Wenjuan
N1 - Publisher Copyright:
© The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2026.
PY - 2026
Y1 - 2026
N2 - The capacity of detecting and avoiding collisions among ships without human intervention is a key requirement for future cyber-ships. This work investigates a deep reinforcement learning (DRL)-based framework for multi-vessel collision avoidance, which enables autonomous collision avoidance and path planning for ships in complex maritime scenarios. The proposed method addresses key challenges such as dynamic encounter geometries, limited inter-ship communication, and adherence to COLREGs by modeling the decision process under partial observability. Three navigation strategies–velocity obstacle, deep Q-network, and recurrent LSTM-DQN–are evaluated within a unified simulation environment. Both a hexagon-based ship domain and a real-time distance-based safety metric are incorporated to assess collision risk and ensure rule compliance. The framework is validated on multiple Imazu-inspired test cases, covering diverse encounter types including head-on, overtaking, and starboard crossing. Experimental results show that the LSTM-DQN model consistently achieves safer trajectories, lower minimum distances, and higher compliance rates compared to baseline methods. These findings demonstrate the potential of our DRL-driven policies to support reliable, scalable, and regulation-aware decision-making for autonomous surface ships in a realistic environment.
AB - The capacity of detecting and avoiding collisions among ships without human intervention is a key requirement for future cyber-ships. This work investigates a deep reinforcement learning (DRL)-based framework for multi-vessel collision avoidance, which enables autonomous collision avoidance and path planning for ships in complex maritime scenarios. The proposed method addresses key challenges such as dynamic encounter geometries, limited inter-ship communication, and adherence to COLREGs by modeling the decision process under partial observability. Three navigation strategies–velocity obstacle, deep Q-network, and recurrent LSTM-DQN–are evaluated within a unified simulation environment. Both a hexagon-based ship domain and a real-time distance-based safety metric are incorporated to assess collision risk and ensure rule compliance. The framework is validated on multiple Imazu-inspired test cases, covering diverse encounter types including head-on, overtaking, and starboard crossing. Experimental results show that the LSTM-DQN model consistently achieves safer trajectories, lower minimum distances, and higher compliance rates compared to baseline methods. These findings demonstrate the potential of our DRL-driven policies to support reliable, scalable, and regulation-aware decision-making for autonomous surface ships in a realistic environment.
KW - Collision avoidance
KW - Cyber Ship
KW - Data security
KW - Deep reinforcement learning
KW - Maritime environment
KW - Maritime safety
UR - https://www.scopus.com/pages/publications/105037576571
U2 - 10.1007/978-981-95-8408-6_4
DO - 10.1007/978-981-95-8408-6_4
M3 - Conference contribution
AN - SCOPUS:105037576571
SN - 9789819584079
T3 - Lecture Notes in Computer Science
SP - 59
EP - 78
BT - Algorithms and Architectures for Parallel Processing - 25th International Conference, ICA3PP 2025, Proceedings
A2 - Liu, Huazhong
A2 - Ibrahim, Shadi
A2 - Rauber, Thomas
PB - Springer Science and Business Media Deutschland GmbH
T2 - 25th International Conference on Algorithms and Architectures for Parallel Processing, ICA3PP 2025
Y2 - 30 October 2025 through 2 November 2025
ER -