TY - JOUR
T1 - CLORP
T2 - Cross-Layer Opportunistic Routing Protocol for Underwater Sensor Networks Based on Multi-Agent Reinforcement Learning
AU - Liu, Shuai
AU - Wang, Jingjing
AU - Shi, Wei
AU - Han, Guangjie
AU - Yan, Shefeng
AU - Li, Jiaheng
N1 - Publisher Copyright:
IEEE
PY - 2024
Y1 - 2024
N2 - With the development of the Internet of Underwater Things (IoUT), both academia and industry have significant emphasized underwater wireless sensor networks (UWSNs). To address the issues of slow convergence, high latency, and limited energy in existing intelligent routing protocols in UWSNs, a cross-layer opportunistic routing protocol (CLORP) for underwater sensor networks based on multi-agent reinforcement learning (MARL) is proposed in this paper. First, CLORP combines the decision-making capability of multi-agent reinforcement learning with the idea of opportunistic routing to sequentially select a set of neighbors with larger values as potential forwarding nodes, thereby increasing the packet transmission success rate. Second, in the design of the MARL reward function, two reward functions for successful and unsuccessful packet transmission are designed jointly with cross-layer information to improve the routing protocol’s performance. Finally, two algorithmic optimization strategies, adaptive learning rate and Q-value initialization based on location and number of neighbors, are proposed to facilitate the faster adaptation of agents to the dynamic changes of network topology and accelerate CLORP convergence. The experimental results demonstrate that CLORP can increase algorithm convergence speed by 13.2%, reduce network energy consumption by 25%, and decrease network latency by 31.2%.
AB - With the development of the Internet of Underwater Things (IoUT), both academia and industry have significant emphasized underwater wireless sensor networks (UWSNs). To address the issues of slow convergence, high latency, and limited energy in existing intelligent routing protocols in UWSNs, a cross-layer opportunistic routing protocol (CLORP) for underwater sensor networks based on multi-agent reinforcement learning (MARL) is proposed in this paper. First, CLORP combines the decision-making capability of multi-agent reinforcement learning with the idea of opportunistic routing to sequentially select a set of neighbors with larger values as potential forwarding nodes, thereby increasing the packet transmission success rate. Second, in the design of the MARL reward function, two reward functions for successful and unsuccessful packet transmission are designed jointly with cross-layer information to improve the routing protocol’s performance. Finally, two algorithmic optimization strategies, adaptive learning rate and Q-value initialization based on location and number of neighbors, are proposed to facilitate the faster adaptation of agents to the dynamic changes of network topology and accelerate CLORP convergence. The experimental results demonstrate that CLORP can increase algorithm convergence speed by 13.2%, reduce network energy consumption by 25%, and decrease network latency by 31.2%.
KW - Cross-Layer Information
KW - Internet of Underwater Things
KW - Multi-Agent Reinforcement Learning
KW - Network topology
KW - Opportunistic Routing
KW - Reinforcement learning
KW - Routing
KW - Routing protocols
KW - Sensors
KW - Topology
KW - Wireless sensor networks
UR - http://www.scopus.com/inward/record.url?scp=85189772039&partnerID=8YFLogxK
U2 - 10.1109/JSEN.2024.3383035
DO - 10.1109/JSEN.2024.3383035
M3 - Article
AN - SCOPUS:85189772039
SN - 1530-437X
SP - 1
JO - IEEE Sensors Journal
JF - IEEE Sensors Journal
ER -