TY - JOUR
T1 - Dynamic Routing for Integrated Satellite-Terrestrial Networks
T2 - A Constrained Multi-Agent Reinforcement Learning Approach
AU - Lyu, Yifeng
AU - Hu, Han
AU - Fan, Rongfei
AU - Liu, Zhi
AU - An, Jianping
AU - Mao, Shiwen
N1 - Publisher Copyright:
© 1983-2012 IEEE.
PY - 2024/5/1
Y1 - 2024/5/1
N2 - The integrated satellite-terrestrial network (ISTN) system has experienced significant growth, offering seamless communication services in remote areas with limited terrestrial infrastructure. However, designing a routing scheme for ISTN is exceedingly difficult, primarily due to the heightened complexity resulting from the inclusion of additional ground stations, along with the requirement to satisfy various constraints related to satellite service quality. To address these challenges, we study packet routing with ground stations and satellites working jointly to transmit packets, while prioritizing fast communication and meeting energy efficiency and packet loss requirements. Specifically, we formulate the problem of packet routing with constraints as a max-min problem using the Lagrange method. Then we propose a novel constrained Multi-Agent reinforcement learning (MARL) dynamic routing algorithm named CMADR, which efficiently balances objective improvement and constraint satisfaction during the updating of policy and Lagrange multipliers. Finally, we conduct extensive experiments and an ablation study using the OneWeb and Telesat mega-constellations. Results demonstrate that CMADR reduces the packet delay by a minimum of 21% and 15%, while meeting stringent energy consumption and packet loss rate constraints, outperforming several baseline algorithms.
AB - The integrated satellite-terrestrial network (ISTN) system has experienced significant growth, offering seamless communication services in remote areas with limited terrestrial infrastructure. However, designing a routing scheme for ISTN is exceedingly difficult, primarily due to the heightened complexity resulting from the inclusion of additional ground stations, along with the requirement to satisfy various constraints related to satellite service quality. To address these challenges, we study packet routing with ground stations and satellites working jointly to transmit packets, while prioritizing fast communication and meeting energy efficiency and packet loss requirements. Specifically, we formulate the problem of packet routing with constraints as a max-min problem using the Lagrange method. Then we propose a novel constrained Multi-Agent reinforcement learning (MARL) dynamic routing algorithm named CMADR, which efficiently balances objective improvement and constraint satisfaction during the updating of policy and Lagrange multipliers. Finally, we conduct extensive experiments and an ablation study using the OneWeb and Telesat mega-constellations. Results demonstrate that CMADR reduces the packet delay by a minimum of 21% and 15%, while meeting stringent energy consumption and packet loss rate constraints, outperforming several baseline algorithms.
KW - Integrated satellite-terrestrial networks
KW - constrained multi-agent reinforcement learning
KW - dynamic routing algorithm
KW - end-to-end delay
UR - http://www.scopus.com/inward/record.url?scp=85187308085&partnerID=8YFLogxK
U2 - 10.1109/JSAC.2024.3365869
DO - 10.1109/JSAC.2024.3365869
M3 - Article
AN - SCOPUS:85187308085
SN - 0733-8716
VL - 42
SP - 1204
EP - 1218
JO - IEEE Journal on Selected Areas in Communications
JF - IEEE Journal on Selected Areas in Communications
IS - 5
ER -