Dynamic Routing for Integrated Satellite-Terrestrial Networks: A Constrained Multi-Agent Reinforcement Learning Approach

Yifeng Lyu; Han Hu; Rongfei Fan; Zhi Liu; Jianping An; Shiwen Mao

doi:10.1109/JSAC.2024.3365869

Dynamic Routing for Integrated Satellite-Terrestrial Networks: A Constrained Multi-Agent Reinforcement Learning Approach

Yifeng Lyu, Han Hu^*, Rongfei Fan, Zhi Liu, Jianping An, Shiwen Mao

^*Corresponding author for this work

Research output: Contribution to journal › Article › peer-review

6 Citations (Scopus)

Abstract

The integrated satellite-terrestrial network (ISTN) system has experienced significant growth, offering seamless communication services in remote areas with limited terrestrial infrastructure. However, designing a routing scheme for ISTN is exceedingly difficult, primarily due to the heightened complexity resulting from the inclusion of additional ground stations, along with the requirement to satisfy various constraints related to satellite service quality. To address these challenges, we study packet routing with ground stations and satellites working jointly to transmit packets, while prioritizing fast communication and meeting energy efficiency and packet loss requirements. Specifically, we formulate the problem of packet routing with constraints as a max-min problem using the Lagrange method. Then we propose a novel constrained Multi-Agent reinforcement learning (MARL) dynamic routing algorithm named CMADR, which efficiently balances objective improvement and constraint satisfaction during the updating of policy and Lagrange multipliers. Finally, we conduct extensive experiments and an ablation study using the OneWeb and Telesat mega-constellations. Results demonstrate that CMADR reduces the packet delay by a minimum of 21% and 15%, while meeting stringent energy consumption and packet loss rate constraints, outperforming several baseline algorithms.

Original language	English
Pages (from-to)	1204-1218
Number of pages	15
Journal	IEEE Journal on Selected Areas in Communications
Volume	42
Issue number	5
DOIs	https://doi.org/10.1109/JSAC.2024.3365869
Publication status	Published - 1 May 2024

Keywords

Integrated satellite-terrestrial networks
constrained multi-agent reinforcement learning
dynamic routing algorithm
end-to-end delay

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

Access to Document

10.1109/JSAC.2024.3365869

Cite this

@article{e0994dd4322743eda04531dbcd583baa,

title = "Dynamic Routing for Integrated Satellite-Terrestrial Networks: A Constrained Multi-Agent Reinforcement Learning Approach",

abstract = "The integrated satellite-terrestrial network (ISTN) system has experienced significant growth, offering seamless communication services in remote areas with limited terrestrial infrastructure. However, designing a routing scheme for ISTN is exceedingly difficult, primarily due to the heightened complexity resulting from the inclusion of additional ground stations, along with the requirement to satisfy various constraints related to satellite service quality. To address these challenges, we study packet routing with ground stations and satellites working jointly to transmit packets, while prioritizing fast communication and meeting energy efficiency and packet loss requirements. Specifically, we formulate the problem of packet routing with constraints as a max-min problem using the Lagrange method. Then we propose a novel constrained Multi-Agent reinforcement learning (MARL) dynamic routing algorithm named CMADR, which efficiently balances objective improvement and constraint satisfaction during the updating of policy and Lagrange multipliers. Finally, we conduct extensive experiments and an ablation study using the OneWeb and Telesat mega-constellations. Results demonstrate that CMADR reduces the packet delay by a minimum of 21% and 15%, while meeting stringent energy consumption and packet loss rate constraints, outperforming several baseline algorithms.",

keywords = "Integrated satellite-terrestrial networks, constrained multi-agent reinforcement learning, dynamic routing algorithm, end-to-end delay",

author = "Yifeng Lyu and Han Hu and Rongfei Fan and Zhi Liu and Jianping An and Shiwen Mao",

note = "Publisher Copyright: {\textcopyright} 1983-2012 IEEE.",

year = "2024",

month = may,

day = "1",

doi = "10.1109/JSAC.2024.3365869",

language = "English",

volume = "42",

pages = "1204--1218",

journal = "IEEE Journal on Selected Areas in Communications",

issn = "0733-8716",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

number = "5",

}

TY - JOUR

T1 - Dynamic Routing for Integrated Satellite-Terrestrial Networks

T2 - A Constrained Multi-Agent Reinforcement Learning Approach

AU - Lyu, Yifeng

AU - Hu, Han

AU - Fan, Rongfei

AU - Liu, Zhi

AU - An, Jianping

AU - Mao, Shiwen

PY - 2024/5/1

Y1 - 2024/5/1

N2 - The integrated satellite-terrestrial network (ISTN) system has experienced significant growth, offering seamless communication services in remote areas with limited terrestrial infrastructure. However, designing a routing scheme for ISTN is exceedingly difficult, primarily due to the heightened complexity resulting from the inclusion of additional ground stations, along with the requirement to satisfy various constraints related to satellite service quality. To address these challenges, we study packet routing with ground stations and satellites working jointly to transmit packets, while prioritizing fast communication and meeting energy efficiency and packet loss requirements. Specifically, we formulate the problem of packet routing with constraints as a max-min problem using the Lagrange method. Then we propose a novel constrained Multi-Agent reinforcement learning (MARL) dynamic routing algorithm named CMADR, which efficiently balances objective improvement and constraint satisfaction during the updating of policy and Lagrange multipliers. Finally, we conduct extensive experiments and an ablation study using the OneWeb and Telesat mega-constellations. Results demonstrate that CMADR reduces the packet delay by a minimum of 21% and 15%, while meeting stringent energy consumption and packet loss rate constraints, outperforming several baseline algorithms.

AB - The integrated satellite-terrestrial network (ISTN) system has experienced significant growth, offering seamless communication services in remote areas with limited terrestrial infrastructure. However, designing a routing scheme for ISTN is exceedingly difficult, primarily due to the heightened complexity resulting from the inclusion of additional ground stations, along with the requirement to satisfy various constraints related to satellite service quality. To address these challenges, we study packet routing with ground stations and satellites working jointly to transmit packets, while prioritizing fast communication and meeting energy efficiency and packet loss requirements. Specifically, we formulate the problem of packet routing with constraints as a max-min problem using the Lagrange method. Then we propose a novel constrained Multi-Agent reinforcement learning (MARL) dynamic routing algorithm named CMADR, which efficiently balances objective improvement and constraint satisfaction during the updating of policy and Lagrange multipliers. Finally, we conduct extensive experiments and an ablation study using the OneWeb and Telesat mega-constellations. Results demonstrate that CMADR reduces the packet delay by a minimum of 21% and 15%, while meeting stringent energy consumption and packet loss rate constraints, outperforming several baseline algorithms.

KW - Integrated satellite-terrestrial networks

KW - constrained multi-agent reinforcement learning

KW - dynamic routing algorithm

KW - end-to-end delay

UR - http://www.scopus.com/inward/record.url?scp=85187308085&partnerID=8YFLogxK

U2 - 10.1109/JSAC.2024.3365869

DO - 10.1109/JSAC.2024.3365869

M3 - Article

AN - SCOPUS:85187308085

SN - 0733-8716

VL - 42

SP - 1204

EP - 1218

JO - IEEE Journal on Selected Areas in Communications

JF - IEEE Journal on Selected Areas in Communications

IS - 5

ER -

Dynamic Routing for Integrated Satellite-Terrestrial Networks: A Constrained Multi-Agent Reinforcement Learning Approach

Abstract

Keywords

UN SDGs

Access to Document

Other files and links

Fingerprint

Cite this