TY - GEN
T1 - Multimodal Reinforcement Learning Aided Dynamic Service Function Chain Deployment in Satellite-Terrestrial Network
AU - Li, Yuanfeng
AU - Zhang, Qi
AU - Yao, Haipeng
AU - Xin, Xiangjun
AU - Ran, Gao
AU - Wang, Fu
N1 - Publisher Copyright:
© 2025 IEEE.
PY - 2025
Y1 - 2025
N2 - In recent years, Satellite-Terrestrial Networks (STNs) have garnered significant attention for extending network coverage to areas beyond the reach of traditional terrestrial networks. With the rapid expansion of STN applications, integrating Service Function Chaining (SFC) technology has become crucial for delivering differentiated services. However, the dynamic and complex structure of STNs presents significant challenges for SFC deployment. To address these, we propose a multimodal reinforcement learning algorithm that uses separate neural networks to process diverse STN data, enabling more effective SFC deployment decisions. Our approach includes a Graph Transformer for processing network states represented as graphs, capturing the relationships between nodes, links, and resource distributions. Additionally, two MLPs are used to handle QoS requests and global network information. Built on these components, the Proximal Policy Optimization (PPO)-based algorithm demonstrates superior performance over conventional AI methods, effectively learning optimal SFC deployment strategies.
AB - In recent years, Satellite-Terrestrial Networks (STNs) have garnered significant attention for extending network coverage to areas beyond the reach of traditional terrestrial networks. With the rapid expansion of STN applications, integrating Service Function Chaining (SFC) technology has become crucial for delivering differentiated services. However, the dynamic and complex structure of STNs presents significant challenges for SFC deployment. To address these, we propose a multimodal reinforcement learning algorithm that uses separate neural networks to process diverse STN data, enabling more effective SFC deployment decisions. Our approach includes a Graph Transformer for processing network states represented as graphs, capturing the relationships between nodes, links, and resource distributions. Additionally, two MLPs are used to handle QoS requests and global network information. Built on these components, the Proximal Policy Optimization (PPO)-based algorithm demonstrates superior performance over conventional AI methods, effectively learning optimal SFC deployment strategies.
KW - Multimodal reinforcement learning
KW - Satellite-terrestrial network
KW - Service Function Chaining
KW - Virtual network functions
UR - https://www.scopus.com/pages/publications/105011348292
U2 - 10.1109/IWCMC65282.2025.11059479
DO - 10.1109/IWCMC65282.2025.11059479
M3 - Conference contribution
AN - SCOPUS:105011348292
T3 - 21st International Wireless Communications and Mobile Computing Conference, IWCMC 2025
SP - 379
EP - 385
BT - 21st International Wireless Communications and Mobile Computing Conference, IWCMC 2025
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 21st IEEE International Wireless Communications and Mobile Computing Conference, IWCMC 2025
Y2 - 12 May 2024 through 16 May 2024
ER -