TY - JOUR
T1 - On Adaptive Edge Microservice Placement
T2 - A Reinforcement Learning Approach Endowed with Graph Comprehension
AU - Chen, Lixing
AU - Bai, Yang
AU - Zhou, Pan
AU - Li, Youqi
AU - Qu, Zhe
AU - Xu, Jie
N1 - Publisher Copyright:
© 2002-2012 IEEE.
PY - 2024
Y1 - 2024
N2 - Microservice (MS) structures a service application as a collection of independently deployable service modules, making it particularly suitable for delivering complex applications in distributed computing systems. This article investigates MS architecture over Mobile Edge Computing (MEC) networks (hereafter referred to as EdgeMS) and studies an EdgeMS placement problem that aims to deploy MS modules over the MEC network in a manner that maximizes the reward of MS application providers. A novel algorithm called Dual-GNN Deep Deterministic Policy Gradient (DG-DDPG) is proposed to establish an intelligent EdgeMS placement policy for optimizing the location of MS modules and performing fractional computing resource allocation. DG-DDPG leverages the graph neural network (GNN) to comprehend the graph-structured information encapsulated in the MS application structure and MEC network. A dual-GNN core is constructed in DG-DDPG, one GNN for MS applications to distill knowledge from intricate connections between MS modules, and the other GNN for MEC networks to capture complicated interactions between edge sites when providing EdgeMS. DG-DDPG embeds the dual-GNN core in a DDPG-based reinforcement learning framework, which not only handles temporal dependencies between EdgeMS placement decisions for maximizing long-term reward but also supports continuous action space for enabling fractional resource allocation. In particular, the learning process of DG-DDPG is tailored to address hard constraints (i.e., computing capacity and MS application completeness) in the EdgeMS placement problem. We design constraint-based regularization terms and add them to the objective of DG-DDPG, which facilitates the identification of feasible placement decisions during learning. We carry out systematic experiments to evaluate the performance of DG-DDPG, and the results show that DG-DDPG outperforms state-of-the-art benchmarks in terms of reward, service delay and deployment cost.
AB - Microservice (MS) structures a service application as a collection of independently deployable service modules, making it particularly suitable for delivering complex applications in distributed computing systems. This article investigates MS architecture over Mobile Edge Computing (MEC) networks (hereafter referred to as EdgeMS) and studies an EdgeMS placement problem that aims to deploy MS modules over the MEC network in a manner that maximizes the reward of MS application providers. A novel algorithm called Dual-GNN Deep Deterministic Policy Gradient (DG-DDPG) is proposed to establish an intelligent EdgeMS placement policy for optimizing the location of MS modules and performing fractional computing resource allocation. DG-DDPG leverages the graph neural network (GNN) to comprehend the graph-structured information encapsulated in the MS application structure and MEC network. A dual-GNN core is constructed in DG-DDPG, one GNN for MS applications to distill knowledge from intricate connections between MS modules, and the other GNN for MEC networks to capture complicated interactions between edge sites when providing EdgeMS. DG-DDPG embeds the dual-GNN core in a DDPG-based reinforcement learning framework, which not only handles temporal dependencies between EdgeMS placement decisions for maximizing long-term reward but also supports continuous action space for enabling fractional resource allocation. In particular, the learning process of DG-DDPG is tailored to address hard constraints (i.e., computing capacity and MS application completeness) in the EdgeMS placement problem. We design constraint-based regularization terms and add them to the objective of DG-DDPG, which facilitates the identification of feasible placement decisions during learning. We carry out systematic experiments to evaluate the performance of DG-DDPG, and the results show that DG-DDPG outperforms state-of-the-art benchmarks in terms of reward, service delay and deployment cost.
KW - Mobile edge computing
KW - graph neural network
KW - microservices
KW - reinforcement learning
UR - http://www.scopus.com/inward/record.url?scp=85192154455&partnerID=8YFLogxK
U2 - 10.1109/TMC.2024.3396510
DO - 10.1109/TMC.2024.3396510
M3 - Article
AN - SCOPUS:85192154455
SN - 1536-1233
VL - 23
SP - 11144
EP - 11158
JO - IEEE Transactions on Mobile Computing
JF - IEEE Transactions on Mobile Computing
IS - 12
ER -