TY - JOUR
T1 - A Deep Reinforcement Learning Based Offloading Game in Edge Computing
AU - Zhan, Yufeng
AU - Guo, Song
AU - Li, Peng
AU - Zhang, Jiang
N1 - Publisher Copyright:
© 1968-2012 IEEE.
PY - 2020/6/1
Y1 - 2020/6/1
N2 - Edge computing is a new paradigm to provide strong computing capability at the edge of pervasive radio access networks close to users. A critical research challenge of edge computing is to design an efficient offloading strategy to decide which tasks can be offloaded to edge servers with limited resources. Although many research efforts attempt to address this challenge, they need centralized control, which is not practical because users are rational individuals with interests to maximize their benefits. In this article, we study to design a decentralized algorithm for computation offloading, so that users can independently choose their offloading decisions. Game theory has been applied in the algorithm design. Different from existing work, we address the challenge that users may refuse to expose their information about network bandwidth and preference. Therefore, it requires that our solution should make the offloading decision without such knowledge. We formulate the problem as a partially observable Markov decision process (POMDP), which is solved by a policy gradient deep reinforcement learning (DRL) based approach. Extensive simulation results show that our proposal significantly outperforms existing solutions.
AB - Edge computing is a new paradigm to provide strong computing capability at the edge of pervasive radio access networks close to users. A critical research challenge of edge computing is to design an efficient offloading strategy to decide which tasks can be offloaded to edge servers with limited resources. Although many research efforts attempt to address this challenge, they need centralized control, which is not practical because users are rational individuals with interests to maximize their benefits. In this article, we study to design a decentralized algorithm for computation offloading, so that users can independently choose their offloading decisions. Game theory has been applied in the algorithm design. Different from existing work, we address the challenge that users may refuse to expose their information about network bandwidth and preference. Therefore, it requires that our solution should make the offloading decision without such knowledge. We formulate the problem as a partially observable Markov decision process (POMDP), which is solved by a policy gradient deep reinforcement learning (DRL) based approach. Extensive simulation results show that our proposal significantly outperforms existing solutions.
KW - Edge computing
KW - Nash equilibrium
KW - computation offloading
KW - deep reinforcement learning (DRL)
KW - partially observable Markov decision process (POMDP)
UR - http://www.scopus.com/inward/record.url?scp=85078461466&partnerID=8YFLogxK
U2 - 10.1109/TC.2020.2969148
DO - 10.1109/TC.2020.2969148
M3 - Article
AN - SCOPUS:85078461466
SN - 0018-9340
VL - 69
SP - 883
EP - 893
JO - IEEE Transactions on Computers
JF - IEEE Transactions on Computers
IS - 6
M1 - 8967118
ER -