A Deep Reinforcement Learning Based Offloading Game in Edge Computing

Yufeng Zhan; Song Guo; Peng Li; Jiang Zhang

doi:10.1109/TC.2020.2969148

Yufeng Zhan, Song Guo, Peng Li^*, Jiang Zhang

^* Corresponding author for this work

Research output: Contribution to journal › Article › peer-review

175 citations (Scopus)

Abstract

Edge computing is a new paradigm to provide strong computing capability at the edge of pervasive radio access networks close to users. A critical research challenge of edge computing is to design an efficient offloading strategy to decide which tasks can be offloaded to edge servers with limited resources. Although many research efforts attempt to address this challenge, they need centralized control, which is not practical because users are rational individuals with interests to maximize their benefits. In this article, we study to design a decentralized algorithm for computation offloading, so that users can independently choose their offloading decisions. Game theory has been applied in the algorithm design. Different from existing work, we address the challenge that users may refuse to expose their information about network bandwidth and preference. Therefore, it requires that our solution should make the offloading decision without such knowledge. We formulate the problem as a partially observable Markov decision process (POMDP), which is solved by a policy gradient deep reinforcement learning (DRL) based approach. Extensive simulation results show that our proposal significantly outperforms existing solutions.

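Original language	English
Article number	8967118
Pages (from-to)	883-893
Number of pages	11
Journal	IEEE Transactions on Computers
Volume	69
Issue number	6
DOI	https://doi.org/10.1109/TC.2020.2969148
Publication status	Published - 1 Jun 2020
Externally published	Yes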

Access to document

10.1109/TC.2020.2969148

Other files and links

Link to publication in Scopus

Cite this

@article{461b43b3bd9343b3aca2d5365d5a2ef3,

title = "A Deep Reinforcement Learning Based Offloading Game in Edge Computing",

abstract = "Edge computing is a new paradigm to provide strong computing capability at the edge of pervasive radio access networks close to users. A critical research challenge of edge computing is to design an efficient offloading strategy to decide which tasks can be offloaded to edge servers with limited resources. Although many research efforts attempt to address this challenge, they need centralized control, which is not practical because users are rational individuals with interests to maximize their benefits. In this article, we study to design a decentralized algorithm for computation offloading, so that users can independently choose their offloading decisions. Game theory has been applied in the algorithm design. Different from existing work, we address the challenge that users may refuse to expose their information about network bandwidth and preference. Therefore, it requires that our solution should make the offloading decision without such knowledge. We formulate the problem as a partially observable Markov decision process (POMDP), which is solved by a policy gradient deep reinforcement learning (DRL) based approach. Extensive simulation results show that our proposal significantly outperforms existing solutions.",

keywords = "Edge computing, Nash equilibrium, computation offloading, deep reinforcement learning (DRL), partially observable Markov decision process (POMDP)",

author = "Yufeng Zhan and Song Guo and Peng Li and Jiang Zhang",

note = "Publisher Copyright: {\textcopyright} 1968-2012 IEEE.",

year = "2020",

month = jun,

day = "1",

doi = "10.1109/TC.2020.2969148",

language = "English",

volume = "69",

pages = "883--893",

journal = "IEEE Transactions on Computers",

issn = "0018-9340",

publisher = "IEEE Computer Society",

number = "6",

}

TY - JOUR

T1 - A Deep Reinforcement Learning Based Offloading Game in Edge Computing

AU - Zhan, Yufeng

AU - Guo, Song

AU - Li, Peng

AU - Zhang, Jiang

PY - 2020/6/1

Y1 - 2020/6/1

N2 - Edge computing is a new paradigm to provide strong computing capability at the edge of pervasive radio access networks close to users. A critical research challenge of edge computing is to design an efficient offloading strategy to decide which tasks can be offloaded to edge servers with limited resources. Although many research efforts attempt to address this challenge, they need centralized control, which is not practical because users are rational individuals with interests to maximize their benefits. In this article, we study to design a decentralized algorithm for computation offloading, so that users can independently choose their offloading decisions. Game theory has been applied in the algorithm design. Different from existing work, we address the challenge that users may refuse to expose their information about network bandwidth and preference. Therefore, it requires that our solution should make the offloading decision without such knowledge. We formulate the problem as a partially observable Markov decision process (POMDP), which is solved by a policy gradient deep reinforcement learning (DRL) based approach. Extensive simulation results show that our proposal significantly outperforms existing solutions.

AB - Edge computing is a new paradigm to provide strong computing capability at the edge of pervasive radio access networks close to users. A critical research challenge of edge computing is to design an efficient offloading strategy to decide which tasks can be offloaded to edge servers with limited resources. Although many research efforts attempt to address this challenge, they need centralized control, which is not practical because users are rational individuals with interests to maximize their benefits. In this article, we study to design a decentralized algorithm for computation offloading, so that users can independently choose their offloading decisions. Game theory has been applied in the algorithm design. Different from existing work, we address the challenge that users may refuse to expose their information about network bandwidth and preference. Therefore, it requires that our solution should make the offloading decision without such knowledge. We formulate the problem as a partially observable Markov decision process (POMDP), which is solved by a policy gradient deep reinforcement learning (DRL) based approach. Extensive simulation results show that our proposal significantly outperforms existing solutions.

KW - Edge computing

KW - Nash equilibrium

KW - computation offloading

KW - deep reinforcement learning (DRL)

KW - partially observable Markov decision process (POMDP)

UR - http://www.scopus.com/inward/record.url?scp=85078461466&partnerID=8YFLogxK

U2 - 10.1109/TC.2020.2969148

DO - 10.1109/TC.2020.2969148

M3 - Article

AN - SCOPUS:85078461466

SN - 0018-9340

VL - 69

SP - 883

EP - 893

JO - IEEE Transactions on Computers

JF - IEEE Transactions on Computers

IS - 6

M1 - 8967118

ER -
