TY - GEN
T1 - A-DDPG
T2 - 29th IEEE/ACM International Symposium on Quality of Service, IWQOS 2021
AU - He, Nan
AU - Yang, Song
AU - Li, Fan
AU - Trajanovski, Stojan
AU - Kuipers, Fernando A.
AU - Fu, Xiaoming
N1 - Publisher Copyright:
© 2021 IEEE.
PY - 2021/6/25
Y1 - 2021/6/25
N2 - The efficacy of Network Function Virtualization (NFV) depends critically on (1) where the virtual network functions (VNFs) are placed and (2) how the traffic is routed. Unfortunately, these aspects are not easily optimized, especially under time-varying network states with differing quality of service (QoS) requirements. Given the importance of NFV, many approaches have been proposed to solve the VNF placement and traffic routing problem, but they largely assume that the network state is static and known, disregarding real-time network variations. To bridge that gap, we formulate the VNF placement and traffic routing problem as a Markov Decision Process that captures dynamic network state transitions. To jointly minimize delay and cost for NFV providers while maximizing revenue, we devise a customized Deep Reinforcement Learning (DRL) algorithm, called A-DDPG, for VNF placement and traffic routing in real-time networks. A-DDPG uses an attention mechanism to ascertain smooth network behavior within the general framework of network utility maximization (NUM). Simulation results show that A-DDPG outperforms the state of the art in terms of network utility, delay, and cost.
KW - Network function virtualization
KW - deep reinforcement learning
KW - placement
KW - routing
UR - http://www.scopus.com/inward/record.url?scp=85115406035&partnerID=8YFLogxK
U2 - 10.1109/IWQOS52092.2021.9521285
DO - 10.1109/IWQOS52092.2021.9521285
M3 - Conference contribution
AN - SCOPUS:85115406035
T3 - 2021 IEEE/ACM 29th International Symposium on Quality of Service, IWQOS 2021
BT - 2021 IEEE/ACM 29th International Symposium on Quality of Service, IWQOS 2021
PB - Institute of Electrical and Electronics Engineers Inc.
Y2 - 25 June 2021 through 28 June 2021
ER -