Multi-Agent Power and Resource Allocation for D2D Communications: A Deep Reinforcement Learning Approach

Honglin Xiang; Jingyi Peng; Zhen Gao; Lingjie Li; Yang Yang

doi:10.1109/VTC2022-Fall57202.2022.10012889

Multi-Agent Power and Resource Allocation for D2D Communications: A Deep Reinforcement Learning Approach

Honglin Xiang^*, Jingyi Peng, Zhen Gao, Lingjie Li, Yang Yang

^*Corresponding author for this work

Advanced Research Institute of Multidisciplinary Science

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

3 Citations (Scopus)

Abstract

The explosion in the number of smartphones and wearable devices brings the challenge of high achievable rate (AR) requirement, and D2D communications become the critical technology to solve this challenge. However, the co-channel interference caused by spectrum reusing and low delay requirement restrict D2D communications performance improvements. In this paper, we consider the cases of no time delay constraint and time delay constraint respectively, and design a joint power control and resource allocation scheme based on deep reinforcement learning (DRL) to maximize the AR of cellular users (CUEs) and D2D users (DUEs). Specifically, D2D pairs are considered multiple agents for reusing CUE spectrum, each agent can independently select spectrum resources and power without any prior information to ease interference. Furthermore, a double deep Q-network with priority sampling (Pr-DDQN) distributed algorithm is proposed, which helps agents to learn more dominant features during experience replay. Simulation results indicate that Pr-DDQN algorithm can obtain a higher AR than the present DRL algorithms. In particular, the probability of selecting low power of agents enlarges as the increase of the remaining transmission time, which demonstrates that the agents can successfully learn and perceive the implicit relationship of time delay constraint.

Original language	English
Title of host publication	2022 IEEE 96th Vehicular Technology Conference, VTC 2022-Fall 2022 - Proceedings
Publisher	Institute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)	9781665454681
DOIs	https://doi.org/10.1109/VTC2022-Fall57202.2022.10012889
Publication status	Published - 2022
Event	96th IEEE Vehicular Technology Conference, VTC 2022-Fall 2022 - London, United Kingdom Duration: 26 Sept 2022 → 29 Sept 2022

Publication series

Name	IEEE Vehicular Technology Conference
Volume	2022-September
ISSN (Print)	1550-2252

Conference

Conference	96th IEEE Vehicular Technology Conference, VTC 2022-Fall 2022
Country/Territory	United Kingdom
City	London
Period	26/09/22 → 29/09/22

Keywords

Device-to-device communications
deep reinforcement learning
power control
resource allocation

Access to Document

10.1109/VTC2022-Fall57202.2022.10012889

Cite this

Xiang, H., Peng, J., Gao, Z., Li, L., & Yang, Y. (2022). Multi-Agent Power and Resource Allocation for D2D Communications: A Deep Reinforcement Learning Approach. In 2022 IEEE 96th Vehicular Technology Conference, VTC 2022-Fall 2022 - Proceedings (IEEE Vehicular Technology Conference; Vol. 2022-September). Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/VTC2022-Fall57202.2022.10012889

@inproceedings{ce2666bc5fc44744be08ecc142748a60,

title = "Multi-Agent Power and Resource Allocation for D2D Communications: A Deep Reinforcement Learning Approach",

abstract = "The explosion in the number of smartphones and wearable devices brings the challenge of high achievable rate (AR) requirement, and D2D communications become the critical technology to solve this challenge. However, the co-channel interference caused by spectrum reusing and low delay requirement restrict D2D communications performance improvements. In this paper, we consider the cases of no time delay constraint and time delay constraint respectively, and design a joint power control and resource allocation scheme based on deep reinforcement learning (DRL) to maximize the AR of cellular users (CUEs) and D2D users (DUEs). Specifically, D2D pairs are considered multiple agents for reusing CUE spectrum, each agent can independently select spectrum resources and power without any prior information to ease interference. Furthermore, a double deep Q-network with priority sampling (Pr-DDQN) distributed algorithm is proposed, which helps agents to learn more dominant features during experience replay. Simulation results indicate that Pr-DDQN algorithm can obtain a higher AR than the present DRL algorithms. In particular, the probability of selecting low power of agents enlarges as the increase of the remaining transmission time, which demonstrates that the agents can successfully learn and perceive the implicit relationship of time delay constraint.",

keywords = "Device-to-device communications, deep reinforcement learning, power control, resource allocation",

author = "Honglin Xiang and Jingyi Peng and Zhen Gao and Lingjie Li and Yang Yang",

note = "Publisher Copyright: {\textcopyright} 2022 IEEE.; 96th IEEE Vehicular Technology Conference, VTC 2022-Fall 2022 ; Conference date: 26-09-2022 Through 29-09-2022",

year = "2022",

doi = "10.1109/VTC2022-Fall57202.2022.10012889",

language = "English",

series = "IEEE Vehicular Technology Conference",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

booktitle = "2022 IEEE 96th Vehicular Technology Conference, VTC 2022-Fall 2022 - Proceedings",

address = "United States",

}

Xiang, H, Peng, J, Gao, Z, Li, L & Yang, Y 2022, Multi-Agent Power and Resource Allocation for D2D Communications: A Deep Reinforcement Learning Approach. in 2022 IEEE 96th Vehicular Technology Conference, VTC 2022-Fall 2022 - Proceedings. IEEE Vehicular Technology Conference, vol. 2022-September, Institute of Electrical and Electronics Engineers Inc., 96th IEEE Vehicular Technology Conference, VTC 2022-Fall 2022, London, United Kingdom, 26/09/22. https://doi.org/10.1109/VTC2022-Fall57202.2022.10012889

Multi-Agent Power and Resource Allocation for D2D Communications: A Deep Reinforcement Learning Approach. / Xiang, Honglin; Peng, Jingyi; Gao, Zhen et al.
2022 IEEE 96th Vehicular Technology Conference, VTC 2022-Fall 2022 - Proceedings. Institute of Electrical and Electronics Engineers Inc., 2022. (IEEE Vehicular Technology Conference; Vol. 2022-September).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

TY - GEN

T1 - Multi-Agent Power and Resource Allocation for D2D Communications

T2 - 96th IEEE Vehicular Technology Conference, VTC 2022-Fall 2022

AU - Xiang, Honglin

AU - Peng, Jingyi

AU - Gao, Zhen

AU - Li, Lingjie

AU - Yang, Yang

PY - 2022

Y1 - 2022

N2 - The explosion in the number of smartphones and wearable devices brings the challenge of high achievable rate (AR) requirement, and D2D communications become the critical technology to solve this challenge. However, the co-channel interference caused by spectrum reusing and low delay requirement restrict D2D communications performance improvements. In this paper, we consider the cases of no time delay constraint and time delay constraint respectively, and design a joint power control and resource allocation scheme based on deep reinforcement learning (DRL) to maximize the AR of cellular users (CUEs) and D2D users (DUEs). Specifically, D2D pairs are considered multiple agents for reusing CUE spectrum, each agent can independently select spectrum resources and power without any prior information to ease interference. Furthermore, a double deep Q-network with priority sampling (Pr-DDQN) distributed algorithm is proposed, which helps agents to learn more dominant features during experience replay. Simulation results indicate that Pr-DDQN algorithm can obtain a higher AR than the present DRL algorithms. In particular, the probability of selecting low power of agents enlarges as the increase of the remaining transmission time, which demonstrates that the agents can successfully learn and perceive the implicit relationship of time delay constraint.

AB - The explosion in the number of smartphones and wearable devices brings the challenge of high achievable rate (AR) requirement, and D2D communications become the critical technology to solve this challenge. However, the co-channel interference caused by spectrum reusing and low delay requirement restrict D2D communications performance improvements. In this paper, we consider the cases of no time delay constraint and time delay constraint respectively, and design a joint power control and resource allocation scheme based on deep reinforcement learning (DRL) to maximize the AR of cellular users (CUEs) and D2D users (DUEs). Specifically, D2D pairs are considered multiple agents for reusing CUE spectrum, each agent can independently select spectrum resources and power without any prior information to ease interference. Furthermore, a double deep Q-network with priority sampling (Pr-DDQN) distributed algorithm is proposed, which helps agents to learn more dominant features during experience replay. Simulation results indicate that Pr-DDQN algorithm can obtain a higher AR than the present DRL algorithms. In particular, the probability of selecting low power of agents enlarges as the increase of the remaining transmission time, which demonstrates that the agents can successfully learn and perceive the implicit relationship of time delay constraint.

KW - Device-to-device communications

KW - deep reinforcement learning

KW - power control

KW - resource allocation

UR - http://www.scopus.com/inward/record.url?scp=85146987937&partnerID=8YFLogxK

U2 - 10.1109/VTC2022-Fall57202.2022.10012889

DO - 10.1109/VTC2022-Fall57202.2022.10012889

M3 - Conference contribution

AN - SCOPUS:85146987937

T3 - IEEE Vehicular Technology Conference

BT - 2022 IEEE 96th Vehicular Technology Conference, VTC 2022-Fall 2022 - Proceedings

PB - Institute of Electrical and Electronics Engineers Inc.

Y2 - 26 September 2022 through 29 September 2022

ER -

Xiang H, Peng J, Gao Z, Li L, Yang Y. Multi-Agent Power and Resource Allocation for D2D Communications: A Deep Reinforcement Learning Approach. In 2022 IEEE 96th Vehicular Technology Conference, VTC 2022-Fall 2022 - Proceedings. Institute of Electrical and Electronics Engineers Inc. 2022. (IEEE Vehicular Technology Conference). doi: 10.1109/VTC2022-Fall57202.2022.10012889

Multi-Agent Power and Resource Allocation for D2D Communications: A Deep Reinforcement Learning Approach

Abstract

Publication series

Conference

Keywords

Access to Document

Other files and links

Fingerprint

Cite this