Joint Optimization of Bandwidth and Power Allocation in Uplink Systems with Deep Reinforcement Learning

Chongli Zhang; Tiejun Lv; Pingmu Huang; Zhipeng Lin; Jie Zeng; Yuan Ren

doi:10.3390/s23156822

Chongli Zhang, Tiejun Lv, Pingmu Huang, Zhipeng Lin*, Jie Zeng, Yuan Ren

*Corresponding author for this work

School of Cyberspace Science and Technology

Research output: Contribution to journal › Article › peer-review

Abstract

Efficient use of wireless resources is a central concern for future communication systems, because it helps alleviate the degradation in communication quality caused by interference that grows sharply as the number of users increases, especially inter-cell interference in multi-cell multi-user systems. To tackle this interference and improve the resource utilization rate, we proposed a joint-priority-based reinforcement learning (JPRL) approach that jointly optimizes bandwidth and transmit power allocation. The method aims to maximize the average throughput of the system while suppressing co-channel interference and satisfying the quality of service (QoS) constraint. Specifically, we decoupled the joint problem into two sub-problems: bandwidth assignment and power allocation. A multi-agent double deep Q network (MADDQN) was developed to solve the bandwidth allocation sub-problem for each user, and a prioritized multi-agent deep deterministic policy gradient (P-MADDPG) algorithm, which deploys a prioritized replay buffer, was designed to handle the transmit power allocation sub-problem. Numerical results show that the proposed JPRL method accelerated model training and outperformed the alternative methods in terms of throughput. For example, the average throughput was approximately 10.4–15.5% higher than that of the homogeneous-learning-based benchmarks and about 17.3% higher than that of the genetic algorithm.
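To make the "prioritized replay buffer" mentioned in the abstract more concrete, the following is a minimal Python sketch of one common variant, proportional prioritization by temporal-difference (TD) error. The abstract does not say which variant the authors use, so this is an illustration under that assumption and not the paper's implementation. The names `Transition` and `PrioritizedReplayBuffer`, and the parameters `alpha`, `beta`, and `eps` with their default values, are hypothetical.

```python
import random
from collections import namedtuple

# Hypothetical container for one experience tuple.
Transition = namedtuple("Transition", ["state", "action", "reward", "next_state"])


class PrioritizedReplayBuffer:
    """Proportional prioritized replay (illustrative sketch; defaults are assumptions)."""

    def __init__(self, capacity, alpha=0.6, eps=1e-5):
        self.capacity = capacity
        self.alpha = alpha  # how strongly priorities skew sampling (0 = uniform)
        self.eps = eps      # keeps every priority strictly positive
        self.data = []
        self.priorities = []
        self.pos = 0        # next slot to overwrite once the buffer is full

    def add(self, transition):
        # New transitions take the current maximum priority so they are likely to be replayed.
        max_p = max(self.priorities, default=1.0)
        if len(self.data) < self.capacity:
            self.data.append(transition)
            self.priorities.append(max_p)
        else:
            self.data[self.pos] = transition
            self.priorities[self.pos] = max_p
        self.pos = (self.pos + 1) % self.capacity

    def sample(self, batch_size, beta=0.4):
        # Sampling probability is proportional to priority ** alpha.
        scaled = [p ** self.alpha for p in self.priorities]
        total = sum(scaled)
        probs = [s / total for s in scaled]
        indices = random.choices(range(len(self.data)), weights=probs, k=batch_size)
        # Importance-sampling weights correct the bias from non-uniform sampling.
        n = len(self.data)
        is_weights = [(n * probs[i]) ** (-beta) for i in indices]
        max_w = max(is_weights)
        is_weights = [w / max_w for w in is_weights]
        batch = [self.data[i] for i in indices]
        return batch, indices, is_weights

    def update_priorities(self, indices, td_errors):
        # After a learning step, a larger |TD error| means a higher replay priority.
        for i, err in zip(indices, td_errors):
            self.priorities[i] = abs(err) + self.eps
```

This list-based version recomputes all sampling probabilities on every call, which costs time linear in the buffer size; larger buffers are usually backed by a sum tree instead.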

Original language	English
Article number	6822
Journal	Sensors
Volume	23
Issue number	15
DOIs	https://doi.org/10.3390/s23156822
Publication status	Published - Aug 2023

Keywords

joint-priority-based reinforcement learning (JPRL)
multi-cell multi-user system
prioritized replay buffer
throughput
uplink

Cite this

@article{541de0089ad64fadb6568c3313e8c5fb,

title = "Joint Optimization of Bandwidth and Power Allocation in Uplink Systems with Deep Reinforcement Learning",

abstract = "Efficient use of wireless resources is a central concern for future communication systems, because it helps alleviate the degradation in communication quality caused by interference that grows sharply as the number of users increases, especially inter-cell interference in multi-cell multi-user systems. To tackle this interference and improve the resource utilization rate, we proposed a joint-priority-based reinforcement learning (JPRL) approach that jointly optimizes bandwidth and transmit power allocation. The method aims to maximize the average throughput of the system while suppressing co-channel interference and satisfying the quality of service (QoS) constraint. Specifically, we decoupled the joint problem into two sub-problems: bandwidth assignment and power allocation. A multi-agent double deep Q network (MADDQN) was developed to solve the bandwidth allocation sub-problem for each user, and a prioritized multi-agent deep deterministic policy gradient (P-MADDPG) algorithm, which deploys a prioritized replay buffer, was designed to handle the transmit power allocation sub-problem. Numerical results show that the proposed JPRL method accelerated model training and outperformed the alternative methods in terms of throughput. For example, the average throughput was approximately 10.4–15.5% higher than that of the homogeneous-learning-based benchmarks and about 17.3% higher than that of the genetic algorithm.",

keywords = "joint-priority-based reinforcement learning (JPRL), multi-cell multi-user system, prioritized replay buffer, throughput, uplink",

author = "Chongli Zhang and Tiejun Lv and Pingmu Huang and Zhipeng Lin and Jie Zeng and Yuan Ren",

note = "Publisher Copyright: {\textcopyright} 2023 by the authors.",

year = "2023",

month = aug,

doi = "10.3390/s23156822",

language = "English",

volume = "23",

journal = "Sensors",

issn = "1424-8220",

publisher = "Multidisciplinary Digital Publishing Institute (MDPI)",

number = "15",

}

TY - JOUR

T1 - Joint Optimization of Bandwidth and Power Allocation in Uplink Systems with Deep Reinforcement Learning

AU - Zhang, Chongli

AU - Lv, Tiejun

AU - Huang, Pingmu

AU - Lin, Zhipeng

AU - Zeng, Jie

AU - Ren, Yuan

PY - 2023/8

Y1 - 2023/8

AB - Efficient use of wireless resources is a central concern for future communication systems, because it helps alleviate the degradation in communication quality caused by interference that grows sharply as the number of users increases, especially inter-cell interference in multi-cell multi-user systems. To tackle this interference and improve the resource utilization rate, we proposed a joint-priority-based reinforcement learning (JPRL) approach that jointly optimizes bandwidth and transmit power allocation. The method aims to maximize the average throughput of the system while suppressing co-channel interference and satisfying the quality of service (QoS) constraint. Specifically, we decoupled the joint problem into two sub-problems: bandwidth assignment and power allocation. A multi-agent double deep Q network (MADDQN) was developed to solve the bandwidth allocation sub-problem for each user, and a prioritized multi-agent deep deterministic policy gradient (P-MADDPG) algorithm, which deploys a prioritized replay buffer, was designed to handle the transmit power allocation sub-problem. Numerical results show that the proposed JPRL method accelerated model training and outperformed the alternative methods in terms of throughput. For example, the average throughput was approximately 10.4–15.5% higher than that of the homogeneous-learning-based benchmarks and about 17.3% higher than that of the genetic algorithm.

KW - joint-priority-based reinforcement learning (JPRL)

KW - multi-cell multi-user system

KW - prioritized replay buffer

KW - throughput

KW - uplink

UR - http://www.scopus.com/inward/record.url?scp=85167764529&partnerID=8YFLogxK

U2 - 10.3390/s23156822

DO - 10.3390/s23156822

M3 - Article

C2 - 37571605

AN - SCOPUS:85167764529

SN - 1424-8220

VL - 23

JO - Sensors

JF - Sensors

IS - 15

M1 - 6822

ER -
