Abstract
This letter investigates cooperative resource allocation in cellular networks with simultaneous wireless information and power transfer under time-varying channel conditions. The soft actor-critic (SAC) algorithm is exploited to tackle the optimization problem, which aims to find a feasible resource allocation policy that maximizes the data rate and system fairness while minimizing the channel switching penalty. Considering the costly agent-to-environment interactions and the restricted empirical dataset of the SAC algorithm, this letter exploits the permutation equivalence of the optimization objective and designs two data augmentation schemes for the experience replay buffer of SAC. The cumulative discounted reward shows that the data-augmentation-assisted algorithms outperform the baseline in learning speed. Simulation results on the average data rate and system fairness show that the proposed schemes benefit model training and effectively improve algorithm performance.
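The abstract's core idea can be illustrated with a minimal sketch: if the objective is permutation-equivalent across users, permuting the per-user dimensions of a stored transition yields an equally valid transition, so each real interaction can populate the replay buffer with several synthetic ones. The function below is a hypothetical illustration of this augmentation step (the state/action layout, the `num_perms` parameter, and the assumption that the reward is permutation-invariant are all assumptions, not the paper's exact scheme).

```python
import numpy as np

def augment_transition(state, action, reward, next_state, num_perms=2, rng=None):
    """Permutation-based replay augmentation (illustrative sketch).

    Assumes state, action, and next_state are per-user vectors whose
    entries are interchangeable, and that the scalar reward is invariant
    under such permutations, as implied by a permutation-equivalent
    objective.
    """
    rng = np.random.default_rng() if rng is None else rng
    n = len(action)
    # Always keep the original transition.
    transitions = [(state, action, reward, next_state)]
    for _ in range(num_perms):
        p = rng.permutation(n)
        # Apply the same user permutation consistently to every
        # per-user quantity so the transition stays self-consistent.
        transitions.append((state[p], action[p], reward, next_state[p]))
    return transitions

# Usage: one real transition becomes 1 + num_perms buffer entries.
s = np.array([0.1, 0.5, 0.9])
a = np.array([1.0, 0.0, 2.0])
batch = augment_transition(s, a, reward=0.7, next_state=s + 0.1, num_perms=2)
```

Each augmented tuple would then be pushed to the SAC replay buffer alongside the real one, stretching a limited interaction budget without extra environment queries.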
Original language | English |
---|---|
Pages (from-to) | 396-400 |
Number of pages | 5 |
Journal | IEEE Wireless Communications Letters |
Volume | 12 |
Issue number | 3 |
DOIs | |
Publication status | Published - 1 Mar 2023 |
Keywords
- Cooperative resource allocation
- deep reinforcement learning
- soft actor-critic