Proactive Content Caching Based on Actor-Critic Reinforcement Learning for Mobile Edge Networks

Wei Jiang, Daquan Feng*, Yao Sun, Gang Feng, Zhenzhong Wang, Xiang Gen Xia

*Corresponding author for this work

Research output: Contribution to journal › Article › peer-review

14 Citations (Scopus)

Abstract

Mobile edge caching/computing (MEC) has emerged as a promising approach for addressing the drastically increasing mobile data traffic by bringing high caching and computing capabilities to the edge of networks. Under the MEC architecture, content providers (CPs) are allowed to lease virtual machines (VMs) at MEC servers to proactively cache popular content and thereby improve users' quality of experience. This scalable cache resource model raises the challenge of determining the ideal number of leased VMs for CPs to obtain the minimum expected downloading delay of users at the lowest caching cost. To address this challenge, we propose an actor-critic (AC) reinforcement learning based proactive caching policy for mobile edge networks that requires no prior knowledge of users' content demand. Specifically, we formulate the proactive caching problem under dynamic user content demand as a Markov decision process and propose an AC-based caching algorithm to minimize the caching cost and the expected downloading delay. In particular, to reduce the computational complexity, a branching neural network is employed to approximate the policy function in the actor part. Numerical results show that the proposed caching algorithm can significantly reduce the total cost and the average downloading delay when compared with other popular algorithms.
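
The abstract does not spell out the network or the action encoding, so the following is only a rough sketch of why a branching actor can lower complexity, not the authors' implementation. It assumes the joint caching action factors into independent sub-decisions ("branches"), each with a small set of choices, and that the reward is a negative cost. All function names and parameters here are hypothetical.

```python
import math
import random


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def flat_output_size(num_branches, choices_per_branch):
    """Outputs needed if the actor scores every joint action directly."""
    return choices_per_branch ** num_branches


def branching_output_size(num_branches, choices_per_branch):
    """Outputs needed if the actor has one small head per sub-decision."""
    return num_branches * choices_per_branch


def sample_branching_action(branch_logits, rng):
    """Sample one choice per branch (assumed independent across branches).

    Returns the joint action and its log-probability, which is the sum of
    the per-branch log-probabilities under the independence assumption.
    """
    action, log_prob = [], 0.0
    for logits in branch_logits:
        probs = softmax(logits)
        choice = rng.choices(range(len(probs)), weights=probs, k=1)[0]
        action.append(choice)
        log_prob += math.log(probs[choice])
    return action, log_prob


def td_error(reward, value_s, value_next, gamma=0.9):
    """One-step temporal-difference error used as the critic's signal.

    The reward is assumed to be the negative of (caching cost + delay);
    gamma is an illustrative discount factor, not a value from the paper.
    """
    return reward + gamma * value_next - value_s


if __name__ == "__main__":
    rng = random.Random(0)
    # Ten hypothetical binary sub-decisions with untrained (zero) logits.
    logits = [[0.0, 0.0] for _ in range(10)]
    action, log_prob = sample_branching_action(logits, rng)
    print(len(action), flat_output_size(10, 2), branching_output_size(10, 2))
```

With ten binary sub-decisions, a flat policy head would need 2^10 = 1024 outputs, whereas the branching layout needs only 10 × 2 = 20, which is the kind of saving the abstract alludes to.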

Original language: English
Pages (from-to): 1239-1252
Number of pages: 14
Journal: IEEE Transactions on Cognitive Communications and Networking
Volume: 8
Issue number: 2
Publication status: Published - 1 Jun 2022
Externally published: Yes

Keywords

  • Actor-critic algorithm
  • Branching neural network
  • Mobile edge caching
  • Reinforcement learning
