TY - JOUR
T1 - Deep Reinforcement Learning-Based Downlink Beamforming and Phase Optimization for RIS-Aided Communication System
AU - Li, Lingjie
AU - Yang, Yang
AU - Bao, Lingyan
AU - Gao, Zhen
AU - Wu, Yongpeng
AU - Xiang, Honglin
N1 - Publisher Copyright:
© 2012 IEEE.
PY - 2023/12/1
Y1 - 2023/12/1
N2 - Reconfigurable intelligent surface (RIS) has been envisioned as a critical technology for the future wireless communication systems. However, the inter-cell interference and the unbearable signal processing complexity seriously constrain the wide application of RIS. In this letter, we investigate a deep reinforcement learning (DRL)-based scheme of downlink beamforming and phase optimization for a multi-cell RIS-aided communication system. With the transmit power constraint, a maximization problem of average ergodic rate is formulated. We design a deep deterministic policy gradient (DDPG)-based scheme, which enables base station (BS) and RIS to intelligently control the beamforming and phase shift, respectively. Further, a DDPG-based distributed training decentralized execution algorithm is proposed to acquire the feature difference among multi-cells and maximize the average ergodic rate. Simulation results show that the proposed algorithm outperforms the other baseline algorithms, which proves the agents can effectively learn and sense the change of multi-cell interference.
AB - Reconfigurable intelligent surface (RIS) has been envisioned as a critical technology for the future wireless communication systems. However, the inter-cell interference and the unbearable signal processing complexity seriously constrain the wide application of RIS. In this letter, we investigate a deep reinforcement learning (DRL)-based scheme of downlink beamforming and phase optimization for a multi-cell RIS-aided communication system. With the transmit power constraint, a maximization problem of average ergodic rate is formulated. We design a deep deterministic policy gradient (DDPG)-based scheme, which enables base station (BS) and RIS to intelligently control the beamforming and phase shift, respectively. Further, a DDPG-based distributed training decentralized execution algorithm is proposed to acquire the feature difference among multi-cells and maximize the average ergodic rate. Simulation results show that the proposed algorithm outperforms the other baseline algorithms, which proves the agents can effectively learn and sense the change of multi-cell interference.
KW - Beamforming
KW - deep reinforcement learning
KW - inter-cell interference
KW - phase shift
KW - reconfigurable intelligent surface
UR - http://www.scopus.com/inward/record.url?scp=85173034798&partnerID=8YFLogxK
U2 - 10.1109/LWC.2023.3318212
DO - 10.1109/LWC.2023.3318212
M3 - Article
AN - SCOPUS:85173034798
SN - 2162-2337
VL - 12
SP - 2263
EP - 2267
JO - IEEE Wireless Communications Letters
JF - IEEE Wireless Communications Letters
IS - 12
ER -