Multi-Agent Power and Resource Allocation for D2D Communications: A Deep Reinforcement Learning Approach

Honglin Xiang*, Jingyi Peng, Zhen Gao, Lingjie Li, Yang Yang

*此作品的通讯作者

科研成果: 书/报告/会议事项章节会议稿件同行评审

3 引用 (Scopus)

摘要

The explosion in the number of smartphones and wearable devices brings the challenge of high achievable rate (AR) requirement, and D2D communications become the critical technology to solve this challenge. However, the co-channel interference caused by spectrum reusing and low delay requirement restrict D2D communications performance improvements. In this paper, we consider the cases of no time delay constraint and time delay constraint respectively, and design a joint power control and resource allocation scheme based on deep reinforcement learning (DRL) to maximize the AR of cellular users (CUEs) and D2D users (DUEs). Specifically, D2D pairs are considered multiple agents for reusing CUE spectrum, each agent can independently select spectrum resources and power without any prior information to ease interference. Furthermore, a double deep Q-network with priority sampling (Pr-DDQN) distributed algorithm is proposed, which helps agents to learn more dominant features during experience replay. Simulation results indicate that Pr-DDQN algorithm can obtain a higher AR than the present DRL algorithms. In particular, the probability of selecting low power of agents enlarges as the increase of the remaining transmission time, which demonstrates that the agents can successfully learn and perceive the implicit relationship of time delay constraint.

源语言英语
主期刊名2022 IEEE 96th Vehicular Technology Conference, VTC 2022-Fall 2022 - Proceedings
出版商Institute of Electrical and Electronics Engineers Inc.
ISBN(电子版)9781665454681
DOI
出版状态已出版 - 2022
活动96th IEEE Vehicular Technology Conference, VTC 2022-Fall 2022 - London, 英国
期限: 26 9月 202229 9月 2022

出版系列

姓名IEEE Vehicular Technology Conference
2022-September
ISSN(印刷版)1550-2252

会议

会议96th IEEE Vehicular Technology Conference, VTC 2022-Fall 2022
国家/地区英国
London
时期26/09/2229/09/22

指纹

探究 'Multi-Agent Power and Resource Allocation for D2D Communications: A Deep Reinforcement Learning Approach' 的科研主题。它们共同构成独一无二的指纹。

引用此