Optimizing resource allocation in UAV-assisted ultra-dense networks for enhanced performance and security

Pei Gen Ye; Jun Zheng; Xiaojun Ren; Jinbin Huang; Zhenxin Zhang; Yan Pang; Guang Kou

doi:10.1016/j.ins.2024.120788

Corresponding author: Jun Zheng

School of Cyberspace Security

Research output: Contribution to journal › Article › peer-review

Abstract

The deployment of unmanned aerial vehicles (UAVs) in ultra-dense networks (UDNs) has significantly advanced network capabilities in 5G/6G environments, addressing coverage enhancement and security concerns. Our research presents a deep reinforcement learning (DRL) based approach designed to manage the increasing data traffic demands and limited communication resources in UAV-assisted UDNs. Traditional DRL methodologies often struggle with challenges like low sample efficiency and energy wastage, which can indirectly impact network security and stability. To address these concerns, we introduce the Stabilizing Transformers based Potential Driven Reinforcement Learning (STPD-RL) framework. STPD-RL optimizes critical network operations such as transmission link selection and power allocation, directly contributing to improved energy efficiency and robust network performance. First, we refine potential driven experience replay and apply it to resource allocation in UAV-assisted UDNs for the first time. By assigning a potential energy function to each state in experience replay, users can employ intrinsic state supervision to learn from a spectrum of good and bad experiences. Second, we employ stabilizing transformers to accelerate the learning of resource allocation policies, thereby enhancing the stability of model training. Third, we integrate potential driven experience replay and stabilizing transformers into the Proximal Policy Optimization (PPO) algorithm, yielding our tailored STPD-PPO. In simulations with many users and base stations, STPD-PPO outperformed traditional PPO in metrics such as entropy loss, policy loss, and value loss. Results suggest that STPD-PPO surpasses traditional DRL algorithms in several respects, including convergence rate, energy efficiency, total power consumption, and exploration capacity.

Original language	English
Article number	120788
Journal	Information Sciences
Volume	679
DOI	https://doi.org/10.1016/j.ins.2024.120788
Publication status	Published - Sep 2024

Cite this

Ye, P. G., Zheng, J., Ren, X., Huang, J., Zhang, Z., Pang, Y., & Kou, G. (2024). Optimizing resource allocation in UAV-assisted ultra-dense networks for enhanced performance and security. Information Sciences, 679, Article 120788. https://doi.org/10.1016/j.ins.2024.120788

@article{9621f401b3af4bc1baf4db84daf73d3f,
title = "Optimizing resource allocation in UAV-assisted ultra-dense networks for enhanced performance and security",
abstract = "The deployment of unmanned aerial vehicles (UAVs) in ultra-dense networks (UDNs) has significantly advanced network capabilities in 5G/6G environments, addressing coverage enhancement and security concerns. Our research presents a deep reinforcement learning (DRL) based approach designed to manage the increasing data traffic demands and limited communication resources in UAV-assisted UDNs. Traditional DRL methodologies often struggle with challenges like low sample efficiency and energy wastage, which can indirectly impact network security and stability. To address these concerns, we introduce the Stabilizing Transformers based Potential Driven Reinforcement Learning (STPD-RL) framework. STPD-RL optimizes critical network operations such as transmission link selection and power allocation, directly contributing to improved energy efficiency and robust network performance. First, we refine potential driven experience replay and apply it to resource allocation in UAV-assisted UDNs for the first time. By assigning a potential energy function to each state in experience replay, users can employ intrinsic state supervision to learn from a spectrum of good and bad experiences. Second, we employ stabilizing transformers to accelerate the learning of resource allocation policies, thereby enhancing the stability of model training. Third, we integrate potential driven experience replay and stabilizing transformers into the Proximal Policy Optimization (PPO) algorithm, yielding our tailored STPD-PPO. In simulations with many users and base stations, STPD-PPO outperformed traditional PPO in metrics such as entropy loss, policy loss, and value loss. Results suggest that STPD-PPO surpasses traditional DRL algorithms in several respects, including convergence rate, energy efficiency, total power consumption, and exploration capacity.",
keywords = "Deep reinforcement learning, Experience replay, Resource allocation, Transformers, Ultra-dense network",
author = "Ye, {Pei Gen} and Jun Zheng and Xiaojun Ren and Jinbin Huang and Zhenxin Zhang and Yan Pang and Guang Kou",
note = "Publisher Copyright: {\textcopyright} 2024",
year = "2024",
month = sep,
doi = "10.1016/j.ins.2024.120788",
language = "English",
volume = "679",
journal = "Information Sciences",
issn = "0020-0255",
publisher = "Elsevier Inc.",
}

TY - JOUR
T1 - Optimizing resource allocation in UAV-assisted ultra-dense networks for enhanced performance and security
AU - Ye, Pei Gen
AU - Zheng, Jun
AU - Ren, Xiaojun
AU - Huang, Jinbin
AU - Zhang, Zhenxin
AU - Pang, Yan
AU - Kou, Guang
PY - 2024/9
Y1 - 2024/9
N2 - The deployment of unmanned aerial vehicles (UAVs) in ultra-dense networks (UDNs) has significantly advanced network capabilities in 5G/6G environments, addressing coverage enhancement and security concerns. Our research presents a deep reinforcement learning (DRL) based approach designed to manage the increasing data traffic demands and limited communication resources in UAV-assisted UDNs. Traditional DRL methodologies often struggle with challenges like low sample efficiency and energy wastage, which can indirectly impact network security and stability. To address these concerns, we introduce the Stabilizing Transformers based Potential Driven Reinforcement Learning (STPD-RL) framework. STPD-RL optimizes critical network operations such as transmission link selection and power allocation, directly contributing to improved energy efficiency and robust network performance. First, we refine potential driven experience replay and apply it to resource allocation in UAV-assisted UDNs for the first time. By assigning a potential energy function to each state in experience replay, users can employ intrinsic state supervision to learn from a spectrum of good and bad experiences. Second, we employ stabilizing transformers to accelerate the learning of resource allocation policies, thereby enhancing the stability of model training. Third, we integrate potential driven experience replay and stabilizing transformers into the Proximal Policy Optimization (PPO) algorithm, yielding our tailored STPD-PPO. In simulations with many users and base stations, STPD-PPO outperformed traditional PPO in metrics such as entropy loss, policy loss, and value loss. Results suggest that STPD-PPO surpasses traditional DRL algorithms in several respects, including convergence rate, energy efficiency, total power consumption, and exploration capacity.
KW - Deep reinforcement learning
KW - Experience replay
KW - Resource allocation
KW - Transformers
KW - Ultra-dense network
UR - http://www.scopus.com/inward/record.url?scp=85197560359&partnerID=8YFLogxK
U2 - 10.1016/j.ins.2024.120788
DO - 10.1016/j.ins.2024.120788
M3 - Article
AN - SCOPUS:85197560359
SN - 0020-0255
VL - 679
JO - Information Sciences
JF - Information Sciences
M1 - 120788
ER -
