TY - JOUR
T1 - Intelligent Spectrum Sensing and Access with Partial Observation Based on Hierarchical Multi-Agent Deep Reinforcement Learning
AU - Li, Xuanheng
AU - Zhang, Yulong
AU - Ding, Haichuan
AU - Fang, Yuguang
N1 - Publisher Copyright:
© 2002-2012 IEEE.
PY - 2024/4/1
Y1 - 2024/4/1
N2 - Dynamic spectrum access (DSA) has been regarded as a viable solution to the spectrum shortage problem. To find idle spectrum, partial spectrum sensing could be employed by selecting a suitable sensing window (SW). Since the SW selection determines how many available bands to access, the transmission performance after the access could be used to guide the SW selection. Hence, a sophisticated joint design on spectrum sensing and access is necessary, which, however, is a challenging task when considering the dynamic nature of spectrum environment, and also the mutual impact among different secondary users (SUs). In this paper, we propose a joint partial spectrum sensing and power allocation (PA) scheme to facilitate SUs to make the best decisions on SW and PA to maximize the network throughput with reduced mutual interference. Considering the environmental dynamics and spectrum uncertainty, we develop a viable solution based on hierarchical multi-agent deep reinforcement learning (HMADRL). Our solution enables mutual design with two stages: making each SU learn the best SW and PA strategies autonomously while adapting to the dynamic environment. By using both simulated spectrum data and real spectrum data measured by SAM60-BX, we have demonstrated the effectiveness of our proposed scheme.
AB - Dynamic spectrum access (DSA) has been regarded as a viable solution to the spectrum shortage problem. To find idle spectrum, partial spectrum sensing could be employed by selecting a suitable sensing window (SW). Since the SW selection determines how many available bands to access, the transmission performance after the access could be used to guide the SW selection. Hence, a sophisticated joint design on spectrum sensing and access is necessary, which, however, is a challenging task when considering the dynamic nature of spectrum environment, and also the mutual impact among different secondary users (SUs). In this paper, we propose a joint partial spectrum sensing and power allocation (PA) scheme to facilitate SUs to make the best decisions on SW and PA to maximize the network throughput with reduced mutual interference. Considering the environmental dynamics and spectrum uncertainty, we develop a viable solution based on hierarchical multi-agent deep reinforcement learning (HMADRL). Our solution enables mutual design with two stages: making each SU learn the best SW and PA strategies autonomously while adapting to the dynamic environment. By using both simulated spectrum data and real spectrum data measured by SAM60-BX, we have demonstrated the effectiveness of our proposed scheme.
KW - Dynamic spectrum access (DSA)
KW - hierarchical deep reinforcement learning
KW - multi-agent
KW - partial spectrum sensing
KW - power allocation
UR - http://www.scopus.com/inward/record.url?scp=85168652488&partnerID=8YFLogxK
U2 - 10.1109/TWC.2023.3305567
DO - 10.1109/TWC.2023.3305567
M3 - Article
AN - SCOPUS:85168652488
SN - 1536-1276
VL - 23
SP - 3131
EP - 3145
JO - IEEE Transactions on Wireless Communications
JF - IEEE Transactions on Wireless Communications
IS - 4
ER -