Intelligent Spectrum Sensing and Access with Partial Observation Based on Hierarchical Multi-Agent Deep Reinforcement Learning

Xuanheng Li; Yulong Zhang; Haichuan Ding; Yuguang Fang

doi:10.1109/TWC.2023.3305567

Intelligent Spectrum Sensing and Access with Partial Observation Based on Hierarchical Multi-Agent Deep Reinforcement Learning

Xuanheng Li^*, Yulong Zhang, Haichuan Ding, Yuguang Fang

^*Corresponding author for this work

School of Cyberspace Science and Technology

Research output: Contribution to journal › Article › peer-review

8 Citations (Scopus)

Abstract

Dynamic spectrum access (DSA) has been regarded as a viable solution to the spectrum shortage problem. To find idle spectrum, partial spectrum sensing could be employed by selecting a suitable sensing window (SW). Since the SW selection determines how many available bands to access, the transmission performance after the access could be used to guide the SW selection. Hence, a sophisticated joint design on spectrum sensing and access is necessary, which, however, is a challenging task when considering the dynamic nature of spectrum environment, and also the mutual impact among different secondary users (SUs). In this paper, we propose a joint partial spectrum sensing and power allocation (PA) scheme to facilitate SUs to make the best decisions on SW and PA to maximize the network throughput with reduced mutual interference. Considering the environmental dynamics and spectrum uncertainty, we develop a viable solution based on hierarchical multi-agent deep reinforcement learning (HMADRL). Our solution enables mutual design with two stages: making each SU learn the best SW and PA strategies autonomously while adapting to the dynamic environment. By using both simulated spectrum data and real spectrum data measured by SAM60-BX, we have demonstrated the effectiveness of our proposed scheme.

Original language	English
Pages (from-to)	3131-3145
Number of pages	15
Journal	IEEE Transactions on Wireless Communications
Volume	23
Issue number	4
DOIs	https://doi.org/10.1109/TWC.2023.3305567
Publication status	Published - 1 Apr 2024

Keywords

Dynamic spectrum access (DSA)
hierarchical deep reinforcement learning
multi-agent
partial spectrum sensing
power allocation

Access to Document

10.1109/TWC.2023.3305567

Cite this

Li, X., Zhang, Y., Ding, H., & Fang, Y. (2024). Intelligent Spectrum Sensing and Access with Partial Observation Based on Hierarchical Multi-Agent Deep Reinforcement Learning. IEEE Transactions on Wireless Communications, 23(4), 3131-3145. https://doi.org/10.1109/TWC.2023.3305567

@article{011b17265a7142f2b76b75a3e694019a,

title = "Intelligent Spectrum Sensing and Access with Partial Observation Based on Hierarchical Multi-Agent Deep Reinforcement Learning",

abstract = "Dynamic spectrum access (DSA) has been regarded as a viable solution to the spectrum shortage problem. To find idle spectrum, partial spectrum sensing could be employed by selecting a suitable sensing window (SW). Since the SW selection determines how many available bands to access, the transmission performance after the access could be used to guide the SW selection. Hence, a sophisticated joint design on spectrum sensing and access is necessary, which, however, is a challenging task when considering the dynamic nature of spectrum environment, and also the mutual impact among different secondary users (SUs). In this paper, we propose a joint partial spectrum sensing and power allocation (PA) scheme to facilitate SUs to make the best decisions on SW and PA to maximize the network throughput with reduced mutual interference. Considering the environmental dynamics and spectrum uncertainty, we develop a viable solution based on hierarchical multi-agent deep reinforcement learning (HMADRL). Our solution enables mutual design with two stages: making each SU learn the best SW and PA strategies autonomously while adapting to the dynamic environment. By using both simulated spectrum data and real spectrum data measured by SAM60-BX, we have demonstrated the effectiveness of our proposed scheme.",

keywords = "Dynamic spectrum access (DSA), hierarchical deep reinforcement learning, multi-agent, partial spectrum sensing, power allocation",

author = "Xuanheng Li and Yulong Zhang and Haichuan Ding and Yuguang Fang",

note = "Publisher Copyright: {\textcopyright} 2002-2012 IEEE.",

year = "2024",

month = apr,

day = "1",

doi = "10.1109/TWC.2023.3305567",

language = "English",

volume = "23",

pages = "3131--3145",

journal = "IEEE Transactions on Wireless Communications",

issn = "1536-1276",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

number = "4",

}

TY - JOUR

T1 - Intelligent Spectrum Sensing and Access with Partial Observation Based on Hierarchical Multi-Agent Deep Reinforcement Learning

AU - Li, Xuanheng

AU - Zhang, Yulong

AU - Ding, Haichuan

AU - Fang, Yuguang

PY - 2024/4/1

Y1 - 2024/4/1

N2 - Dynamic spectrum access (DSA) has been regarded as a viable solution to the spectrum shortage problem. To find idle spectrum, partial spectrum sensing could be employed by selecting a suitable sensing window (SW). Since the SW selection determines how many available bands to access, the transmission performance after the access could be used to guide the SW selection. Hence, a sophisticated joint design on spectrum sensing and access is necessary, which, however, is a challenging task when considering the dynamic nature of spectrum environment, and also the mutual impact among different secondary users (SUs). In this paper, we propose a joint partial spectrum sensing and power allocation (PA) scheme to facilitate SUs to make the best decisions on SW and PA to maximize the network throughput with reduced mutual interference. Considering the environmental dynamics and spectrum uncertainty, we develop a viable solution based on hierarchical multi-agent deep reinforcement learning (HMADRL). Our solution enables mutual design with two stages: making each SU learn the best SW and PA strategies autonomously while adapting to the dynamic environment. By using both simulated spectrum data and real spectrum data measured by SAM60-BX, we have demonstrated the effectiveness of our proposed scheme.

AB - Dynamic spectrum access (DSA) has been regarded as a viable solution to the spectrum shortage problem. To find idle spectrum, partial spectrum sensing could be employed by selecting a suitable sensing window (SW). Since the SW selection determines how many available bands to access, the transmission performance after the access could be used to guide the SW selection. Hence, a sophisticated joint design on spectrum sensing and access is necessary, which, however, is a challenging task when considering the dynamic nature of spectrum environment, and also the mutual impact among different secondary users (SUs). In this paper, we propose a joint partial spectrum sensing and power allocation (PA) scheme to facilitate SUs to make the best decisions on SW and PA to maximize the network throughput with reduced mutual interference. Considering the environmental dynamics and spectrum uncertainty, we develop a viable solution based on hierarchical multi-agent deep reinforcement learning (HMADRL). Our solution enables mutual design with two stages: making each SU learn the best SW and PA strategies autonomously while adapting to the dynamic environment. By using both simulated spectrum data and real spectrum data measured by SAM60-BX, we have demonstrated the effectiveness of our proposed scheme.

KW - Dynamic spectrum access (DSA)

KW - hierarchical deep reinforcement learning

KW - multi-agent

KW - partial spectrum sensing

KW - power allocation

UR - http://www.scopus.com/inward/record.url?scp=85168652488&partnerID=8YFLogxK

U2 - 10.1109/TWC.2023.3305567

DO - 10.1109/TWC.2023.3305567

M3 - Article

AN - SCOPUS:85168652488

SN - 1536-1276

VL - 23

SP - 3131

EP - 3145

JO - IEEE Transactions on Wireless Communications

JF - IEEE Transactions on Wireless Communications

IS - 4

ER -

Intelligent Spectrum Sensing and Access with Partial Observation Based on Hierarchical Multi-Agent Deep Reinforcement Learning

Abstract

Keywords

Access to Document

Other files and links

Fingerprint

Cite this