TY - JOUR
T1 - Multi-channel opportunistic spectrum access
T2 - A mixed-scale decision perspective
AU - Shen, Helong
AU - Wang, Kehao
AU - Yu, Jihong
AU - Chen, Lin
N1 - Publisher Copyright:
© 2021 Elsevier B.V.
PY - 2022/2/15
Y1 - 2022/2/15
N2 - Consider a slot-based opportunistic communication system consisting of one transmitter, one receiver, and N two-state Markov channels. In each K continuous time slots on a large time scale, the transmitter probes one of N channels and chooses one to access in each time slot of the K time slots on a small time scale. For each successful access, one unit of reward is obtained. To maximize the cumulated reward over a time horizon of T, the joint probing (on a large time scale) and accessing (on a small time scale) problem can be cast into a mixed-scale partially observable Markovian decision process which is proved to PSPACE-Hard. Then the mixed-scale sequential decision-making problem is simplified into a probing decision problem on a large time scale. Considering the huge computing complexity of the large-scale probing decision, we present a simple heuristic policy which is to probe the best or the second-best channel in terms of available probability under different probing conditions regarding missing detection rate and false alarm one. Next, we derive several sets of sufficient conditions for different scenarios under which the proposed heuristic policy is optimal. Finally, the results of numerical experiments verify our theoretical analysis.
AB - Consider a slot-based opportunistic communication system consisting of one transmitter, one receiver, and N two-state Markov channels. In each K continuous time slots on a large time scale, the transmitter probes one of N channels and chooses one to access in each time slot of the K time slots on a small time scale. For each successful access, one unit of reward is obtained. To maximize the cumulated reward over a time horizon of T, the joint probing (on a large time scale) and accessing (on a small time scale) problem can be cast into a mixed-scale partially observable Markovian decision process which is proved to PSPACE-Hard. Then the mixed-scale sequential decision-making problem is simplified into a probing decision problem on a large time scale. Considering the huge computing complexity of the large-scale probing decision, we present a simple heuristic policy which is to probe the best or the second-best channel in terms of available probability under different probing conditions regarding missing detection rate and false alarm one. Next, we derive several sets of sufficient conditions for different scenarios under which the proposed heuristic policy is optimal. Finally, the results of numerical experiments verify our theoretical analysis.
KW - Mixed-scale
KW - Myopic policy
KW - Opportunistic access
KW - Optimality
KW - POMDP
UR - http://www.scopus.com/inward/record.url?scp=85122197030&partnerID=8YFLogxK
U2 - 10.1016/j.comcom.2021.12.012
DO - 10.1016/j.comcom.2021.12.012
M3 - Article
AN - SCOPUS:85122197030
SN - 0140-3664
VL - 184
SP - 118
EP - 127
JO - Computer Communications
JF - Computer Communications
ER -