TY - JOUR
T1 - DeepAntiJam
T2 - Stackelberg Game-Oriented Secure Transmission via Deep Reinforcement Learning
AU - Lu, Jianzhong
AU - He, Dongxuan
AU - Wang, Zhaocheng
N1 - Publisher Copyright:
© 1997-2012 IEEE.
PY - 2022/9/1
Y1 - 2022/9/1
N2 - In this letter, we present a novel deep reinforcement learning-assisted anti-jamming transmission scheme (DeepAntiJam) to guarantee reliable and creditable transmission in the presence of one smart jammer and multiple eavesdroppers. Specifically, we formulate secure transmission as a Stackelberg game in which the jammer, as the leader, adaptively adjusts its jamming power while the transmitter, as the follower, selects its transmit power and secrecy rate accordingly. Furthermore, the existence and uniqueness of Stackelberg equilibrium are proved. To achieve the Stackelberg equilibrium when the prior knowledge of jammer is unknown, DeepAntiJam is proposed to improve the secrecy throughput, where a solver neural network is utilized to determine the optimal transmission parameter according to the transmission strategy obtained by deep reinforcement learning. Moreover, transfer learning is introduced into the initialization of DeepAntiJam to avoid unnecessary initial random exploration. Simulation results validate that DeepAntiJam can enhance secrecy throughput significantly under the coexistence of smart jammer.
AB - In this letter, we present a novel deep reinforcement learning-assisted anti-jamming transmission scheme (DeepAntiJam) to guarantee reliable and creditable transmission in the presence of one smart jammer and multiple eavesdroppers. Specifically, we formulate secure transmission as a Stackelberg game in which the jammer, as the leader, adaptively adjusts its jamming power while the transmitter, as the follower, selects its transmit power and secrecy rate accordingly. Furthermore, the existence and uniqueness of Stackelberg equilibrium are proved. To achieve the Stackelberg equilibrium when the prior knowledge of jammer is unknown, DeepAntiJam is proposed to improve the secrecy throughput, where a solver neural network is utilized to determine the optimal transmission parameter according to the transmission strategy obtained by deep reinforcement learning. Moreover, transfer learning is introduced into the initialization of DeepAntiJam to avoid unnecessary initial random exploration. Simulation results validate that DeepAntiJam can enhance secrecy throughput significantly under the coexistence of smart jammer.
KW - Reinforcement learning
KW - Stackelberg game
KW - multiple eavesdroppers
KW - smart jammer
KW - transfer learning
UR - http://www.scopus.com/inward/record.url?scp=85132732571&partnerID=8YFLogxK
U2 - 10.1109/LCOMM.2022.3182001
DO - 10.1109/LCOMM.2022.3182001
M3 - Article
AN - SCOPUS:85132732571
SN - 1089-7798
VL - 26
SP - 1984
EP - 1988
JO - IEEE Communications Letters
JF - IEEE Communications Letters
IS - 9
ER -