TY - JOUR
T1 - GC-DRQN
T2 - Enhancing Radar Anti-Jamming Performance With Supervised Auxiliary Tasks and Deterministic Rewards
AU - Zhang, Jiaxiang
AU - Wang, Bo
AU - Liang, Zhennan
AU - Fan, Huayu
AU - Liu, Quanhua
N1 - Publisher Copyright:
© 2005-2012 IEEE.
PY - 2025
Y1 - 2025
N2 - Self-defense suppression jammers pose a critical threat to radar by adaptively altering jamming frequencies based on intercepted radar pulses, which can mask the real targets. An effective countermeasure is to transmit a cover pulse before the detection pulse to deceive the jammer. In order to maximize radar detection performance while ensuring successful anti-jamming, reinforcement learning (RL) methods are employed to dynamically adjust the width ratio between the cover pulse and the detection pulse based on the jamming state and reward feedback. However, unknown jammer interception durations and interception-jamming cycles, along with the random feedback affected by noise, pose significant challenges to the pulse width selection based on RL methods. Inspired by biological intelligence to enhance RL, we propose a supervised learning (SL)-based general auxiliary task framework that emulates the spatiotemporal encoding characteristics of grid cells and time cells in mammalian brains to extract richer and more structured environmental information. Building on this, we introduce a flexible grid cell-deep recurrent Q-network (GC-DRQN) architecture, integrating SL and RL, which improves the performance of RL in handling tasks with temporal dependencies. Additionally, we implement a deterministic equivalent reward mechanism to overcome the instability in the RL convergence process caused by random rewards. Simulation results demonstrate that the pulse transmission strategy learned by GC-DRQN achieves significantly higher target detection probabilities compared to several baseline methods. Notably, in a low signal-to-noise ratio (SNR) scenario, GC-DRQN improves the target detection probability and convergence speed of DRQN by up to twofold.
AB - Self-defense suppression jammers pose a critical threat to radar by adaptively altering jamming frequencies based on intercepted radar pulses, which can mask the real targets. An effective countermeasure is to transmit a cover pulse before the detection pulse to deceive the jammer. In order to maximize radar detection performance while ensuring successful anti-jamming, reinforcement learning (RL) methods are employed to dynamically adjust the width ratio between the cover pulse and the detection pulse based on the jamming state and reward feedback. However, unknown jammer interception durations and interception-jamming cycles, along with the random feedback affected by noise, pose significant challenges to the pulse width selection based on RL methods. Inspired by biological intelligence to enhance RL, we propose a supervised learning (SL)-based general auxiliary task framework that emulates the spatiotemporal encoding characteristics of grid cells and time cells in mammalian brains to extract richer and more structured environmental information. Building on this, we introduce a flexible grid cell-deep recurrent Q-network (GC-DRQN) architecture, integrating SL and RL, which improves the performance of RL in handling tasks with temporal dependencies. Additionally, we implement a deterministic equivalent reward mechanism to overcome the instability in the RL convergence process caused by random rewards. Simulation results demonstrate that the pulse transmission strategy learned by GC-DRQN achieves significantly higher target detection probabilities compared to several baseline methods. Notably, in a low signal-to-noise ratio (SNR) scenario, GC-DRQN improves the target detection probability and convergence speed of DRQN by up to twofold.
KW - Cover pulse width decision
KW - deep reinforcement learning
KW - grid network
KW - radar anti-jamming
KW - supervised auxiliary task
UR - https://www.scopus.com/pages/publications/105020740213
U2 - 10.1109/TIFS.2025.3627879
DO - 10.1109/TIFS.2025.3627879
M3 - Article
AN - SCOPUS:105020740213
SN - 1556-6013
VL - 20
SP - 13416
EP - 13430
JO - IEEE Transactions on Information Forensics and Security
JF - IEEE Transactions on Information Forensics and Security
ER -