Skip to main navigation Skip to search Skip to main content

Heuristic Dual Q-Learning Based Radar Anti-Jamming Decision-Making

  • Shandong University
  • Qilu Institute of Technology
  • Beijing Institute of Technology

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

With the increasing dynamics and complexity of the modern electronic countermeasure environment, the traditional anti-jamming decision-making methods are difficult to meet the real-time and intelligent demands of cognitive radar and some reinforcement learning algorithms have been designed and used in radar anti-jamming decision-making. However, aiming at the overestimation of Q-value and instability of training results for typical Q-learning, this paper proposes a collaborative optimization based on dual Q-tables and heuristic function for antijamming decision-making in cognitive radar system. It separates the action selection and value evaluation process with the help of two independent Q-tables, thus suppressing the valuation bias caused by the common single Q-table. Meanwhile, the heuristic function is dynamically designed to optimize the exploration process based on the optimal action and reward. Simulation results show that the proposed algorithm improves the decisionmaking accuracy by 16% on average compared with the popular Q-learning and 'State-Action-Reward-State-Action' (Sarsa) methods, and the stability of strategy selection is significantly enhanced, providing a reliable solution for the real-time decisionmaking of cognitive radar system in complex electromagnetic environment.

Original languageEnglish
Title of host publication2025 IEEE 25th International Conference on Communication Technology, ICCT 2025
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages1726-1730
Number of pages5
ISBN (Electronic)9798331585785
DOIs
Publication statusPublished - 2025
Externally publishedYes
Event25th IEEE International Conference on Communication Technology, ICCT 2025 - Shenyang, China
Duration: 16 Oct 202518 Oct 2025

Publication series

NameInternational Conference on Communication Technology Proceedings, ICCT
ISSN (Print)2576-7844
ISSN (Electronic)2576-7828

Conference

Conference25th IEEE International Conference on Communication Technology, ICCT 2025
Country/TerritoryChina
CityShenyang
Period16/10/2518/10/25

Keywords

  • Markov decision process
  • Reinforcement learning
  • anti-jamming decision-making
  • cognitive radar

Fingerprint

Dive into the research topics of 'Heuristic Dual Q-Learning Based Radar Anti-Jamming Decision-Making'. Together they form a unique fingerprint.

Cite this