A Unified Framework Combining Feature Extraction and Reward Estimation for Cognitive Radar Policy Prediction

Luyao Zhang, Jiadi Bao, Mengtao Zhu*, Chuan Huang

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract

The increasingly flexible decision-making process of modern cognitive radar (CR) systems has placed considerable challenges on electronic reconnaissance systems or radar warning receivers, which must help to determine the appropriate countermeasure methods for these radar systems in modern complex electromagnetic environments. It is necessary to accurately predict CR’s internal action policy according to the interaction observations between CR and receivers under noncooperative scenarios. Such that useful adversarial information can be extracted by reconnaissance systems for afterward processing and analysis. From the perspective of the reconnaissance side, this work presents a unified CR action policy prediction framework that combines feature extraction and reward estimation processes without much prior knowledge about CR’s decision-making process in terms of CR reward structure and components. Focusing on the CR performing continuous control tasks, a method based on Bayesian nonparametric (BNP) theory is designed for automatic feature extraction from observed demonstration trajectories that are collected in various CR tasks. The proposed BNP method can significantly improve the multidimensional information representation ability of features. Further, a deep inverse reinforcement learning (DIRL) method is designed to estimate CR’s internal reward functions without the knowledge of function structures. The performance of the proposed method is evaluated and verified under simulated CR target tracking scenarios. Experimental results showed the superiority and effectiveness of the proposed method.

Original languageEnglish
Pages (from-to)12320-12333
Number of pages14
JournalIEEE Transactions on Aerospace and Electronic Systems
Volume61
Issue number5
DOIs
Publication statusPublished - Oct 2025
Externally publishedYes

Keywords

  • Bayesian nonparametric (BNP) model
  • cognitive radar (CR)
  • deep neural network
  • feature extraction
  • inverse reinforcement learning (IRL)

Fingerprint

Dive into the research topics of 'A Unified Framework Combining Feature Extraction and Reward Estimation for Cognitive Radar Policy Prediction'. Together they form a unique fingerprint.

Cite this