Skip to main navigation Skip to search Skip to main content

Modulation feature enhancement with a multi-stage attention network for underwater acoustic target recognition

  • Jiaping Yu
  • , Shefeng Yan*
  • , Linlin Mao
  • , Zeping Sui
  • , Chunjin Jiang
  • *Corresponding author for this work
  • CAS - Institute of Acoustics
  • University of Chinese Academy of Sciences
  • University of Essex

Research output: Contribution to journalArticlepeer-review

Abstract

Underwater acoustic target recognition is critical for maritime applications, yet it faces challenges arising from the complex and diverse nature of ship-radiated noise. To address these issues, we propose a robust deep learning-based framework. First, we introduce a feature extraction and fusion method based on variational mode decomposition (VMD) and the 3/2-D spectrum to generate high-fidelity 2-D DEMON spectral features, which effectively capture modulation envelope information. To further enhance feature representation, we design a one-dimensional convolutional neural network (1-D CNN) integrated with a novel Multi-Stage Multi-Type Attention Mechanism (MMATT) that adaptively refines features at different network depths. Within this mechanism, we propose a Residual Channel-Independent Spectral Attention Mechanism (R-CISAM) and a Multi-Scale Separate-and-Fuse Spectral Attention Mechanism (MS-SFSAM). Moreover, to mitigate performance degradation caused by severe class imbalance inherent in real-world ship-radiated noise data, we devise an Adjustable Class-Balanced Focal Loss (ACBFL), which provides flexibility across tasks with varying degrees of imbalance. Experimental results on a real-world ship-radiated noise dataset demonstrate that the proposed solutions effectively enhance underwater acoustic target recognition performance.

Original languageEnglish
Article number110669
JournalSignal Processing
Volume247
DOIs
Publication statusPublished - Oct 2026
Externally publishedYes

Keywords

  • Attention mechanism
  • Class imbalance
  • Deep learning
  • Feature fusion
  • Ship-radiated noise
  • Underwater acoustic target recognition

Fingerprint

Dive into the research topics of 'Modulation feature enhancement with a multi-stage attention network for underwater acoustic target recognition'. Together they form a unique fingerprint.

Cite this