Extracting salient features from convolutional discriminative filters

Yuxiang Zhou, Lejian Liao, Yang Gao*, Heyan Huang

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

3 Citations (Scopus)

Abstract

Convolutional neural networks (CNN) have been widely used in various tasks, largely due to their ability to efficiently extract n-gram features for text analysis and document representation. In this paper, we intend to insight the CNN model regarding its capability on text analysis. Vanilla CNNs do have weaknesses when it comes to the representation and feature extraction. Duplicate filters are inevitable with vanilla CNNs, which reduces the discriminative power of the representations. In addition, the current pooling operations either limit the CNN to the local optimum (i.e., max pooling) or they do not consider the importance of all features (i.e., mean pooling). In this paper, we propose two modules for vanilla CNNs to overcome these shortcomings. The first equips the CNN with discriminative filters (distinct filters with maximised divergence) and the second provides the ability to comprehensively extract all salient features. Specifically, our model increases the discriminative power of the model by maximizing the distance between different filters, and a novel global pooling mechanism for feature extraction. Validation tests against state-of-the-art baselines on five benchmark classification datasets achieve the competitive performance of our proposed model. Furthermore, visualization on upgrade filters and pooling features verify our hypothesis that the proposed model can receive discriminative filters and salient features.

Original languageEnglish
Pages (from-to)265-279
Number of pages15
JournalInformation Sciences
Volume558
DOIs
Publication statusPublished - May 2021

Keywords

  • Convolutional neural network (CNN)
  • Discriminative filters
  • Document representation
  • Pooling mechanism
  • Salient feature
  • Text classification

Fingerprint

Dive into the research topics of 'Extracting salient features from convolutional discriminative filters'. Together they form a unique fingerprint.

Cite this