A Discriminative Convolutional Neural Network with Context-Aware Attention

Yuxiang Zhou, Lejian Liao, Yang Gao*, Heyan Huang, Xiaochi Wei

*Corresponding author for this work

Research output: Contribution to journal, Article, peer-review

4 Citations (Scopus)

Abstract

Feature representation and feature extraction are two crucial procedures in text mining. Convolutional Neural Networks (CNNs) have been highly successful in text-mining tasks because they efficiently extract n-gram features from source data. However, the vanilla CNN has weaknesses in both feature representation and feature extraction. Some of the filters in a CNN are inevitably duplicated, which hinders a discriminative representation of a given text. In addition, most existing CNN models extract features in a fixed way (i.e., max pooling), which either limits the CNN to a local optimum or ignores the relations among all features, leaving the model unable to learn contextual n-gram features adaptively. In this article, we propose a discriminative CNN with context-aware attention to address these weaknesses of the vanilla CNN. Specifically, our model encourages discrimination across different filters by maximizing their earth mover distances, and it estimates the salience of feature candidates by considering the relations between context features. We carefully validate our findings against baselines on five benchmark classification datasets and two summarization datasets. The experimental results verify the competitive performance of the proposed model.
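The two ideas named in the abstract can be illustrated with a toy sketch. The code below is not the authors' implementation: `attention_pool` uses a hypothetical salience score (each feature candidate multiplied by the mean of all candidates, standing in for "context"), and `emd_1d` computes the earth mover's distance only for the simple case of two 1-D histograms of equal mass on the same unit-spaced bins. All function names and the scoring rule are assumptions made for illustration.

```python
import math


def max_pool(features):
    """Fixed extraction: keep only the single largest n-gram feature."""
    return max(features)


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attention_pool(features):
    """Hypothetical context-aware pooling (an assumption, not the paper's rule).

    Each candidate is scored against the mean of all candidates, and the
    output is the softmax-weighted sum, so every candidate can contribute.
    """
    context = sum(features) / len(features)
    weights = softmax([f * context for f in features])
    return sum(w * f for w, f in zip(weights, features))


def emd_1d(p, q):
    """Earth mover's distance between two equal-mass 1-D histograms.

    For unit-spaced bins this is the sum of absolute differences
    between the two cumulative sums.
    """
    dist, cum_p, cum_q = 0.0, 0.0, 0.0
    for a, b in zip(p, q):
        cum_p += a
        cum_q += b
        dist += abs(cum_p - cum_q)
    return dist


if __name__ == "__main__":
    feats = [0.2, 0.9, 0.4, 0.8]
    print("max pooling:      ", max_pool(feats))
    print("attention pooling:", attention_pool(feats))
    # Identical "filters" have distance 0; a diversity objective would
    # push this value up for every pair of filters.
    print("EMD identical:    ", emd_1d([0.5, 0.5, 0.0], [0.5, 0.5, 0.0]))
    print("EMD different:    ", emd_1d([1.0, 0.0, 0.0], [0.0, 0.0, 1.0]))
```

Unlike max pooling, the attention-pooled value depends on every candidate, which is the general property the abstract attributes to considering relations between context features. A training objective in the spirit of the abstract would add a term that rewards large pairwise distances between filters.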

Original language: English
Article number: 57
Journal: ACM Transactions on Intelligent Systems and Technology
Volume: 11
Issue number: 5
Publication status: Published - Sept 2020

Keywords

  • Text mining
  • attention method
  • convolution neural networks
