A Discriminative Convolutional Neural Network with Context-Aware Attention

Yuxiang Zhou; Lejian Liao; Yang Gao; Heyan Huang; Xiaochi Wei

doi:10.1145/3397464

A Discriminative Convolutional Neural Network with Context-Aware Attention

Yuxiang Zhou, Lejian Liao, Yang Gao^*, Heyan Huang, Xiaochi Wei

^*此作品的通讯作者

计算机学院

科研成果: 期刊稿件 › 文章 › 同行评审

4 引用（Scopus）

摘要

Feature representation and feature extraction are two crucial procedures in text mining. Convolutional Neural Networks (CNN) have shown overwhelming success for text-mining tasks, since they are capable of efficiently extracting n-gram features from source data. However, vanilla CNN has its own weaknesses on feature representation and feature extraction. A certain amount of filters in CNN are inevitably duplicate and thus hinder to discriminatively represent a given text. In addition, most existing CNN models extract features in a fixed way (i.e., max pooling) that either limit the CNN to local optimum nor without considering the relation between all features, thereby unable to learn a contextual n-gram features adaptively. In this article, we propose a discriminative CNN with context-Aware attention to solve the challenges of vanilla CNN. Specifically, our model mainly encourages discrimination across different filters via maximizing their earth mover distances and estimates the salience of feature candidates by considering the relation between context features. We validate carefully our findings against baselines on five benchmark datasets of classification and two datasets of summarization. The results of the experiments verify the competitive performance of our proposed model.

源语言	英语
文章编号	57
期刊	ACM Transactions on Intelligent Systems and Technology
卷	11
期	5
DOI	https://doi.org/10.1145/3397464
出版状态	已出版 - 9月 2020

访问文件

10.1145/3397464

其它文件与链接

链接到 Scopus 的出版物

引用此

Zhou, Y., Liao, L., Gao, Y., Huang, H., & Wei, X. (2020). A Discriminative Convolutional Neural Network with Context-Aware Attention. ACM Transactions on Intelligent Systems and Technology, 11(5), 文章 57. https://doi.org/10.1145/3397464

@article{3919e7eac5ba48349dc82c08118fee42,

title = "A Discriminative Convolutional Neural Network with Context-Aware Attention",

abstract = "Feature representation and feature extraction are two crucial procedures in text mining. Convolutional Neural Networks (CNN) have shown overwhelming success for text-mining tasks, since they are capable of efficiently extracting n-gram features from source data. However, vanilla CNN has its own weaknesses on feature representation and feature extraction. A certain amount of filters in CNN are inevitably duplicate and thus hinder to discriminatively represent a given text. In addition, most existing CNN models extract features in a fixed way (i.e., max pooling) that either limit the CNN to local optimum nor without considering the relation between all features, thereby unable to learn a contextual n-gram features adaptively. In this article, we propose a discriminative CNN with context-Aware attention to solve the challenges of vanilla CNN. Specifically, our model mainly encourages discrimination across different filters via maximizing their earth mover distances and estimates the salience of feature candidates by considering the relation between context features. We validate carefully our findings against baselines on five benchmark datasets of classification and two datasets of summarization. The results of the experiments verify the competitive performance of our proposed model.",

keywords = "Text mining, attention method, convolution neural networks",

author = "Yuxiang Zhou and Lejian Liao and Yang Gao and Heyan Huang and Xiaochi Wei",

note = "Publisher Copyright: {\textcopyright} 2020 ACM.",

year = "2020",

month = sep,

doi = "10.1145/3397464",

language = "English",

volume = "11",

journal = "ACM Transactions on Intelligent Systems and Technology",

issn = "2157-6904",

publisher = "Association for Computing Machinery (ACM)",

number = "5",

}

TY - JOUR

T1 - A Discriminative Convolutional Neural Network with Context-Aware Attention

AU - Zhou, Yuxiang

AU - Liao, Lejian

AU - Gao, Yang

AU - Huang, Heyan

AU - Wei, Xiaochi

PY - 2020/9

Y1 - 2020/9

N2 - Feature representation and feature extraction are two crucial procedures in text mining. Convolutional Neural Networks (CNN) have shown overwhelming success for text-mining tasks, since they are capable of efficiently extracting n-gram features from source data. However, vanilla CNN has its own weaknesses on feature representation and feature extraction. A certain amount of filters in CNN are inevitably duplicate and thus hinder to discriminatively represent a given text. In addition, most existing CNN models extract features in a fixed way (i.e., max pooling) that either limit the CNN to local optimum nor without considering the relation between all features, thereby unable to learn a contextual n-gram features adaptively. In this article, we propose a discriminative CNN with context-Aware attention to solve the challenges of vanilla CNN. Specifically, our model mainly encourages discrimination across different filters via maximizing their earth mover distances and estimates the salience of feature candidates by considering the relation between context features. We validate carefully our findings against baselines on five benchmark datasets of classification and two datasets of summarization. The results of the experiments verify the competitive performance of our proposed model.

AB - Feature representation and feature extraction are two crucial procedures in text mining. Convolutional Neural Networks (CNN) have shown overwhelming success for text-mining tasks, since they are capable of efficiently extracting n-gram features from source data. However, vanilla CNN has its own weaknesses on feature representation and feature extraction. A certain amount of filters in CNN are inevitably duplicate and thus hinder to discriminatively represent a given text. In addition, most existing CNN models extract features in a fixed way (i.e., max pooling) that either limit the CNN to local optimum nor without considering the relation between all features, thereby unable to learn a contextual n-gram features adaptively. In this article, we propose a discriminative CNN with context-Aware attention to solve the challenges of vanilla CNN. Specifically, our model mainly encourages discrimination across different filters via maximizing their earth mover distances and estimates the salience of feature candidates by considering the relation between context features. We validate carefully our findings against baselines on five benchmark datasets of classification and two datasets of summarization. The results of the experiments verify the competitive performance of our proposed model.

KW - Text mining

KW - attention method

KW - convolution neural networks

UR - http://www.scopus.com/inward/record.url?scp=85091022316&partnerID=8YFLogxK

U2 - 10.1145/3397464

DO - 10.1145/3397464

M3 - Article

AN - SCOPUS:85091022316

SN - 2157-6904

VL - 11

JO - ACM Transactions on Intelligent Systems and Technology

JF - ACM Transactions on Intelligent Systems and Technology

IS - 5

M1 - 57

ER -

A Discriminative Convolutional Neural Network with Context-Aware Attention

摘要

访问文件

其它文件与链接

指纹

引用此