Extracting salient features from convolutional discriminative filters

Yuxiang Zhou; Lejian Liao; Yang Gao; Heyan Huang

doi:10.1016/j.ins.2020.12.084

Extracting salient features from convolutional discriminative filters

Yuxiang Zhou, Lejian Liao, Yang Gao^*, Heyan Huang

^*此作品的通讯作者

计算机学院

科研成果: 期刊稿件 › 文章 › 同行评审

3 引用（Scopus）

摘要

Convolutional neural networks (CNN) have been widely used in various tasks, largely due to their ability to efficiently extract n-gram features for text analysis and document representation. In this paper, we intend to insight the CNN model regarding its capability on text analysis. Vanilla CNNs do have weaknesses when it comes to the representation and feature extraction. Duplicate filters are inevitable with vanilla CNNs, which reduces the discriminative power of the representations. In addition, the current pooling operations either limit the CNN to the local optimum (i.e., max pooling) or they do not consider the importance of all features (i.e., mean pooling). In this paper, we propose two modules for vanilla CNNs to overcome these shortcomings. The first equips the CNN with discriminative filters (distinct filters with maximised divergence) and the second provides the ability to comprehensively extract all salient features. Specifically, our model increases the discriminative power of the model by maximizing the distance between different filters, and a novel global pooling mechanism for feature extraction. Validation tests against state-of-the-art baselines on five benchmark classification datasets achieve the competitive performance of our proposed model. Furthermore, visualization on upgrade filters and pooling features verify our hypothesis that the proposed model can receive discriminative filters and salient features.

源语言	英语
页（从-至）	265-279
页数	15
期刊	Information Sciences
卷	558
DOI	https://doi.org/10.1016/j.ins.2020.12.084
出版状态	已出版 - 5月 2021

访问文件

10.1016/j.ins.2020.12.084

其它文件与链接

链接到 Scopus 的出版物

引用此

@article{174f07896eb54c8780b81f62e9a396c2,

title = "Extracting salient features from convolutional discriminative filters",

abstract = "Convolutional neural networks (CNN) have been widely used in various tasks, largely due to their ability to efficiently extract n-gram features for text analysis and document representation. In this paper, we intend to insight the CNN model regarding its capability on text analysis. Vanilla CNNs do have weaknesses when it comes to the representation and feature extraction. Duplicate filters are inevitable with vanilla CNNs, which reduces the discriminative power of the representations. In addition, the current pooling operations either limit the CNN to the local optimum (i.e., max pooling) or they do not consider the importance of all features (i.e., mean pooling). In this paper, we propose two modules for vanilla CNNs to overcome these shortcomings. The first equips the CNN with discriminative filters (distinct filters with maximised divergence) and the second provides the ability to comprehensively extract all salient features. Specifically, our model increases the discriminative power of the model by maximizing the distance between different filters, and a novel global pooling mechanism for feature extraction. Validation tests against state-of-the-art baselines on five benchmark classification datasets achieve the competitive performance of our proposed model. Furthermore, visualization on upgrade filters and pooling features verify our hypothesis that the proposed model can receive discriminative filters and salient features.",

keywords = "Convolutional neural network (CNN), Discriminative filters, Document representation, Pooling mechanism, Salient feature, Text classification",

author = "Yuxiang Zhou and Lejian Liao and Yang Gao and Heyan Huang",

note = "Publisher Copyright: {\textcopyright} 2021 The Author(s)",

year = "2021",

month = may,

doi = "10.1016/j.ins.2020.12.084",

language = "English",

volume = "558",

pages = "265--279",

journal = "Information Sciences",

issn = "0020-0255",

publisher = "Elsevier Inc.",

}

TY - JOUR

T1 - Extracting salient features from convolutional discriminative filters

AU - Zhou, Yuxiang

AU - Liao, Lejian

AU - Gao, Yang

AU - Huang, Heyan

PY - 2021/5

Y1 - 2021/5

N2 - Convolutional neural networks (CNN) have been widely used in various tasks, largely due to their ability to efficiently extract n-gram features for text analysis and document representation. In this paper, we intend to insight the CNN model regarding its capability on text analysis. Vanilla CNNs do have weaknesses when it comes to the representation and feature extraction. Duplicate filters are inevitable with vanilla CNNs, which reduces the discriminative power of the representations. In addition, the current pooling operations either limit the CNN to the local optimum (i.e., max pooling) or they do not consider the importance of all features (i.e., mean pooling). In this paper, we propose two modules for vanilla CNNs to overcome these shortcomings. The first equips the CNN with discriminative filters (distinct filters with maximised divergence) and the second provides the ability to comprehensively extract all salient features. Specifically, our model increases the discriminative power of the model by maximizing the distance between different filters, and a novel global pooling mechanism for feature extraction. Validation tests against state-of-the-art baselines on five benchmark classification datasets achieve the competitive performance of our proposed model. Furthermore, visualization on upgrade filters and pooling features verify our hypothesis that the proposed model can receive discriminative filters and salient features.

AB - Convolutional neural networks (CNN) have been widely used in various tasks, largely due to their ability to efficiently extract n-gram features for text analysis and document representation. In this paper, we intend to insight the CNN model regarding its capability on text analysis. Vanilla CNNs do have weaknesses when it comes to the representation and feature extraction. Duplicate filters are inevitable with vanilla CNNs, which reduces the discriminative power of the representations. In addition, the current pooling operations either limit the CNN to the local optimum (i.e., max pooling) or they do not consider the importance of all features (i.e., mean pooling). In this paper, we propose two modules for vanilla CNNs to overcome these shortcomings. The first equips the CNN with discriminative filters (distinct filters with maximised divergence) and the second provides the ability to comprehensively extract all salient features. Specifically, our model increases the discriminative power of the model by maximizing the distance between different filters, and a novel global pooling mechanism for feature extraction. Validation tests against state-of-the-art baselines on five benchmark classification datasets achieve the competitive performance of our proposed model. Furthermore, visualization on upgrade filters and pooling features verify our hypothesis that the proposed model can receive discriminative filters and salient features.

KW - Convolutional neural network (CNN)

KW - Discriminative filters

KW - Document representation

KW - Pooling mechanism

KW - Salient feature

KW - Text classification

UR - http://www.scopus.com/inward/record.url?scp=85100880513&partnerID=8YFLogxK

U2 - 10.1016/j.ins.2020.12.084

DO - 10.1016/j.ins.2020.12.084

M3 - Article

AN - SCOPUS:85100880513

SN - 0020-0255

VL - 558

SP - 265

EP - 279

JO - Information Sciences

JF - Information Sciences

ER -

Extracting salient features from convolutional discriminative filters

摘要

访问文件

其它文件与链接

指纹

引用此