Densely Connected Bidirectional LSTM with Max-Pooling of CNN Network for Text Classification

Qinghong Jiang; Huaping Zhang; Jianyun Shang; Ian Wesson; ENlin N. Li

doi:10.1007/978-3-030-65390-3_8

Densely Connected Bidirectional LSTM with Max-Pooling of CNN Network for Text Classification

Qinghong Jiang^*, Huaping Zhang, Jianyun Shang, Ian Wesson, ENlin N. Li

^*此作品的通讯作者

计算机学院

科研成果: 书/报告/会议事项章节 › 会议稿件 › 同行评审

摘要

Text classification is a fundamental task in natural language processing (NLP). Context semantics can greatly improve the accuracy of text classification tasks. Although there are some popular methods in obtaining semantics, current context semantic analysis techniques, due to limited accuracy, are still a great bottleneck for text classification. This paper introduces a novel model, the densely connected Bidirectional LSTM with Max-pooling of CNN network (Dense-BiLSTM-MP), which greatly enhances the context of semantic information. In this model, a densely connected bidirectional long short-term memory (BiLSTM) model, as well as multiple max-pooling layers of convolutional network, are applied to obtain an increasingly enhanced assessment of context, and extract the key features, respectively. Experiments were conducted on four public datasets: YELP, 20NewsGroup, THUNews and AG. The experimental results show that the proposed model outperforms state of the art methods on several datasets. Furthermore, discussions on the Dense-BiLSTM-MP model’s performance in short texts and long texts were given, respectively.

源语言	英语
主期刊名	Advanced Data Mining and Applications - 16th International Conference, ADMA 2020, Proceedings
编辑	Xiaochun Yang, Chang-Dong Wang, Md. Saiful Islam, Zheng Zhang
出版商	Springer Science and Business Media Deutschland GmbH
页	98-113
页数	16
ISBN（印刷版）	9783030653897
DOI	https://doi.org/10.1007/978-3-030-65390-3_8
出版状态	已出版 - 2020
活动	16th International Conference on Advanced Data Mining and Applications, ADMA 2020 - Foshan, 中国期限: 12 11月 2020 → 14 11月 2020

出版系列

姓名	Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
卷	12447 LNAI
ISSN（印刷版）	0302-9743
ISSN（电子版）	1611-3349

会议

会议	16th International Conference on Advanced Data Mining and Applications, ADMA 2020
国家/地区	中国
市	Foshan
时期	12/11/20 → 14/11/20

访问文件

10.1007/978-3-030-65390-3_8

其它文件与链接

链接到 Scopus 的出版物

引用此

Jiang, Q., Zhang, H., Shang, J., Wesson, I., & Li, EN. N. (2020). Densely Connected Bidirectional LSTM with Max-Pooling of CNN Network for Text Classification. 在 X. Yang, C.-D. Wang, M. S. Islam, & Z. Zhang (编辑), Advanced Data Mining and Applications - 16th International Conference, ADMA 2020, Proceedings (页码 98-113). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); 卷 12447 LNAI). Springer Science and Business Media Deutschland GmbH. https://doi.org/10.1007/978-3-030-65390-3_8

Jiang, Qinghong ; Zhang, Huaping ; Shang, Jianyun 等. / Densely Connected Bidirectional LSTM with Max-Pooling of CNN Network for Text Classification. Advanced Data Mining and Applications - 16th International Conference, ADMA 2020, Proceedings. 编辑 / Xiaochun Yang ; Chang-Dong Wang ; Md. Saiful Islam ; Zheng Zhang. Springer Science and Business Media Deutschland GmbH, 2020. 页码 98-113 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)).

@inproceedings{893fc285cc3f4dc0ade45d3a64bef307,

title = "Densely Connected Bidirectional LSTM with Max-Pooling of CNN Network for Text Classification",

abstract = "Text classification is a fundamental task in natural language processing (NLP). Context semantics can greatly improve the accuracy of text classification tasks. Although there are some popular methods in obtaining semantics, current context semantic analysis techniques, due to limited accuracy, are still a great bottleneck for text classification. This paper introduces a novel model, the densely connected Bidirectional LSTM with Max-pooling of CNN network (Dense-BiLSTM-MP), which greatly enhances the context of semantic information. In this model, a densely connected bidirectional long short-term memory (BiLSTM) model, as well as multiple max-pooling layers of convolutional network, are applied to obtain an increasingly enhanced assessment of context, and extract the key features, respectively. Experiments were conducted on four public datasets: YELP, 20NewsGroup, THUNews and AG. The experimental results show that the proposed model outperforms state of the art methods on several datasets. Furthermore, discussions on the Dense-BiLSTM-MP model{\textquoteright}s performance in short texts and long texts were given, respectively.",

keywords = "Deep learning, Dense structure, Text classification",

author = "Qinghong Jiang and Huaping Zhang and Jianyun Shang and Ian Wesson and Li, {ENlin N.}",

note = "Publisher Copyright: {\textcopyright} 2020, Springer Nature Switzerland AG.; 16th International Conference on Advanced Data Mining and Applications, ADMA 2020 ; Conference date: 12-11-2020 Through 14-11-2020",

year = "2020",

doi = "10.1007/978-3-030-65390-3_8",

language = "English",

isbn = "9783030653897",

series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",

publisher = "Springer Science and Business Media Deutschland GmbH",

pages = "98--113",

editor = "Xiaochun Yang and Chang-Dong Wang and Islam, {Md. Saiful} and Zheng Zhang",

booktitle = "Advanced Data Mining and Applications - 16th International Conference, ADMA 2020, Proceedings",

address = "Germany",

}

Jiang, Q, Zhang, H, Shang, J, Wesson, I & Li, ENN 2020, Densely Connected Bidirectional LSTM with Max-Pooling of CNN Network for Text Classification. 在 X Yang, C-D Wang, MS Islam & Z Zhang (编辑), Advanced Data Mining and Applications - 16th International Conference, ADMA 2020, Proceedings. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 卷 12447 LNAI, Springer Science and Business Media Deutschland GmbH, 页码 98-113, 16th International Conference on Advanced Data Mining and Applications, ADMA 2020, Foshan, 中国, 12/11/20. https://doi.org/10.1007/978-3-030-65390-3_8

Densely Connected Bidirectional LSTM with Max-Pooling of CNN Network for Text Classification. / Jiang, Qinghong; Zhang, Huaping; Shang, Jianyun 等.
Advanced Data Mining and Applications - 16th International Conference, ADMA 2020, Proceedings. 编辑 / Xiaochun Yang; Chang-Dong Wang; Md. Saiful Islam; Zheng Zhang. Springer Science and Business Media Deutschland GmbH, 2020. 页码 98-113 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); 卷 12447 LNAI).

科研成果: 书/报告/会议事项章节 › 会议稿件 › 同行评审

TY - GEN

T1 - Densely Connected Bidirectional LSTM with Max-Pooling of CNN Network for Text Classification

AU - Jiang, Qinghong

AU - Zhang, Huaping

AU - Shang, Jianyun

AU - Wesson, Ian

AU - Li, ENlin N.

PY - 2020

Y1 - 2020

N2 - Text classification is a fundamental task in natural language processing (NLP). Context semantics can greatly improve the accuracy of text classification tasks. Although there are some popular methods in obtaining semantics, current context semantic analysis techniques, due to limited accuracy, are still a great bottleneck for text classification. This paper introduces a novel model, the densely connected Bidirectional LSTM with Max-pooling of CNN network (Dense-BiLSTM-MP), which greatly enhances the context of semantic information. In this model, a densely connected bidirectional long short-term memory (BiLSTM) model, as well as multiple max-pooling layers of convolutional network, are applied to obtain an increasingly enhanced assessment of context, and extract the key features, respectively. Experiments were conducted on four public datasets: YELP, 20NewsGroup, THUNews and AG. The experimental results show that the proposed model outperforms state of the art methods on several datasets. Furthermore, discussions on the Dense-BiLSTM-MP model’s performance in short texts and long texts were given, respectively.

AB - Text classification is a fundamental task in natural language processing (NLP). Context semantics can greatly improve the accuracy of text classification tasks. Although there are some popular methods in obtaining semantics, current context semantic analysis techniques, due to limited accuracy, are still a great bottleneck for text classification. This paper introduces a novel model, the densely connected Bidirectional LSTM with Max-pooling of CNN network (Dense-BiLSTM-MP), which greatly enhances the context of semantic information. In this model, a densely connected bidirectional long short-term memory (BiLSTM) model, as well as multiple max-pooling layers of convolutional network, are applied to obtain an increasingly enhanced assessment of context, and extract the key features, respectively. Experiments were conducted on four public datasets: YELP, 20NewsGroup, THUNews and AG. The experimental results show that the proposed model outperforms state of the art methods on several datasets. Furthermore, discussions on the Dense-BiLSTM-MP model’s performance in short texts and long texts were given, respectively.

KW - Deep learning

KW - Dense structure

KW - Text classification

UR - http://www.scopus.com/inward/record.url?scp=85101881077&partnerID=8YFLogxK

U2 - 10.1007/978-3-030-65390-3_8

DO - 10.1007/978-3-030-65390-3_8

M3 - Conference contribution

AN - SCOPUS:85101881077

SN - 9783030653897

T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SP - 98

EP - 113

BT - Advanced Data Mining and Applications - 16th International Conference, ADMA 2020, Proceedings

A2 - Yang, Xiaochun

A2 - Wang, Chang-Dong

A2 - Islam, Md. Saiful

A2 - Zhang, Zheng

PB - Springer Science and Business Media Deutschland GmbH

T2 - 16th International Conference on Advanced Data Mining and Applications, ADMA 2020

Y2 - 12 November 2020 through 14 November 2020

ER -

Jiang Q, Zhang H, Shang J, Wesson I, Li ENN. Densely Connected Bidirectional LSTM with Max-Pooling of CNN Network for Text Classification. 在 Yang X, Wang CD, Islam MS, Zhang Z, 编辑, Advanced Data Mining and Applications - 16th International Conference, ADMA 2020, Proceedings. Springer Science and Business Media Deutschland GmbH. 2020. 页码 98-113. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)). doi: 10.1007/978-3-030-65390-3_8

Densely Connected Bidirectional LSTM with Max-Pooling of CNN Network for Text Classification

摘要

出版系列

会议

访问文件

其它文件与链接

指纹

引用此