Text classification with enriched word features

Jingda Xu, Cheng Zhang, Peng Zhang, Dawei Song*

*此作品的通讯作者

科研成果: 书/报告/会议事项章节会议稿件同行评审

5 引用 (Scopus)

摘要

Text classification is a fundamental task in natural language processing. Most existing text classification models focus on constructing sophisticated high-level text features but ignore the importance of word features. Those models only use low-level word features obtained from a linear layer as input. To explore how the quality of word representations affects text classification, we propose a deep architecture which can extract high-level word features to perform text classification. Specifically, we use different temporal convolution filters, which vary in size, to capture different contextual features. Then a transition layer is used to coalesce the contextual features and form an enriched high-level word representations. We also find that word feature reuse is useful in our architecture to enrich word representations. Extensive experiments on six publically available datasets show that enriched word representations can significantly improve the performance of classification models.

源语言英语
主期刊名PRICAI 2018
主期刊副标题Trends in Artificial Intelligence - 15th Pacific Rim International Conference on Artificial Intelligence, Proceedings
编辑Xin Geng, Byeong-Ho Kang
出版商Springer Verlag
274-281
页数8
ISBN(印刷版)9783319973098
DOI
出版状态已出版 - 2018
已对外发布
活动15th Pacific Rim International Conference on Artificial Intelligence, PRICAI 2018 - Nanjing, 中国
期限: 28 8月 201831 8月 2018

出版系列

姓名Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
11013 LNAI
ISSN(印刷版)0302-9743
ISSN(电子版)1611-3349

会议

会议15th Pacific Rim International Conference on Artificial Intelligence, PRICAI 2018
国家/地区中国
Nanjing
时期28/08/1831/08/18

指纹

探究 'Text classification with enriched word features' 的科研主题。它们共同构成独一无二的指纹。

引用此