Text classification with enriched word features

Jingda Xu; Cheng Zhang; Peng Zhang; Dawei Song

doi:10.1007/978-3-319-97310-4_31

Text classification with enriched word features

Jingda Xu, Cheng Zhang, Peng Zhang, Dawei Song^*

^*此作品的通讯作者

Tianjin University

科研成果: 书/报告/会议事项章节 › 会议稿件 › 同行评审

5 引用（Scopus）

摘要

Text classification is a fundamental task in natural language processing. Most existing text classification models focus on constructing sophisticated high-level text features but ignore the importance of word features. Those models only use low-level word features obtained from a linear layer as input. To explore how the quality of word representations affects text classification, we propose a deep architecture which can extract high-level word features to perform text classification. Specifically, we use different temporal convolution filters, which vary in size, to capture different contextual features. Then a transition layer is used to coalesce the contextual features and form an enriched high-level word representations. We also find that word feature reuse is useful in our architecture to enrich word representations. Extensive experiments on six publically available datasets show that enriched word representations can significantly improve the performance of classification models.

源语言	英语
主期刊名	PRICAI 2018
主期刊副标题	Trends in Artificial Intelligence - 15th Pacific Rim International Conference on Artificial Intelligence, Proceedings
编辑	Xin Geng, Byeong-Ho Kang
出版商	Springer Verlag
页	274-281
页数	8
ISBN（印刷版）	9783319973098
DOI	https://doi.org/10.1007/978-3-319-97310-4_31
出版状态	已出版 - 2018
已对外发布	是
活动	15th Pacific Rim International Conference on Artificial Intelligence, PRICAI 2018 - Nanjing, 中国期限: 28 8月 2018 → 31 8月 2018

出版系列

姓名	Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
卷	11013 LNAI
ISSN（印刷版）	0302-9743
ISSN（电子版）	1611-3349

会议

会议	15th Pacific Rim International Conference on Artificial Intelligence, PRICAI 2018
国家/地区	中国
市	Nanjing
时期	28/08/18 → 31/08/18

访问文件

10.1007/978-3-319-97310-4_31

其它文件与链接

链接到 Scopus 的出版物

引用此

Xu, J., Zhang, C., Zhang, P., & Song, D. (2018). Text classification with enriched word features. 在 X. Geng, & B.-H. Kang (编辑), PRICAI 2018: Trends in Artificial Intelligence - 15th Pacific Rim International Conference on Artificial Intelligence, Proceedings (页码 274-281). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); 卷 11013 LNAI). Springer Verlag. https://doi.org/10.1007/978-3-319-97310-4_31

Xu, Jingda ; Zhang, Cheng ; Zhang, Peng 等. / Text classification with enriched word features. PRICAI 2018: Trends in Artificial Intelligence - 15th Pacific Rim International Conference on Artificial Intelligence, Proceedings. 编辑 / Xin Geng ; Byeong-Ho Kang. Springer Verlag, 2018. 页码 274-281 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)).

@inproceedings{370bf4263b8347d48498b6d38fe1c50b,

title = "Text classification with enriched word features",

abstract = "Text classification is a fundamental task in natural language processing. Most existing text classification models focus on constructing sophisticated high-level text features but ignore the importance of word features. Those models only use low-level word features obtained from a linear layer as input. To explore how the quality of word representations affects text classification, we propose a deep architecture which can extract high-level word features to perform text classification. Specifically, we use different temporal convolution filters, which vary in size, to capture different contextual features. Then a transition layer is used to coalesce the contextual features and form an enriched high-level word representations. We also find that word feature reuse is useful in our architecture to enrich word representations. Extensive experiments on six publically available datasets show that enriched word representations can significantly improve the performance of classification models.",

keywords = "Enriched word representation, Temporal convolution, Text classification",

author = "Jingda Xu and Cheng Zhang and Peng Zhang and Dawei Song",

note = "Publisher Copyright: {\textcopyright} Springer International Publishing AG, part of Springer Nature 2018.; 15th Pacific Rim International Conference on Artificial Intelligence, PRICAI 2018 ; Conference date: 28-08-2018 Through 31-08-2018",

year = "2018",

doi = "10.1007/978-3-319-97310-4_31",

language = "English",

isbn = "9783319973098",

series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",

publisher = "Springer Verlag",

pages = "274--281",

editor = "Xin Geng and Byeong-Ho Kang",

booktitle = "PRICAI 2018",

address = "Germany",

}

Xu, J, Zhang, C, Zhang, P & Song, D 2018, Text classification with enriched word features. 在 X Geng & B-H Kang (编辑), PRICAI 2018: Trends in Artificial Intelligence - 15th Pacific Rim International Conference on Artificial Intelligence, Proceedings. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 卷 11013 LNAI, Springer Verlag, 页码 274-281, 15th Pacific Rim International Conference on Artificial Intelligence, PRICAI 2018, Nanjing, 中国, 28/08/18. https://doi.org/10.1007/978-3-319-97310-4_31

Text classification with enriched word features. / Xu, Jingda; Zhang, Cheng; Zhang, Peng 等.
PRICAI 2018: Trends in Artificial Intelligence - 15th Pacific Rim International Conference on Artificial Intelligence, Proceedings. 编辑 / Xin Geng; Byeong-Ho Kang. Springer Verlag, 2018. 页码 274-281 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); 卷 11013 LNAI).

科研成果: 书/报告/会议事项章节 › 会议稿件 › 同行评审

TY - GEN

T1 - Text classification with enriched word features

AU - Xu, Jingda

AU - Zhang, Cheng

AU - Zhang, Peng

AU - Song, Dawei

N1 - Publisher Copyright: © Springer International Publishing AG, part of Springer Nature 2018.

PY - 2018

Y1 - 2018

N2 - Text classification is a fundamental task in natural language processing. Most existing text classification models focus on constructing sophisticated high-level text features but ignore the importance of word features. Those models only use low-level word features obtained from a linear layer as input. To explore how the quality of word representations affects text classification, we propose a deep architecture which can extract high-level word features to perform text classification. Specifically, we use different temporal convolution filters, which vary in size, to capture different contextual features. Then a transition layer is used to coalesce the contextual features and form an enriched high-level word representations. We also find that word feature reuse is useful in our architecture to enrich word representations. Extensive experiments on six publically available datasets show that enriched word representations can significantly improve the performance of classification models.

AB - Text classification is a fundamental task in natural language processing. Most existing text classification models focus on constructing sophisticated high-level text features but ignore the importance of word features. Those models only use low-level word features obtained from a linear layer as input. To explore how the quality of word representations affects text classification, we propose a deep architecture which can extract high-level word features to perform text classification. Specifically, we use different temporal convolution filters, which vary in size, to capture different contextual features. Then a transition layer is used to coalesce the contextual features and form an enriched high-level word representations. We also find that word feature reuse is useful in our architecture to enrich word representations. Extensive experiments on six publically available datasets show that enriched word representations can significantly improve the performance of classification models.

KW - Enriched word representation

KW - Temporal convolution

KW - Text classification

UR - http://www.scopus.com/inward/record.url?scp=85051949978&partnerID=8YFLogxK

U2 - 10.1007/978-3-319-97310-4_31

DO - 10.1007/978-3-319-97310-4_31

M3 - Conference contribution

AN - SCOPUS:85051949978

SN - 9783319973098

T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SP - 274

EP - 281

BT - PRICAI 2018

A2 - Geng, Xin

A2 - Kang, Byeong-Ho

PB - Springer Verlag

T2 - 15th Pacific Rim International Conference on Artificial Intelligence, PRICAI 2018

Y2 - 28 August 2018 through 31 August 2018

ER -

Xu J, Zhang C, Zhang P, Song D. Text classification with enriched word features. 在 Geng X, Kang BH, 编辑, PRICAI 2018: Trends in Artificial Intelligence - 15th Pacific Rim International Conference on Artificial Intelligence, Proceedings. Springer Verlag. 2018. 页码 274-281. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)). doi: 10.1007/978-3-319-97310-4_31

Text classification with enriched word features

摘要

出版系列

会议

访问文件

其它文件与链接

指纹

引用此