Cached Embedding with Random Selection: Optimization Technique to Improve Training Speed of Character-Aware Embedding

Yaofei Yang, Hua Ping Zhang*, Linfang Wu, Xin Liu, Yangsen Zhang

*此作品的通讯作者

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Embedding is widely used in most natural language processing. e.g., neural machine translation, text classification, text abstraction and sentiment analysis etc. Word-based embedding is faster and character-based embedding performs better. In this paper, we explore a way to combine these two embeddings to bridge the gap between word-based and character-based embedding in speed and performance. In the experiments and analysis of Hybrid Embedding, we found it’s difficult to make these two different embeddings generate the same embedding vector, but we still obtain a comparable result. According to the results of analysis, we explore a form of character-based embedding called Cached Embedding that can achieve almost the same performance and reduce the extra training time by almost half compared to character-based embedding.

源语言英语
主期刊名Intelligent Information and Database Systems - 12th Asian Conference, ACIIDS 2020, Proceedings
编辑Ngoc Thanh Nguyen, Bogdan Trawinski, Kietikul Jearanaitanakij, Suphamit Chittayasothorn, Ali Selamat
出版商Springer
51-62
页数12
ISBN(印刷版)9783030419639
DOI
出版状态已出版 - 2020
活动12th Asian Conference on Intelligent Information and Database Systems, ACIIDS 2020 - Phuket, 泰国
期限: 23 3月 202026 3月 2020

出版系列

姓名Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
12033 LNAI
ISSN(印刷版)0302-9743
ISSN(电子版)1611-3349

会议

会议12th Asian Conference on Intelligent Information and Database Systems, ACIIDS 2020
国家/地区泰国
Phuket
时期23/03/2026/03/20

指纹

探究 'Cached Embedding with Random Selection: Optimization Technique to Improve Training Speed of Character-Aware Embedding' 的科研主题。它们共同构成独一无二的指纹。

引用此