Cached Embedding with Random Selection: Optimization Technique to Improve Training Speed of Character-Aware Embedding

Yaofei Yang, Hua Ping Zhang*, Linfang Wu, Xin Liu, Yangsen Zhang

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Embedding is widely used in most natural language processing. e.g., neural machine translation, text classification, text abstraction and sentiment analysis etc. Word-based embedding is faster and character-based embedding performs better. In this paper, we explore a way to combine these two embeddings to bridge the gap between word-based and character-based embedding in speed and performance. In the experiments and analysis of Hybrid Embedding, we found it’s difficult to make these two different embeddings generate the same embedding vector, but we still obtain a comparable result. According to the results of analysis, we explore a form of character-based embedding called Cached Embedding that can achieve almost the same performance and reduce the extra training time by almost half compared to character-based embedding.

Original languageEnglish
Title of host publicationIntelligent Information and Database Systems - 12th Asian Conference, ACIIDS 2020, Proceedings
EditorsNgoc Thanh Nguyen, Bogdan Trawinski, Kietikul Jearanaitanakij, Suphamit Chittayasothorn, Ali Selamat
PublisherSpringer
Pages51-62
Number of pages12
ISBN (Print)9783030419639
DOIs
Publication statusPublished - 2020
Event12th Asian Conference on Intelligent Information and Database Systems, ACIIDS 2020 - Phuket, Thailand
Duration: 23 Mar 202026 Mar 2020

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume12033 LNAI
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference12th Asian Conference on Intelligent Information and Database Systems, ACIIDS 2020
Country/TerritoryThailand
CityPhuket
Period23/03/2026/03/20

Keywords

  • Cached Embedding
  • Char-aware embedding
  • Linguist
  • Natural language processing
  • Time reduction
  • Training speed
  • Word embedding

Fingerprint

Dive into the research topics of 'Cached Embedding with Random Selection: Optimization Technique to Improve Training Speed of Character-Aware Embedding'. Together they form a unique fingerprint.

Cite this