Domain-specific meta-embedding with latent semantic structures

Qian Liu, Jie Lu*, Guangquan Zhang, Tao Shen, Zhihan Zhang, Heyan Huang

*Corresponding author for this work

Research output: Contribution to journal › Article › peer-review

19 Citations (Scopus)

Abstract

Meta-embedding aims to assemble pre-trained embeddings from various sources and produce more expressive word representations. Many natural language processing (NLP) tasks in a specific domain benefit from meta-embedding, especially when the task is low-resource. This paper proposes an unsupervised meta-embedding method that jointly models background knowledge from the source embeddings and domain-specific knowledge from the task domain. Specifically, the embeddings of a word from multiple sources are dynamically aggregated into a single meta-embedding by a differentiable attention module. Because the source embeddings are pre-trained on large-scale corpora, they provide broad background knowledge of word usage. The meta-embedding is then further enriched with domain-specific knowledge from each task domain in two ways. First, contextual information in the raw corpus is exploited to capture word semantics. Second, a graph representing domain-specific semantic structures is extracted from the raw corpus to highlight relationships between salient words; this graph is modeled by a graph convolutional network to capture the rich semantic structures among words in the task domain. Experiments on two tasks, text classification and relation extraction, show that our model produces more accurate word meta-embeddings for the task domain than other state-of-the-art competitors.
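To make the two core components of the abstract concrete, below is a minimal PyTorch sketch of (a) attention-based aggregation of source embeddings into a meta-embedding and (b) one graph-convolution layer over a domain word graph. All class and variable names (`MetaEmbeddingAttention`, `GraphConvLayer`, `adj_norm`, etc.) are hypothetical illustrations of the techniques the abstract names, not the authors' implementation.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F


class MetaEmbeddingAttention(nn.Module):
    """Hypothetical sketch: fuse K source embeddings of a word into one
    meta-embedding via a differentiable attention module."""

    def __init__(self, dim: int):
        super().__init__()
        self.score = nn.Linear(dim, 1)  # scores each source embedding

    def forward(self, source_embs: torch.Tensor) -> torch.Tensor:
        # source_embs: (batch, K, dim); sources already projected to a shared dim
        scores = self.score(torch.tanh(source_embs)).squeeze(-1)   # (batch, K)
        weights = F.softmax(scores, dim=-1)                        # attention over sources
        return (weights.unsqueeze(-1) * source_embs).sum(dim=1)    # (batch, dim)


class GraphConvLayer(nn.Module):
    """Hypothetical sketch of one GCN layer over the domain word graph:
    H' = ReLU(A_hat @ H @ W), with A_hat a normalized adjacency matrix."""

    def __init__(self, in_dim: int, out_dim: int):
        super().__init__()
        self.proj = nn.Linear(in_dim, out_dim, bias=False)

    def forward(self, h: torch.Tensor, adj_norm: torch.Tensor) -> torch.Tensor:
        # h: (V, in_dim) node features, e.g. the meta-embeddings of V words
        # adj_norm: (V, V) symmetrically normalized adjacency with self-loops
        return F.relu(adj_norm @ self.proj(h))


# Toy usage: 5 words, 3 embedding sources, dimension 8.
embs = torch.randn(5, 3, 8)
meta = MetaEmbeddingAttention(dim=8)(embs)   # (5, 8) meta-embeddings
adj = torch.eye(5)                           # trivial self-loop graph for illustration
enriched = GraphConvLayer(8, 8)(meta, adj)   # (5, 8) domain-enriched embeddings
```

In practice the adjacency matrix would come from the domain-specific semantic graph extracted from the raw corpus, rather than the identity matrix used here for illustration.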

Original language: English
Pages (from-to): 410-423
Number of pages: 14
Journal: Information Sciences
Volume: 555
DOI
Publication status: Published - May 2021
