Abstract
Cross-domain word representation aims to learn high-quality semantic representations in an under-resourced domain by leveraging information from a resource-rich domain. However, most existing methods mainly transfer the semantics of words common to both domains, ignoring the semantic relations among domain-specific words. In this paper, we propose a domain structure-based transfer learning method that learns cross-domain representations by exploiting the relations among domain-specific words. To this end, we first construct a semantic graph that captures the latent domain structure from domain-specific co-occurrence information. Then, during domain adaptation, beyond domain alignment, we employ Laplacian Eigenmaps to ensure that the domain structure is consistently preserved in the learned embedding space. As a result, the learned cross-domain word representations not only capture the semantics shared across domains but also maintain the latent domain structure. We conducted extensive experiments on two tasks, sentiment analysis and query expansion; the results demonstrate the effectiveness of our method for tasks in under-resourced domains.
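The abstract names two computational ingredients: a semantic graph built from domain-specific co-occurrence counts, and a Laplacian Eigenmaps penalty that keeps that structure intact during adaptation. The paper's own implementation is not reproduced here; the sketch below is a minimal NumPy illustration of the general technique, with hypothetical names (`cooccurrence_graph`, `laplacian_penalty`, `structure_step`) and a toy corpus standing in for target-domain data.

```python
import numpy as np

def cooccurrence_graph(corpus, vocab, window=5):
    """Build a symmetric co-occurrence weight matrix W over the
    domain-specific vocabulary; W encodes the latent domain structure."""
    idx = {w: i for i, w in enumerate(vocab)}
    W = np.zeros((len(vocab), len(vocab)))
    for sent in corpus:
        for i, w in enumerate(sent):
            if w not in idx:
                continue
            for c in sent[max(0, i - window): i + window + 1]:
                if c in idx and c != w:
                    W[idx[w], idx[c]] += 1.0
    return W

def laplacian_penalty(E, W):
    """Laplacian Eigenmaps term tr(E^T L E) with L = D - W: small when
    strongly co-occurring words sit close together in embedding space."""
    L = np.diag(W.sum(axis=1)) - W
    return np.trace(E.T @ L @ E)

def structure_step(E, W, lr=0.01):
    """One gradient step on the structure term alone (gradient is 2 L E);
    in the full method this would be combined with a domain-alignment loss."""
    L = np.diag(W.sum(axis=1)) - W
    return E - lr * 2.0 * (L @ E)

# Toy usage with a hypothetical target-domain corpus.
vocab = ["battery", "screen", "charger", "plot"]
corpus = [["battery", "charger", "screen"], ["screen", "battery"], ["plot"]]
W = cooccurrence_graph(corpus, vocab, window=2)
E = np.random.randn(len(vocab), 8)      # 8-dim toy embeddings
print(laplacian_penalty(E, W))          # decreases after structure_step(E, W)
```

In the full method this structure term would act as a regularizer alongside the domain-alignment objective; the sketch steps it in isolation only to show its effect on the embeddings.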
| Original language | English |
| --- | --- |
| Pages (from-to) | 145-156 |
| Number of pages | 12 |
| Journal | Information Fusion |
| Volume | 76 |
| DOIs | |
| Publication status | Published - Dec 2021 |
Keywords
- Semantic structure
- Transfer learning
- Word representation