TY - GEN
T1 - Meta-Path based Text Feature Enrichment Using Knowledge Graph
AU - Ding, Jiayu
AU - Cao, Xiaohuan
AU - Hu, Linmei
AU - Shi, Chuan
N1 - Publisher Copyright:
© 2019 IEEE.
PY - 2019/6
Y1 - 2019/6
N2 - Text feature representation is an important and fundamental problem widely studied in many text analysis tasks such as text classification. However, most of the existing methods on text feature extraction focus on text itself, for example, bagof-words (BOW). In this work, we propose to make use of Knowledge Graphs (KGs) to enrich text representation in a novel HIN perspective. There are two main challenges due to the complexity of KGs. First, how to address the ambiguity when mapping the entities in a text to a KG. Second, how to incorporate the relations of entities in the same document, which indicate the intra-document semantics. To solve these problems, we present a novel Meta-Path Based Text Feature Enrichment (MeTEN) method. The MeTEN can effectively map nouns or noun phrases in a text to entities in a KG, and effectively discover their relations represented by meta paths in the KG through a novel bi-directional meta path generation algorithm. Extensive experiments on real-world datasets demonstrate that MeTEN can effectively enrich text feature and thus improve text classification.
AB - Text feature representation is an important and fundamental problem widely studied in many text analysis tasks such as text classification. However, most of the existing methods on text feature extraction focus on text itself, for example, bagof-words (BOW). In this work, we propose to make use of Knowledge Graphs (KGs) to enrich text representation in a novel HIN perspective. There are two main challenges due to the complexity of KGs. First, how to address the ambiguity when mapping the entities in a text to a KG. Second, how to incorporate the relations of entities in the same document, which indicate the intra-document semantics. To solve these problems, we present a novel Meta-Path Based Text Feature Enrichment (MeTEN) method. The MeTEN can effectively map nouns or noun phrases in a text to entities in a KG, and effectively discover their relations represented by meta paths in the KG through a novel bi-directional meta path generation algorithm. Extensive experiments on real-world datasets demonstrate that MeTEN can effectively enrich text feature and thus improve text classification.
KW - Bi-directional Random Walk
KW - Heterogeneous Information Network
KW - Knowledge Graph
KW - Meta-path
KW - Text Feature
UR - http://www.scopus.com/inward/record.url?scp=85084496316&partnerID=8YFLogxK
U2 - 10.1109/DSC.2019.00106
DO - 10.1109/DSC.2019.00106
M3 - Conference contribution
AN - SCOPUS:85084496316
T3 - Proceedings - 2019 IEEE 4th International Conference on Data Science in Cyberspace, DSC 2019
SP - 649
EP - 655
BT - Proceedings - 2019 IEEE 4th International Conference on Data Science in Cyberspace, DSC 2019
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 4th IEEE International Conference on Data Science in Cyberspace, DSC 2019
Y2 - 23 June 2019 through 25 June 2019
ER -