Meta-Path based Text Feature Enrichment Using Knowledge Graph

Jiayu Ding, Xiaohuan Cao, Linmei Hu, Chuan Shi

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Citation (Scopus)

Abstract

Text feature representation is an important and fundamental problem widely studied in many text analysis tasks such as text classification. However, most of the existing methods on text feature extraction focus on text itself, for example, bagof-words (BOW). In this work, we propose to make use of Knowledge Graphs (KGs) to enrich text representation in a novel HIN perspective. There are two main challenges due to the complexity of KGs. First, how to address the ambiguity when mapping the entities in a text to a KG. Second, how to incorporate the relations of entities in the same document, which indicate the intra-document semantics. To solve these problems, we present a novel Meta-Path Based Text Feature Enrichment (MeTEN) method. The MeTEN can effectively map nouns or noun phrases in a text to entities in a KG, and effectively discover their relations represented by meta paths in the KG through a novel bi-directional meta path generation algorithm. Extensive experiments on real-world datasets demonstrate that MeTEN can effectively enrich text feature and thus improve text classification.

Original languageEnglish
Title of host publicationProceedings - 2019 IEEE 4th International Conference on Data Science in Cyberspace, DSC 2019
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages649-655
Number of pages7
ISBN (Electronic)9781728145280
DOIs
Publication statusPublished - Jun 2019
Externally publishedYes
Event4th IEEE International Conference on Data Science in Cyberspace, DSC 2019 - Hangzhou, China
Duration: 23 Jun 201925 Jun 2019

Publication series

NameProceedings - 2019 IEEE 4th International Conference on Data Science in Cyberspace, DSC 2019
Volume2019-January

Conference

Conference4th IEEE International Conference on Data Science in Cyberspace, DSC 2019
Country/TerritoryChina
CityHangzhou
Period23/06/1925/06/19

Keywords

  • Bi-directional Random Walk
  • Heterogeneous Information Network
  • Knowledge Graph
  • Meta-path
  • Text Feature

Fingerprint

Dive into the research topics of 'Meta-Path based Text Feature Enrichment Using Knowledge Graph'. Together they form a unique fingerprint.

Cite this