TY - JOUR
T1 - HeGCL
T2 - Advance Self-Supervised Learning in Heterogeneous Graph-Level Representation
AU - Shi, Gen
AU - Zhu, Yifan
AU - Liu, Jian K.
AU - Li, Xuesong
N1 - Publisher Copyright:
© 2012 IEEE.
PY - 2024
Y1 - 2024
AB - Representation learning in heterogeneous graphs with massive unlabeled data has attracted great interest. The heterogeneity of graphs not only carries rich information but also poses significant barriers to designing unsupervised or self-supervised learning (SSL) strategies. Existing methods, such as random walk-based approaches, depend mainly on neighborhood proximity and lack the ability to integrate node features into a higher-level representation. Furthermore, previous self-supervised or unsupervised frameworks are usually designed for node-level tasks; they often fail to capture global graph properties and may not perform well on graph-level tasks. Therefore, a label-free framework that better captures the global properties of heterogeneous graphs is urgently required. In this article, we propose a self-supervised heterogeneous graph neural network (GNN) based on cross-view contrastive learning (HeGCL). HeGCL presents two views for encoding heterogeneous graphs: the meta-path view and the outline view. Whereas the meta-path view provides semantic information, the outline view encodes complex edge relations and captures graph-level properties using a nonlocal block. HeGCL thus learns node embeddings by maximizing the mutual information (MI) between the global and semantic representations derived from the outline and meta-path views, respectively. Experiments on both node-level and graph-level tasks show the superiority of the proposed model over other methods, and further exploration studies show that the introduction of the nonlocal block contributes significantly to graph-level tasks.
KW - Graph neural networks (GNNs)
KW - heterogeneous graphs
KW - self-supervised learning (SSL)
UR - http://www.scopus.com/inward/record.url?scp=85161044324&partnerID=8YFLogxK
U2 - 10.1109/TNNLS.2023.3273255
DO - 10.1109/TNNLS.2023.3273255
M3 - Article
AN - SCOPUS:85161044324
SN - 2162-237X
VL - 35
SP - 13914
EP - 13925
JO - IEEE Transactions on Neural Networks and Learning Systems
JF - IEEE Transactions on Neural Networks and Learning Systems
IS - 10
ER -
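
For a concrete sense of the cross-view objective the abstract describes, below is a minimal sketch of maximizing mutual information between a global (outline-view) summary and per-node semantic (meta-path-view) embeddings. This is not the paper's implementation: the DGI-style bilinear discriminator, the shuffled-negative corruption, all names, and all dimensions are assumptions for illustration only.

# Hedged sketch (assumed, not from the paper's code): a Deep InfoMax-style
# cross-view contrastive objective of the kind the abstract describes.
import torch
import torch.nn as nn

class CrossViewMIObjective(nn.Module):
    def __init__(self, dim: int):
        super().__init__()
        # Bilinear discriminator scoring (node embedding, global summary) pairs.
        self.discriminator = nn.Bilinear(dim, dim, 1)
        self.loss_fn = nn.BCEWithLogitsLoss()

    def forward(self, semantic: torch.Tensor, global_summary: torch.Tensor) -> torch.Tensor:
        # semantic: (N, dim) node embeddings, standing in for the meta-path view.
        # global_summary: (dim,) graph-level embedding, standing in for the outline view.
        n = semantic.size(0)
        summary = global_summary.expand(n, -1)
        # Negatives: pair the same summary with shuffled (corrupted) node embeddings.
        corrupted = semantic[torch.randperm(n)]
        pos = self.discriminator(semantic, summary)
        neg = self.discriminator(corrupted, summary)
        logits = torch.cat([pos, neg], dim=0).squeeze(-1)
        labels = torch.cat([torch.ones(n), torch.zeros(n)])
        # BCE on positive vs. corrupted pairs is a standard proxy for an MI lower bound.
        return self.loss_fn(logits, labels)

# Usage with random tensors standing in for the two views' encoder outputs.
if __name__ == "__main__":
    objective = CrossViewMIObjective(dim=64)
    node_emb = torch.randn(32, 64)   # hypothetical meta-path-view node embeddings
    graph_emb = torch.randn(64)      # hypothetical outline-view global summary
    print(objective(node_emb, graph_emb).item())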