位置社交网络上的图表示学习

Lin Lin Zhao; An Biao Wu; Ye Yuan; Yang Li; Guo Ren Wang

doi:10.11897/SP.J.1016.2022.00838

位置社交网络上的图表示学习

Lin Lin Zhao, An Biao Wu, Ye Yuan^*, Yang Li, Guo Ren Wang

^*此作品的通讯作者

计算机学院

Northeastern University China

科研成果: 期刊稿件 › 文章 › 同行评审

1 引用（Scopus）

摘要

With the popularity of online Social Networks, location-based social networks (LBSN) have accumulated massive data and have been widely used in the research of mining user behavior preferences due to their rich Spatio-Temporal and semantic information. Nevertheless, the traditional manual extraction of LBSN features is limited and time-consuming. In recent years, graph representation learning has been successfully applied to the modeling and representation of various graph structure data such as recommendation systems and knowledge maps, demonstrating its powerful non-linear fitting and representation learning capabilities. However, most of the existing Graph representation learning studies focus on static and homogeneous networks, and it is difficult to simultaneously combine time, location information, and social relationships to capture the complex structure and user preferences in LBSN, which makes it difficult to extract effective information from LBSN. Therefore, this paper proposes a two-stage Graph representation learning framework TGE-LBSN (Two Stages of Graph Embedding on LBSN) for LBSN, which transforms LBSN into a heterogeneous network, and automatically extracts the features of LBSN with the help of graph representation learning to obtain nodes' vector representation with sufficiently rich information and utilize the prediction and recommendation tasks in the social domain to verify its effectiveness. First of all, biased sampling is carried out on the check-in hyperedge of LBSN according to users' Check-in time. In the first stage, the IVGS (Initial Vector Generation Stage) algorithm is designed, the friendship edges and the Check-in super edges are used to jointly generate nodes' vectors containing position and feature information by IVGS algorithm. The generated nodes' vectors are used as the input of the second stage. In addition, the second stage is mainly responsible for generating the final nodes' vectors in LBSN. LBSN is divided into different subgraphs according to the Check-in time, and we design the LBSN-oriented SAN (Select Aggregated Neighbors) strategy which is used to select representative neighbors to complete the aggregation operation, and then use the subgraph vector generation algorithm SVG (Subgraph Vector Generation) to obtain the vector representation of the nodes in each subgraph. Finally, the loss function is set according to the downstream tasks, and the attention mechanism is also used to learn adaptive weights for the subgraphs in different time periods to obtain the final vectors of the nodes, and then we use the final nodes' vectors to complete various prediction tasks in the social domain. Plenty of comparative experiments are carried out with the benchmark methods on the real LBSN data sets and on the time series social network, respectively, we use ROC curve as the evaluation standard. Extensive experimental results verify that the proposed algorithm TGE-LBSN outperforms other benchmark methods, and it can efficiently extract the effective information of LBSN and retain it in the embedding vector of the node. Specifically, in terms of friendship prediction, the AUC value can be increased by up to 42% compared with the existing models. On the point of interest recommendation task, the AUC value can be increased by up to 7% compared with the benchmark algorithm.

投稿的翻译标题	Graph representation learning on Location-Based Social Networks
源语言	繁体中文
页（从-至）	838-857
页数	20
期刊	Jisuanji Xuebao/Chinese Journal of Computers
卷	45
期	4
DOI	https://doi.org/10.11897/SP.J.1016.2022.00838
出版状态	已出版 - 4月 2022

关键词

Attention mechanism
Graph embedding
Heterogeneous network representation learning
Link prediction

访问文件

10.11897/SP.J.1016.2022.00838

其它文件与链接

链接到 Scopus 的出版物

引用此

@article{54f03b4901164815ad2de60150ac0c11,

title = "位置社交网络上的图表示学习",

abstract = "With the popularity of online Social Networks, location-based social networks (LBSN) have accumulated massive data and have been widely used in the research of mining user behavior preferences due to their rich Spatio-Temporal and semantic information. Nevertheless, the traditional manual extraction of LBSN features is limited and time-consuming. In recent years, graph representation learning has been successfully applied to the modeling and representation of various graph structure data such as recommendation systems and knowledge maps, demonstrating its powerful non-linear fitting and representation learning capabilities. However, most of the existing Graph representation learning studies focus on static and homogeneous networks, and it is difficult to simultaneously combine time, location information, and social relationships to capture the complex structure and user preferences in LBSN, which makes it difficult to extract effective information from LBSN. Therefore, this paper proposes a two-stage Graph representation learning framework TGE-LBSN (Two Stages of Graph Embedding on LBSN) for LBSN, which transforms LBSN into a heterogeneous network, and automatically extracts the features of LBSN with the help of graph representation learning to obtain nodes' vector representation with sufficiently rich information and utilize the prediction and recommendation tasks in the social domain to verify its effectiveness. First of all, biased sampling is carried out on the check-in hyperedge of LBSN according to users' Check-in time. In the first stage, the IVGS (Initial Vector Generation Stage) algorithm is designed, the friendship edges and the Check-in super edges are used to jointly generate nodes' vectors containing position and feature information by IVGS algorithm. The generated nodes' vectors are used as the input of the second stage. In addition, the second stage is mainly responsible for generating the final nodes' vectors in LBSN. LBSN is divided into different subgraphs according to the Check-in time, and we design the LBSN-oriented SAN (Select Aggregated Neighbors) strategy which is used to select representative neighbors to complete the aggregation operation, and then use the subgraph vector generation algorithm SVG (Subgraph Vector Generation) to obtain the vector representation of the nodes in each subgraph. Finally, the loss function is set according to the downstream tasks, and the attention mechanism is also used to learn adaptive weights for the subgraphs in different time periods to obtain the final vectors of the nodes, and then we use the final nodes' vectors to complete various prediction tasks in the social domain. Plenty of comparative experiments are carried out with the benchmark methods on the real LBSN data sets and on the time series social network, respectively, we use ROC curve as the evaluation standard. Extensive experimental results verify that the proposed algorithm TGE-LBSN outperforms other benchmark methods, and it can efficiently extract the effective information of LBSN and retain it in the embedding vector of the node. Specifically, in terms of friendship prediction, the AUC value can be increased by up to 42% compared with the existing models. On the point of interest recommendation task, the AUC value can be increased by up to 7% compared with the benchmark algorithm.",

keywords = "Attention mechanism, Graph embedding, Heterogeneous network representation learning, Link prediction",

author = "Zhao, {Lin Lin} and Wu, {An Biao} and Ye Yuan and Yang Li and Wang, {Guo Ren}",

year = "2022",

month = apr,

doi = "10.11897/SP.J.1016.2022.00838",

language = "繁体中文",

volume = "45",

pages = "838--857",

journal = "Jisuanji Xuebao/Chinese Journal of Computers",

issn = "0254-4164",

publisher = "Science Press",

number = "4",

}

TY - JOUR

T1 - 位置社交网络上的图表示学习

AU - Zhao, Lin Lin

AU - Wu, An Biao

AU - Yuan, Ye

AU - Li, Yang

AU - Wang, Guo Ren

PY - 2022/4

Y1 - 2022/4

N2 - With the popularity of online Social Networks, location-based social networks (LBSN) have accumulated massive data and have been widely used in the research of mining user behavior preferences due to their rich Spatio-Temporal and semantic information. Nevertheless, the traditional manual extraction of LBSN features is limited and time-consuming. In recent years, graph representation learning has been successfully applied to the modeling and representation of various graph structure data such as recommendation systems and knowledge maps, demonstrating its powerful non-linear fitting and representation learning capabilities. However, most of the existing Graph representation learning studies focus on static and homogeneous networks, and it is difficult to simultaneously combine time, location information, and social relationships to capture the complex structure and user preferences in LBSN, which makes it difficult to extract effective information from LBSN. Therefore, this paper proposes a two-stage Graph representation learning framework TGE-LBSN (Two Stages of Graph Embedding on LBSN) for LBSN, which transforms LBSN into a heterogeneous network, and automatically extracts the features of LBSN with the help of graph representation learning to obtain nodes' vector representation with sufficiently rich information and utilize the prediction and recommendation tasks in the social domain to verify its effectiveness. First of all, biased sampling is carried out on the check-in hyperedge of LBSN according to users' Check-in time. In the first stage, the IVGS (Initial Vector Generation Stage) algorithm is designed, the friendship edges and the Check-in super edges are used to jointly generate nodes' vectors containing position and feature information by IVGS algorithm. The generated nodes' vectors are used as the input of the second stage. In addition, the second stage is mainly responsible for generating the final nodes' vectors in LBSN. LBSN is divided into different subgraphs according to the Check-in time, and we design the LBSN-oriented SAN (Select Aggregated Neighbors) strategy which is used to select representative neighbors to complete the aggregation operation, and then use the subgraph vector generation algorithm SVG (Subgraph Vector Generation) to obtain the vector representation of the nodes in each subgraph. Finally, the loss function is set according to the downstream tasks, and the attention mechanism is also used to learn adaptive weights for the subgraphs in different time periods to obtain the final vectors of the nodes, and then we use the final nodes' vectors to complete various prediction tasks in the social domain. Plenty of comparative experiments are carried out with the benchmark methods on the real LBSN data sets and on the time series social network, respectively, we use ROC curve as the evaluation standard. Extensive experimental results verify that the proposed algorithm TGE-LBSN outperforms other benchmark methods, and it can efficiently extract the effective information of LBSN and retain it in the embedding vector of the node. Specifically, in terms of friendship prediction, the AUC value can be increased by up to 42% compared with the existing models. On the point of interest recommendation task, the AUC value can be increased by up to 7% compared with the benchmark algorithm.

AB - With the popularity of online Social Networks, location-based social networks (LBSN) have accumulated massive data and have been widely used in the research of mining user behavior preferences due to their rich Spatio-Temporal and semantic information. Nevertheless, the traditional manual extraction of LBSN features is limited and time-consuming. In recent years, graph representation learning has been successfully applied to the modeling and representation of various graph structure data such as recommendation systems and knowledge maps, demonstrating its powerful non-linear fitting and representation learning capabilities. However, most of the existing Graph representation learning studies focus on static and homogeneous networks, and it is difficult to simultaneously combine time, location information, and social relationships to capture the complex structure and user preferences in LBSN, which makes it difficult to extract effective information from LBSN. Therefore, this paper proposes a two-stage Graph representation learning framework TGE-LBSN (Two Stages of Graph Embedding on LBSN) for LBSN, which transforms LBSN into a heterogeneous network, and automatically extracts the features of LBSN with the help of graph representation learning to obtain nodes' vector representation with sufficiently rich information and utilize the prediction and recommendation tasks in the social domain to verify its effectiveness. First of all, biased sampling is carried out on the check-in hyperedge of LBSN according to users' Check-in time. In the first stage, the IVGS (Initial Vector Generation Stage) algorithm is designed, the friendship edges and the Check-in super edges are used to jointly generate nodes' vectors containing position and feature information by IVGS algorithm. The generated nodes' vectors are used as the input of the second stage. In addition, the second stage is mainly responsible for generating the final nodes' vectors in LBSN. LBSN is divided into different subgraphs according to the Check-in time, and we design the LBSN-oriented SAN (Select Aggregated Neighbors) strategy which is used to select representative neighbors to complete the aggregation operation, and then use the subgraph vector generation algorithm SVG (Subgraph Vector Generation) to obtain the vector representation of the nodes in each subgraph. Finally, the loss function is set according to the downstream tasks, and the attention mechanism is also used to learn adaptive weights for the subgraphs in different time periods to obtain the final vectors of the nodes, and then we use the final nodes' vectors to complete various prediction tasks in the social domain. Plenty of comparative experiments are carried out with the benchmark methods on the real LBSN data sets and on the time series social network, respectively, we use ROC curve as the evaluation standard. Extensive experimental results verify that the proposed algorithm TGE-LBSN outperforms other benchmark methods, and it can efficiently extract the effective information of LBSN and retain it in the embedding vector of the node. Specifically, in terms of friendship prediction, the AUC value can be increased by up to 42% compared with the existing models. On the point of interest recommendation task, the AUC value can be increased by up to 7% compared with the benchmark algorithm.

KW - Attention mechanism

KW - Graph embedding

KW - Heterogeneous network representation learning

KW - Link prediction

UR - http://www.scopus.com/inward/record.url?scp=85128855845&partnerID=8YFLogxK

U2 - 10.11897/SP.J.1016.2022.00838

DO - 10.11897/SP.J.1016.2022.00838

M3 - 文章

AN - SCOPUS:85128855845

SN - 0254-4164

VL - 45

SP - 838

EP - 857

JO - Jisuanji Xuebao/Chinese Journal of Computers

JF - Jisuanji Xuebao/Chinese Journal of Computers

IS - 4

ER -

位置社交网络上的图表示学习

摘要

关键词

访问文件

其它文件与链接

指纹

引用此