TBSGM: A fast subgraph matching method on large scale graphs

Fusheng Jin; Yifeng Yang; Shuliang Wang; Ye Xue; Zhen Yan

doi:10.4018/IJDWM.2018100104

TBSGM: A fast subgraph matching method on large scale graphs

Fusheng Jin, Yifeng Yang, Shuliang Wang, Ye Xue, Zhen Yan

School of Computer Science and Technology

Research output: Contribution to journal › Article › peer-review

2 Citations (Scopus)

Abstract

Subgraph matching, which belongs to NP-hard, faces significant challenges on a large scale graph with billions of nodes, and existing methods are usually confronted with greater challenges from both stability and efficiency. In this article, a subgraph matching method in a distributed system, tree model-based subgraph matching method (TBSGM) is proposed. The authors provide a transformed efficient query tree as a replacement for a query graph. In order to get the tree, they present a cost evaluation model which may help to generate the efficient query tree according to network communication-cost and calculation-cost evaluation. Also, a key set based indexing strategy for intermediate results is given to simplify the matching results during network communication. Extensive experiments with real-world datasets show that TBSGM significantly outperforms other methods in the aspects of scalability and efficiency.

Original language	English
Pages (from-to)	67-89
Number of pages	23
Journal	International Journal of Data Warehousing and Mining
Volume	14
Issue number	4
DOIs	https://doi.org/10.4018/IJDWM.2018100104
Publication status	Published - 1 Oct 2018

Keywords

Efficient Query Tree
Indexing Strategy
Large Scale Graph
Subgraph Matching

Access to Document

10.4018/IJDWM.2018100104

Cite this

Jin, F., Yang, Y., Wang, S., Xue, Y., & Yan, Z. (2018). TBSGM: A fast subgraph matching method on large scale graphs. International Journal of Data Warehousing and Mining, 14(4), 67-89. https://doi.org/10.4018/IJDWM.2018100104

@article{db04c52e373f47dd8059e25bc23df0fe,

title = "TBSGM: A fast subgraph matching method on large scale graphs",

abstract = "Subgraph matching, which belongs to NP-hard, faces significant challenges on a large scale graph with billions of nodes, and existing methods are usually confronted with greater challenges from both stability and efficiency. In this article, a subgraph matching method in a distributed system, tree model-based subgraph matching method (TBSGM) is proposed. The authors provide a transformed efficient query tree as a replacement for a query graph. In order to get the tree, they present a cost evaluation model which may help to generate the efficient query tree according to network communication-cost and calculation-cost evaluation. Also, a key set based indexing strategy for intermediate results is given to simplify the matching results during network communication. Extensive experiments with real-world datasets show that TBSGM significantly outperforms other methods in the aspects of scalability and efficiency.",

keywords = "Efficient Query Tree, Indexing Strategy, Large Scale Graph, Subgraph Matching",

author = "Fusheng Jin and Yifeng Yang and Shuliang Wang and Ye Xue and Zhen Yan",

note = "Publisher Copyright: Copyright {\textcopyright} 2018, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited.",

year = "2018",

month = oct,

day = "1",

doi = "10.4018/IJDWM.2018100104",

language = "English",

volume = "14",

pages = "67--89",

journal = "International Journal of Data Warehousing and Mining",

issn = "1548-3924",

publisher = "IGI Publishing",

number = "4",

}

TY - JOUR

T1 - TBSGM

T2 - A fast subgraph matching method on large scale graphs

AU - Jin, Fusheng

AU - Yang, Yifeng

AU - Wang, Shuliang

AU - Xue, Ye

AU - Yan, Zhen

PY - 2018/10/1

Y1 - 2018/10/1

N2 - Subgraph matching, which belongs to NP-hard, faces significant challenges on a large scale graph with billions of nodes, and existing methods are usually confronted with greater challenges from both stability and efficiency. In this article, a subgraph matching method in a distributed system, tree model-based subgraph matching method (TBSGM) is proposed. The authors provide a transformed efficient query tree as a replacement for a query graph. In order to get the tree, they present a cost evaluation model which may help to generate the efficient query tree according to network communication-cost and calculation-cost evaluation. Also, a key set based indexing strategy for intermediate results is given to simplify the matching results during network communication. Extensive experiments with real-world datasets show that TBSGM significantly outperforms other methods in the aspects of scalability and efficiency.

AB - Subgraph matching, which belongs to NP-hard, faces significant challenges on a large scale graph with billions of nodes, and existing methods are usually confronted with greater challenges from both stability and efficiency. In this article, a subgraph matching method in a distributed system, tree model-based subgraph matching method (TBSGM) is proposed. The authors provide a transformed efficient query tree as a replacement for a query graph. In order to get the tree, they present a cost evaluation model which may help to generate the efficient query tree according to network communication-cost and calculation-cost evaluation. Also, a key set based indexing strategy for intermediate results is given to simplify the matching results during network communication. Extensive experiments with real-world datasets show that TBSGM significantly outperforms other methods in the aspects of scalability and efficiency.

KW - Efficient Query Tree

KW - Indexing Strategy

KW - Large Scale Graph

KW - Subgraph Matching

UR - http://www.scopus.com/inward/record.url?scp=85054294484&partnerID=8YFLogxK

U2 - 10.4018/IJDWM.2018100104

DO - 10.4018/IJDWM.2018100104

M3 - Article

AN - SCOPUS:85054294484

SN - 1548-3924

VL - 14

SP - 67

EP - 89

JO - International Journal of Data Warehousing and Mining

JF - International Journal of Data Warehousing and Mining

IS - 4

ER -

TBSGM: A fast subgraph matching method on large scale graphs

Abstract

Keywords

Access to Document

Other files and links

Fingerprint

Cite this