Rule-based graph repairing: semantic and efficient repairing methods

科研成果: 书/报告/会议事项章节会议稿件同行评审

28 引用 (Scopus)

摘要

Real-life graph datasets extracted from Web are inevitably full of incompleteness, conflicts, and redundancies, so graph data cleaning shows its necessity. One of the main issues is to automatically repair the graph with some repairing rules. Although rules like data dependencies have been widely studied in relational data repairing, very few works exist to repair the graph data. In this paper, we introduce an automatic repairing semantic for graphs, called Graph-Repairing Rules (GRRs). This semantic can capture the incompleteness, conflicts, and redundancies in the graphs and indicate how to correct these errors. We study three fundamental problems associated with GRRs, implication, consistency and termination, which show whether a given set of GRRs make sense. Repairing the graph data using GRRs involves a problem of finding isomorphic subgraphs of the graph data for each GRR, which is NP-complete. To efficiently circumvent the complex calculation of subgraph isomorphism, we design a decomposition-And-join strategy to solve this problem. Extensive experiments on real datasets show that our GRR semantic and corresponding repairing algorithms can effectively and efficiently repair real-life graph data.

源语言英语
主期刊名Proceedings - IEEE 34th International Conference on Data Engineering, ICDE 2018
出版商Institute of Electrical and Electronics Engineers Inc.
773-784
页数12
ISBN(电子版)9781538655207
DOI
出版状态已出版 - 24 10月 2018
活动34th IEEE International Conference on Data Engineering, ICDE 2018 - Paris, 法国
期限: 16 4月 201819 4月 2018

出版系列

姓名Proceedings - IEEE 34th International Conference on Data Engineering, ICDE 2018

会议

会议34th IEEE International Conference on Data Engineering, ICDE 2018
国家/地区法国
Paris
时期16/04/1819/04/18

指纹

探究 'Rule-based graph repairing: semantic and efficient repairing methods' 的科研主题。它们共同构成独一无二的指纹。

引用此