Skip to main navigation Skip to search Skip to main content

Keyword search over probabilistic XML data

  • Yue Zhao*
  • , Guoren Wang
  • , Ye Yuan
  • , Junxia Wang
  • , Chungang Lin
  • , Ying Yu
  • *Corresponding author for this work
  • Northeastern University China
  • State Grid Corporation of China
  • Middle School
  • Vocational Senior School Jilin

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Despite the proliferation of work on XML keyword search, it remains open to support keyword search over uncertain XML data. In this paper, we study the problem of ELCA-based answers over uncertain XML data, which is to retrieve subtrees taking a probability of at least a threshold to be ELCA-based answers. To answer such query efficiently, we employ a filtering-and-verification strategy which is based on a proposed probabilistic inverted index, PrIndex. Based on PrIndex, we develop tight lower and upper bounds that can prune unqualified results very rapidly. After that, we propose an efficient algorithm (PrIndex-based algorithm) that combine probability threshold pruning and probability distribution of node from leaf to root to support keyword search over probabilistic XML data. Extensive experimental results demonstrate the effectiveness of the proposed algorithms.

Original languageEnglish
Title of host publication2015 12th International Conference on Fuzzy Systems and Knowledge Discovery, FSKD 2015
EditorsZhuo Tang, Jiayi Du, Shu Yin, Renfa Li, Ligang He
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages1230-1235
Number of pages6
ISBN (Electronic)9781467376822
DOIs
Publication statusPublished - 13 Jan 2016
Externally publishedYes
Event12th International Conference on Fuzzy Systems and Knowledge Discovery, FSKD 2015 - Zhangjiajie, China
Duration: 15 Aug 201517 Aug 2015

Publication series

Name2015 12th International Conference on Fuzzy Systems and Knowledge Discovery, FSKD 2015

Conference

Conference12th International Conference on Fuzzy Systems and Knowledge Discovery, FSKD 2015
Country/TerritoryChina
CityZhangjiajie
Period15/08/1517/08/15

Keywords

  • keywords search
  • probabilistic XML data
  • probability threshold

Fingerprint

Dive into the research topics of 'Keyword search over probabilistic XML data'. Together they form a unique fingerprint.

Cite this