Distributed xml twig query processing using mapreduce

  • Xin Bi
  • , Guoren Wang
  • , Xiangguo Zhao
  • , Zhen Zhang
  • , Shuang Chen

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

3 Citations (Scopus)

Abstract

Twig query processing is one of the core operations of XML queries. Centralized holistic twig algorithms suffer great efficiency losses when large-scale XML documents are partitioned and stored in the cloud. Previous work on distributed twig query processing have some limitations, e.g., utter dependence on priori knowledge of query patterns, iteration of MapReduce jobs, etc. In this paper, our arbitrary XML partitioning and storage strategy require no knowledge of query pattern; twig queries can be efficiently processed in a single-round MapReduce job with good scalability. Extensive experiments are conducted to verify the efficiency and scalability of our algorithms.

Original languageEnglish
Title of host publicationWeb Technologies and Applications - 17th Asia-PacificWeb Conference,APWeb 2015, Proceedings
EditorsReynold Cheng, Bin Cui, Zhenjie Zhang, Ruichu Cai, Jia Xu
PublisherSpringer Verlag
Pages203-214
Number of pages12
ISBN (Print)9783319252544
DOIs
Publication statusPublished - 2015
Externally publishedYes
Event17th Asia-PacificWeb Conference, APWeb 2015 - Guangzhou, China
Duration: 18 Sept 201520 Sept 2015

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume9313
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference17th Asia-PacificWeb Conference, APWeb 2015
Country/TerritoryChina
CityGuangzhou
Period18/09/1520/09/15

Fingerprint

Dive into the research topics of 'Distributed xml twig query processing using mapreduce'. Together they form a unique fingerprint.

Cite this