A Multi-Way Spatial Join Querying Processing Algorithm Based on Spark

  • Baiyou Qiao
  • Junhai Zhu
  • Yujie Zheng
  • Muchuan Shen
  • Guoren Wang
  • Northeastern University China
  • Brigham Young University

Research output: Contribution to journal, Article, peer-reviewed

Abstract

To address spatial join query processing in cloud computing systems, this paper proposes BSMWSJ, a multi-way spatial join query processing algorithm built on the Spark platform. The algorithm uses grid partitioning to divide the whole data space into equal-sized grid cells and distributes the spatial objects of each data set into these cells according to their spatial locations. Spatial objects in different grid cells are processed in parallel. For multi-way spatial join processing, a boundary filtering method is proposed: it computes the minimum bounding rectangles (MBRs) of the candidate results produced by the previous join step and uses these MBRs to filter the data sets of the subsequent joins. This filters out useless spatial objects and reduces the redundant projection and replication of spatial objects. In addition, a duplication avoidance strategy reduces the output of redundant results and further lowers the cost of subsequent join processing. Experiments on synthetic and real data sets show that BSMWSJ outperforms existing multi-way spatial join query processing algorithms.
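
The three ideas in the abstract (grid partitioning, boundary filtering with MBRs, and duplication avoidance) can be illustrated with a small single-machine sketch. This is not the BSMWSJ implementation: it runs in plain Python rather than on Spark, all function names are hypothetical, the duplication avoidance step assumes the common reference-point technique, and the boundary filter assumes a single MBR computed over all candidate results of a chain query (A intersects B, B intersects C). The abstract does not specify these details.

```python
from collections import defaultdict
from itertools import product

# A rectangle (MBR) is a tuple (xmin, ymin, xmax, ymax).

def intersects(a, b):
    """True if two rectangles overlap or touch."""
    return a[0] <= b[2] and b[0] <= a[2] and a[1] <= b[3] and b[1] <= a[3]

def cell_of(x, y, cell):
    """Grid cell (column, row) that contains the point (x, y)."""
    return (int(x // cell), int(y // cell))

def cells_for(r, cell):
    """All grid cells overlapped by rectangle r; r is replicated into each."""
    c0, r0 = cell_of(r[0], r[1], cell)
    c1, r1 = cell_of(r[2], r[3], cell)
    return product(range(c0, c1 + 1), range(r0, r1 + 1))

def partition(rects, cell):
    """Distribute rectangles into equal-sized grid cells by location."""
    grid = defaultdict(list)
    for r in rects:
        for c in cells_for(r, cell):
            grid[c].append(r)
    return grid

def grid_join(left, right, cell):
    """Two-way intersection join; each cell is independent, so cells
    could be processed in parallel."""
    gl, gr = partition(left, cell), partition(right, cell)
    out = []
    for c in gl.keys() & gr.keys():
        for a in gl[c]:
            for b in gr[c]:
                if intersects(a, b):
                    # Duplication avoidance (assumed reference-point rule):
                    # report the pair only in the cell that holds the
                    # lower-left corner of the intersection area.
                    ref = (max(a[0], b[0]), max(a[1], b[1]))
                    if cell_of(*ref, cell) == c:
                        out.append((a, b))
    return out

def bounding_mbr(rects):
    """Smallest rectangle enclosing all given rectangles."""
    x0, y0, x1, y1 = zip(*rects)
    return (min(x0), min(y0), max(x1), max(y1))

def chain_join(a_set, b_set, c_set, cell):
    """Three-way chain join: A intersects B, and B intersects C."""
    pairs = grid_join(a_set, b_set, cell)
    if not pairs:
        return []
    # Boundary filtering (assumed form): objects of C outside the MBR of
    # the candidate results cannot join, so drop them before replication.
    bound = bounding_mbr([b for _, b in pairs])
    c_kept = [c for c in c_set if intersects(c, bound)]
    matches = defaultdict(list)
    for b, c in grid_join(list({b for _, b in pairs}), c_kept, cell):
        matches[b].append(c)
    return [(a, b, c) for a, b in pairs for c in matches[b]]
```

For example, with a cell size of 4, the rectangles (1, 1, 6, 6) and (2, 2, 7, 7) are each replicated into four cells, yet the pair is reported once; a third data set containing (5, 5, 9, 9) and (20, 20, 21, 21) is reduced to the first rectangle by the boundary filter before the second join runs.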

Original language: English
Pages (from-to): 1592-1602
Number of pages: 11
Journal: Jisuanji Yanjiu yu Fazhan/Computer Research and Development
Volume: 54
Issue: 7
DOI
Publication status: Published - 1 Jul 2017
Externally published
