跳到主要导航 跳到搜索 跳到主要内容

Optimization Method of Projection and Order for Multiple Tables Join

  • Fengbo Zong
  • , Yuhai Zhao*
  • , Guoren Wang
  • , Hangxu Ji
  • *此作品的通讯作者
  • Northeastern University China

科研成果: 期刊稿件文章同行评审

摘要

Multiple tables join operation is a common operation in big data processing. Similar to the common Join operations in database operations, the order of multiple tables join operation will have a great impact on the consumption of computing resources and transmission resources. The optimization of the join order of multiple tables is a classical optimization problem, and the size of the projection result of the table in each join will also affect the data volume transmitted between nodes. Therefore, the overall connection order and the projection relationship of each connection will have a significant impact on the join efficiency. But in the traditional optimization strategy, the choice of intermediate projection relation, and the influence on the optimal join strategy based on the intermediate projection relation are often not considered. In order to solve this problem, this paper establishes a connection relation index, which can adjust the projection relation of each join in the construction optimization connection strategy, delete redundant columns in time, and reduce the consumption of transmission resources. At the same time, the optimization strategy of adjusting join order based on projection relation can reduce the consumption of transmission resources and computing resources as much as possible. After the implementation in the Flink system, the optimization strategy is tested, and the results show that it has a significant optimization effect.

源语言英语
页(从-至)106-119
页数14
期刊Journal of Frontiers of Computer Science and Technology
16
1
DOI
出版状态已出版 - 1 1月 2022

指纹

探究 'Optimization Method of Projection and Order for Multiple Tables Join' 的科研主题。它们共同构成独一无二的指纹。

引用此