Abstract
Learned database query optimizers, which are typically powered by (deep) learning models, have attracted significant attention recently because they can offer performance similar to, or even better than, state-of-the-art commercial optimizers that require hundreds of expert-hours to tune. A crucial factor in successfully training a learned optimizer is the training queries. Unfortunately, a query workload sufficient for training learned optimizers is not always available. This study proposes a framework, called AlphaQO, for generating queries for learned optimizers with reinforcement learning (RL). AlphaQO is a loop system that consists of two main components: a query generator and a learned optimizer. The query generator aims to generate “hard” queries (i.e., queries for which the learned optimizer produces poor estimates). The learned optimizer is trained on the generated queries and provides feedback (in the form of numerical rewards) to the query generator: if the generated queries are good, the query generator receives a high reward; otherwise, it receives a low reward. This process is performed iteratively, with the main goal that, within a small budget, the learned optimizer can be trained to generalize well to a wide range of unseen queries. Extensive experiments show that AlphaQO can generate a relatively small number of queries and train a learned optimizer that outperforms commercial optimizers. Moreover, the learned optimizer requires far fewer queries from AlphaQO than randomly generated queries in order to be trained well.
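The generator–optimizer feedback loop described above can be illustrated with a minimal sketch. The Python code below is a toy simulation only: the class names (`QueryGenerator`, `LearnedOptimizer`), the scalar “difficulty” representation of a query, and the update rules are all assumptions for illustration, not the paper's actual implementation, which generates SQL workloads and trains a deep-learning-based optimizer.

```python
# Toy sketch of the AlphaQO loop: a generator is rewarded for producing queries
# that the learned optimizer estimates poorly; the optimizer then trains on them.
# All names and update rules are hypothetical placeholders.

import random


class LearnedOptimizer:
    """Stand-in for the learned query optimizer being trained."""

    def __init__(self):
        self.knowledge = 0.0  # toy proxy for how well the model is trained

    def estimation_error(self, query):
        # Harder queries yield larger errors until the model adapts.
        return max(0.0, query - self.knowledge + random.uniform(-0.1, 0.1))

    def train(self, queries):
        # Training on the generated queries improves the model (toy update rule).
        if queries:
            self.knowledge += 0.1 * max(queries)


class QueryGenerator:
    """RL-style generator rewarded for producing 'hard' queries."""

    def __init__(self):
        self.difficulty = 0.5  # policy parameter: target query difficulty

    def generate(self, n=5):
        return [self.difficulty + random.uniform(-0.2, 0.2) for _ in range(n)]

    def update(self, reward):
        # A high reward (the optimizer struggled) pushes the policy toward harder queries.
        self.difficulty += 0.05 * reward


if __name__ == "__main__":
    optimizer = LearnedOptimizer()
    generator = QueryGenerator()
    for iteration in range(10):  # small budget of generation rounds
        queries = generator.generate()
        reward = sum(optimizer.estimation_error(q) for q in queries) / len(queries)
        generator.update(reward)   # generator rewarded for hard queries
        optimizer.train(queries)   # optimizer trained on the same queries
        print(f"iter {iteration}: reward={reward:.3f}")
```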
| Translated title of the contribution | AlphaQO: Robust Learned Query Optimizer |
|---|---|
| Original language | Chinese (Traditional) |
| Pages (from-to) | 814-831 |
| Number of pages | 18 |
| Journal | Ruan Jian Xue Bao/Journal of Software |
| Volume | 33 |
| Issue number | 3 |
| DOIs | |
| Publication status | Published - Mar 2022 |
| Externally published | Yes |