跳到主要导航 跳到搜索 跳到主要内容

A Data-aware Learned Index Scheme for Efficient Writes

  • Li Liu
  • , Chunhua Li*
  • , Zhou Zhang
  • , Yuhan Liu
  • , Ke Zhou
  • , Ji Zhang
  • *此作品的通讯作者

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Index structure is very important for efficient data access and system performance in the storage system. Learned index utilizes recursive index models to replace range index structure (such as B+ Tree) so as to predict the position of a lookup key in a dataset. This new paradigm greatly reduces query time and index size, however it only supports read-only workloads. Although some studies reserve gaps between keys for new data to support update, they incur high memory space and shift cost when a large number of data are inserted. In this paper, we propose a data-aware learned index scheme with high scalability, called EWALI, which constructs index models based on a lightweight data-aware data partition algorithm. When the data distribution changes, EWALI can automatically split the related leaf nodes and retrain the corresponding models to accommodate different workloads. In addition, EWALI designs an alternative duel buffers to handle new data and adopts the delayed update mechanism to merge data, greatly reducing write locking and improving write performance. We evaluate EWALI with real-world and synthetic datasets. Extensive experimental results show that EWALI reduces write latency respectively by 60.9% and 33.7% than state-of-the-art Fitting-Tree and XIndex, and achieves up to 3.1 × performance improvement in terms of range query comparing with XIndex.

源语言英语
主期刊名51st International Conference on Parallel Processing, ICPP 2022 - Main Conference Proceedings
出版商Association for Computing Machinery
ISBN(电子版)9781450397339
DOI
出版状态已出版 - 29 8月 2022
已对外发布
活动51st International Conference on Parallel Processing, ICPP 2022 - Virtual, Online, 法国
期限: 29 8月 20221 9月 2022

出版系列

姓名ACM International Conference Proceeding Series

会议

会议51st International Conference on Parallel Processing, ICPP 2022
国家/地区法国
Virtual, Online
时期29/08/221/09/22

指纹

探究 'A Data-aware Learned Index Scheme for Efficient Writes' 的科研主题。它们共同构成独一无二的指纹。

引用此