Consistent snapshot algorithms for in-memory database systems: Experiments and analysis

Liang Li; Guoren Wang; Gang Wu; Ye Yuan

doi:10.1109/ICDE.2018.00131

Consistent snapshot algorithms for in-memory database systems: Experiments and analysis

Liang Li, Guoren Wang, Gang Wu, Ye Yuan

School of Computer Science and Technology

Northeastern University China

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

12 Citations (Scopus)

Abstract

In-memory databases (IMDBs) are gaining increasing popularity in big data applications, where clients commit updates intensively. Consistent snapshot is a key step in backup and recovery of IMDBs, thus an important factor for system performance of IMDBs. Formally, the in-memory consistent snapshot problem refers to taking an in-memory consistent time-in-point snapshot with the constraints that 1) clients can read the latest data items, and 2) any data item in the snapshot should not be overwritten. Various snapshot algorithms have been proposed in the academia to trade off throughput and latency, yet industrial IMDBs such as Redis still stick to the simple fork algorithm. As an understanding of this phenomenon, we conduct comprehensive performance evaluations on mainstream snapshot algorithms. Surprisingly, we observe that the simple fork algorithm indeed outperforms the state-of-The-Arts in update-intensive workload scenarios. On this basis, we identify the drawbacks of existing research and propose two lightweight improvements. Extensive evaluations on synthetic data and Redis show that our lightweight improvements yield better performance than fork, the current industrial standard, and the representative snapshot algorithms from the academia. Finally, we have opensourced the implementation of all the above snapshot algorithms to facilitate practitioners to benchmark the performance of each algorithm and select proper methods for different application scenarios.

Original language	English
Title of host publication	Proceedings - IEEE 34th International Conference on Data Engineering, ICDE 2018
Publisher	Institute of Electrical and Electronics Engineers Inc.
Pages	1264-1267
Number of pages	4
ISBN (Electronic)	9781538655207
DOIs	https://doi.org/10.1109/ICDE.2018.00131
Publication status	Published - 24 Oct 2018
Event	34th IEEE International Conference on Data Engineering, ICDE 2018 - Paris, France Duration: 16 Apr 2018 → 19 Apr 2018

Publication series

Name	Proceedings - IEEE 34th International Conference on Data Engineering, ICDE 2018

Conference

Conference	34th IEEE International Conference on Data Engineering, ICDE 2018
Country/Territory	France
City	Paris
Period	16/04/18 → 19/04/18

Keywords

Consistent Snapshot
In Memory databases

Access to Document

10.1109/ICDE.2018.00131

Cite this

Li, L., Wang, G., Wu, G., & Yuan, Y. (2018). Consistent snapshot algorithms for in-memory database systems: Experiments and analysis. In Proceedings - IEEE 34th International Conference on Data Engineering, ICDE 2018 (pp. 1264-1267). Article 8509352 (Proceedings - IEEE 34th International Conference on Data Engineering, ICDE 2018). Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/ICDE.2018.00131

Li, Liang ; Wang, Guoren ; Wu, Gang et al. / Consistent snapshot algorithms for in-memory database systems : Experiments and analysis. Proceedings - IEEE 34th International Conference on Data Engineering, ICDE 2018. Institute of Electrical and Electronics Engineers Inc., 2018. pp. 1264-1267 (Proceedings - IEEE 34th International Conference on Data Engineering, ICDE 2018).

@inproceedings{e21660c967cb404cb754e4813c945c56,

title = "Consistent snapshot algorithms for in-memory database systems: Experiments and analysis",

abstract = "In-memory databases (IMDBs) are gaining increasing popularity in big data applications, where clients commit updates intensively. Consistent snapshot is a key step in backup and recovery of IMDBs, thus an important factor for system performance of IMDBs. Formally, the in-memory consistent snapshot problem refers to taking an in-memory consistent time-in-point snapshot with the constraints that 1) clients can read the latest data items, and 2) any data item in the snapshot should not be overwritten. Various snapshot algorithms have been proposed in the academia to trade off throughput and latency, yet industrial IMDBs such as Redis still stick to the simple fork algorithm. As an understanding of this phenomenon, we conduct comprehensive performance evaluations on mainstream snapshot algorithms. Surprisingly, we observe that the simple fork algorithm indeed outperforms the state-of-The-Arts in update-intensive workload scenarios. On this basis, we identify the drawbacks of existing research and propose two lightweight improvements. Extensive evaluations on synthetic data and Redis show that our lightweight improvements yield better performance than fork, the current industrial standard, and the representative snapshot algorithms from the academia. Finally, we have opensourced the implementation of all the above snapshot algorithms to facilitate practitioners to benchmark the performance of each algorithm and select proper methods for different application scenarios.",

keywords = "Consistent Snapshot, In Memory databases",

author = "Liang Li and Guoren Wang and Gang Wu and Ye Yuan",

note = "Publisher Copyright: {\textcopyright} 2018 IEEE.; 34th IEEE International Conference on Data Engineering, ICDE 2018 ; Conference date: 16-04-2018 Through 19-04-2018",

year = "2018",

month = oct,

day = "24",

doi = "10.1109/ICDE.2018.00131",

language = "English",

series = "Proceedings - IEEE 34th International Conference on Data Engineering, ICDE 2018",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

pages = "1264--1267",

booktitle = "Proceedings - IEEE 34th International Conference on Data Engineering, ICDE 2018",

address = "United States",

}

Li, L, Wang, G, Wu, G & Yuan, Y 2018, Consistent snapshot algorithms for in-memory database systems: Experiments and analysis. in Proceedings - IEEE 34th International Conference on Data Engineering, ICDE 2018., 8509352, Proceedings - IEEE 34th International Conference on Data Engineering, ICDE 2018, Institute of Electrical and Electronics Engineers Inc., pp. 1264-1267, 34th IEEE International Conference on Data Engineering, ICDE 2018, Paris, France, 16/04/18. https://doi.org/10.1109/ICDE.2018.00131

Consistent snapshot algorithms for in-memory database systems: Experiments and analysis. / Li, Liang; Wang, Guoren; Wu, Gang et al.
Proceedings - IEEE 34th International Conference on Data Engineering, ICDE 2018. Institute of Electrical and Electronics Engineers Inc., 2018. p. 1264-1267 8509352 (Proceedings - IEEE 34th International Conference on Data Engineering, ICDE 2018).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

TY - GEN

T1 - Consistent snapshot algorithms for in-memory database systems

T2 - 34th IEEE International Conference on Data Engineering, ICDE 2018

AU - Li, Liang

AU - Wang, Guoren

AU - Wu, Gang

AU - Yuan, Ye

PY - 2018/10/24

Y1 - 2018/10/24

N2 - In-memory databases (IMDBs) are gaining increasing popularity in big data applications, where clients commit updates intensively. Consistent snapshot is a key step in backup and recovery of IMDBs, thus an important factor for system performance of IMDBs. Formally, the in-memory consistent snapshot problem refers to taking an in-memory consistent time-in-point snapshot with the constraints that 1) clients can read the latest data items, and 2) any data item in the snapshot should not be overwritten. Various snapshot algorithms have been proposed in the academia to trade off throughput and latency, yet industrial IMDBs such as Redis still stick to the simple fork algorithm. As an understanding of this phenomenon, we conduct comprehensive performance evaluations on mainstream snapshot algorithms. Surprisingly, we observe that the simple fork algorithm indeed outperforms the state-of-The-Arts in update-intensive workload scenarios. On this basis, we identify the drawbacks of existing research and propose two lightweight improvements. Extensive evaluations on synthetic data and Redis show that our lightweight improvements yield better performance than fork, the current industrial standard, and the representative snapshot algorithms from the academia. Finally, we have opensourced the implementation of all the above snapshot algorithms to facilitate practitioners to benchmark the performance of each algorithm and select proper methods for different application scenarios.

AB - In-memory databases (IMDBs) are gaining increasing popularity in big data applications, where clients commit updates intensively. Consistent snapshot is a key step in backup and recovery of IMDBs, thus an important factor for system performance of IMDBs. Formally, the in-memory consistent snapshot problem refers to taking an in-memory consistent time-in-point snapshot with the constraints that 1) clients can read the latest data items, and 2) any data item in the snapshot should not be overwritten. Various snapshot algorithms have been proposed in the academia to trade off throughput and latency, yet industrial IMDBs such as Redis still stick to the simple fork algorithm. As an understanding of this phenomenon, we conduct comprehensive performance evaluations on mainstream snapshot algorithms. Surprisingly, we observe that the simple fork algorithm indeed outperforms the state-of-The-Arts in update-intensive workload scenarios. On this basis, we identify the drawbacks of existing research and propose two lightweight improvements. Extensive evaluations on synthetic data and Redis show that our lightweight improvements yield better performance than fork, the current industrial standard, and the representative snapshot algorithms from the academia. Finally, we have opensourced the implementation of all the above snapshot algorithms to facilitate practitioners to benchmark the performance of each algorithm and select proper methods for different application scenarios.

KW - Consistent Snapshot

KW - In Memory databases

UR - http://www.scopus.com/inward/record.url?scp=85057111178&partnerID=8YFLogxK

U2 - 10.1109/ICDE.2018.00131

DO - 10.1109/ICDE.2018.00131

M3 - Conference contribution

AN - SCOPUS:85057111178

T3 - Proceedings - IEEE 34th International Conference on Data Engineering, ICDE 2018

SP - 1264

EP - 1267

BT - Proceedings - IEEE 34th International Conference on Data Engineering, ICDE 2018

PB - Institute of Electrical and Electronics Engineers Inc.

Y2 - 16 April 2018 through 19 April 2018

ER -

Li L, Wang G, Wu G, Yuan Y. Consistent snapshot algorithms for in-memory database systems: Experiments and analysis. In Proceedings - IEEE 34th International Conference on Data Engineering, ICDE 2018. Institute of Electrical and Electronics Engineers Inc. 2018. p. 1264-1267. 8509352. (Proceedings - IEEE 34th International Conference on Data Engineering, ICDE 2018). doi: 10.1109/ICDE.2018.00131

Consistent snapshot algorithms for in-memory database systems: Experiments and analysis

Abstract

Publication series

Conference

Keywords

Access to Document

Other files and links

Fingerprint

Cite this