TY - JOUR
T1 - ELF
T2 - Shared cache management through eliminating dead blocks and filtering less reused lines
AU - Sui, Xiu Feng
AU - Wu, Jun Min
AU - Chen, Guo Liang
AU - Tang, Yi Xuan
PY - 2011/1
Y1 - 2011/1
N2 - Modern CMP processors usually employ a shared last-level cache (LLC) managed by the LRU replacement policy or its approximations. However, as the LLC grows in capacity and associativity, the performance gap between LRU and the theoretically optimal replacement algorithm has widened. Various alternative cache management techniques have been proposed to address this problem, but each covers only a single type of memory access behavior and exploits little frequency information of cache accesses, and thus offers limited performance benefits. In this paper, we propose a unified cache management policy, ELF, which covers a variety of memory behaviors and exploits both the recency and the frequency information of a program simultaneously. Motivated by the observation that cache blocks often exhibit a small number of uses during their lifetime in the LLC, ELF is designed to (1) predict dead lines through a counter-based mechanism and evict them early, and (2) filter less reused blocks through dynamic insertion and promotion policies. Thereby, potentially live blocks are retained and most of the working set remains undisturbed in the ELF-managed L2 cache. Our evaluation on 4-way CMPs shows that ELF improves overall performance by 14.5% on average over the LRU policy, and the performance benefit of ELF is 1.06x compared to PIPP and 1.09x compared to TADIP.
AB - Modern CMP processors usually employ a shared last-level cache (LLC) managed by the LRU replacement policy or its approximations. However, as the LLC grows in capacity and associativity, the performance gap between LRU and the theoretically optimal replacement algorithm has widened. Various alternative cache management techniques have been proposed to address this problem, but each covers only a single type of memory access behavior and exploits little frequency information of cache accesses, and thus offers limited performance benefits. In this paper, we propose a unified cache management policy, ELF, which covers a variety of memory behaviors and exploits both the recency and the frequency information of a program simultaneously. Motivated by the observation that cache blocks often exhibit a small number of uses during their lifetime in the LLC, ELF is designed to (1) predict dead lines through a counter-based mechanism and evict them early, and (2) filter less reused blocks through dynamic insertion and promotion policies. Thereby, potentially live blocks are retained and most of the working set remains undisturbed in the ELF-managed L2 cache. Our evaluation on 4-way CMPs shows that ELF improves overall performance by 14.5% on average over the LRU policy, and the performance benefit of ELF is 1.06x compared to PIPP and 1.09x compared to TADIP.
KW - Counter-based algorithms
KW - Insertion policy
KW - Multi-core
KW - Replacement algorithms
KW - Shared cache
UR - http://www.scopus.com/inward/record.url?scp=79953110685&partnerID=8YFLogxK
U2 - 10.3724/SP.J.1016.2011.00143
DO - 10.3724/SP.J.1016.2011.00143
M3 - Article
AN - SCOPUS:79953110685
SN - 0254-4164
VL - 34
SP - 143
EP - 153
JO - Jisuanji Xuebao/Chinese Journal of Computers
JF - Jisuanji Xuebao/Chinese Journal of Computers
IS - 1
ER -