TY - GEN
T1 - Energy aware data layout for de-duplication system
AU - Yan, Fang
AU - Tan, Yuan
AU - Liang, Qinggang
AU - Xing, Ningning
AU - Wang, Yaolei
AU - Zhang, Xiang
PY - 2012
Y1 - 2012
N2 - Achieving power efficiency is a very critical issue in today's data centers and other information processing and storage facilities. In this paper we focus on data layout strategies for conserving energy in de-duplication storage systems. We introduce a method to find the least-cost configuration that meets known targets for de-duplication performance (as well as fault-tolerance) while archiving energy efficiency. We have built the Power-Aware De-duplication System (PADS) to reduce energy use of conventional disk based de-duplication storage systems. PADS turns on just enough disk spindles by properly estimating average chunk size, that is, we can modulate the storage disk provision according to the load fluctuation. Simultaneously, by using S-RAID[10] based data layout, disk accesses are concentrated in a part of the disks in a long time which is conducive to scheduling other disks into standby or shutdown mode. Without specialized hardware, the system can still meet performance demands. Our proposed approach reduces energy consumption of de-duplication storage system, makes de-duplication to fully integrated with the underlying storage mechanism.
AB - Achieving power efficiency is a very critical issue in today's data centers and other information processing and storage facilities. In this paper we focus on data layout strategies for conserving energy in de-duplication storage systems. We introduce a method to find the least-cost configuration that meets known targets for de-duplication performance (as well as fault-tolerance) while archiving energy efficiency. We have built the Power-Aware De-duplication System (PADS) to reduce energy use of conventional disk based de-duplication storage systems. PADS turns on just enough disk spindles by properly estimating average chunk size, that is, we can modulate the storage disk provision according to the load fluctuation. Simultaneously, by using S-RAID[10] based data layout, disk accesses are concentrated in a part of the disks in a long time which is conducive to scheduling other disks into standby or shutdown mode. Without specialized hardware, the system can still meet performance demands. Our proposed approach reduces energy consumption of de-duplication storage system, makes de-duplication to fully integrated with the underlying storage mechanism.
KW - Energy Aware
KW - data de-duplicaiton
KW - data layout
UR - http://www.scopus.com/inward/record.url?scp=84884656727&partnerID=8YFLogxK
U2 - 10.1109/PDCAT.2012.72
DO - 10.1109/PDCAT.2012.72
M3 - Conference contribution
AN - SCOPUS:84884656727
SN - 9780769548791
T3 - Parallel and Distributed Computing, Applications and Technologies, PDCAT Proceedings
SP - 511
EP - 516
BT - Proceedings - 13th International Conference on Parallel and Distributed Computing, Applications, and Technologies, PDCAT 2012
T2 - 13th International Conference on Parallel and Distributed Computing, Applications, and Technologies, PDCAT 2012
Y2 - 14 December 2012 through 16 December 2012
ER -