DeepEE: Joint optimization of job scheduling and cooling control for data center energy efficiency using deep reinforcement learning

Yongyi Ran, Han Hu, Xin Zhou, Yonggang Wen

科研成果: 书/报告/会议事项章节会议稿件同行评审

80 引用 (Scopus)

摘要

The past decade witnessed the tremendous growth of power consumption in data centers due to the rapid development of cloud computing, big data analytics, and machine learning, etc. The prior approaches that optimize the power consumption of the information technology (IT) system and/or the cooling system always fail to capture the system dynamics or suffer from the complexity of system states and action spaces. In this paper, we propose a Deep Reinforcement Learning (DRL) based optimization framework, named DeepEE, to improve the energy efficiency for data centers by considering the IT and cooling systems concurrently. In DeepEE, we first propose a PArameterized action space based Deep Q-Network (PADQN) algorithm to solve the hybrid action space problem and jointly optimize the job scheduling for the IT system and the airflow rate adjustment for the cooling system. Then, a two-time-scale control mechanism is applied in PADQN to coordinate the IT and cooling systems more accurately and efficiently. In addition, to train and evaluate the proposed PADQN in a safe and quick way, we build a simulation platform to model the dynamics of IT workload and cooling systems simultaneously. Through extensive real-trace based simulations, we demonstrate that: 1) our algorithm can save up to 15% and 10% energy consumption in comparison with the baseline siloed and joint optimization approaches respectively; 2) our algorithm achieves more stable performance gain in terms of power consumption by adopting the parameterized action space; and 3) our algorithm leads to a better tradeoff between energy saving and service quality.

源语言英语
主期刊名Proceedings - 2019 39th IEEE International Conference on Distributed Computing Systems, ICDCS 2019
出版商Institute of Electrical and Electronics Engineers Inc.
645-655
页数11
ISBN(电子版)9781728125190
DOI
出版状态已出版 - 7月 2019
活动39th IEEE International Conference on Distributed Computing Systems, ICDCS 2019 - Richardson, 美国
期限: 7 7月 20199 7月 2019

出版系列

姓名Proceedings - International Conference on Distributed Computing Systems
2019-July

会议

会议39th IEEE International Conference on Distributed Computing Systems, ICDCS 2019
国家/地区美国
Richardson
时期7/07/199/07/19

指纹

探究 'DeepEE: Joint optimization of job scheduling and cooling control for data center energy efficiency using deep reinforcement learning' 的科研主题。它们共同构成独一无二的指纹。

引用此