TY - JOUR
T1 - A Survey on Deep Reinforcement Learning for Data Processing and Analytics
AU - Cai, Qingpeng
AU - Cui, Can
AU - Xiong, Yiyuan
AU - Wang, Wei
AU - Xie, Zhongle
AU - Zhang, Meihui
N1 - Publisher Copyright:
© 1989-2012 IEEE.
PY - 2023/5/1
Y1 - 2023/5/1
N2 - Data processing and analytics are fundamental and pervasive. Algorithms play a vital role in data processing and analytics where many algorithm designs have incorporated heuristics and general rules from human knowledge and experience to improve their effectiveness. Recently, reinforcement learning, deep reinforcement learning (DRL) in particular, is increasingly explored and exploited in many areas because it can learn better strategies in complicated environments it is interacting with than statically designed algorithms. Motivated by this trend, we provide a comprehensive review of recent works focusing on utilizing DRL to improve data processing and analytics. First, we present an introduction to key concepts, theories, and methods in DRL. Next, we discuss DRL deployment on database systems, facilitating data processing and analytics in various aspects, including data organization, scheduling, tuning, and indexing. Then, we survey the application of DRL in data processing and analytics, ranging from data preparation, natural language processing to healthcare, fintech, etc. Finally, we discuss important open challenges and future research directions of using DRL in data processing and analytics.
AB - Data processing and analytics are fundamental and pervasive. Algorithms play a vital role in data processing and analytics where many algorithm designs have incorporated heuristics and general rules from human knowledge and experience to improve their effectiveness. Recently, reinforcement learning, deep reinforcement learning (DRL) in particular, is increasingly explored and exploited in many areas because it can learn better strategies in complicated environments it is interacting with than statically designed algorithms. Motivated by this trend, we provide a comprehensive review of recent works focusing on utilizing DRL to improve data processing and analytics. First, we present an introduction to key concepts, theories, and methods in DRL. Next, we discuss DRL deployment on database systems, facilitating data processing and analytics in various aspects, including data organization, scheduling, tuning, and indexing. Then, we survey the application of DRL in data processing and analytics, ranging from data preparation, natural language processing to healthcare, fintech, etc. Finally, we discuss important open challenges and future research directions of using DRL in data processing and analytics.
KW - Deep reinforcement learning
KW - data processing and analytics
KW - database
KW - system optimization
UR - http://www.scopus.com/inward/record.url?scp=85125724952&partnerID=8YFLogxK
U2 - 10.1109/TKDE.2022.3155196
DO - 10.1109/TKDE.2022.3155196
M3 - Article
AN - SCOPUS:85125724952
SN - 1041-4347
VL - 35
SP - 4446
EP - 4465
JO - IEEE Transactions on Knowledge and Data Engineering
JF - IEEE Transactions on Knowledge and Data Engineering
IS - 5
ER -