TY - JOUR
T1 - Dynamic data processing, backup and abort strategy for cloud computing systems
AU - Wang, Jian Cai
AU - Qiu, Zhengxia
AU - Qiu, Qingan
N1 - Publisher Copyright:
© 2026 Elsevier Ltd.
PY - 2026/12
Y1 - 2026/12
N2 - Cloud computing systems are vulnerable to random shocks during operation, which may lead to performance degradation, system failure, and data loss. Data backup and mission abort are two key measures for mitigating failure risk and reducing data loss. However, existing backup and mission abort strategies, largely based on periodic schedules or static abort thresholds, are often insufficiently adaptive under deteriorating system conditions and stochastic mission progress. This paper develops a state-dependent dynamic framework that jointly optimizes sequential backup and abort decisions at discrete decision epochs. At each decision epoch, the system dynamically determines whether to remain in the current operating mode, switch between data processing and data backup, or abort the mission, based on the current state defined by the processed workload, the backed-up workload, the elapsed time, and the cumulative shock count. Within this framework, a probabilistic state-transition model is established, from which recursive expressions for the mission success probability and expected loss are derived. The optimal policy and the corresponding optimal decision interval are then obtained by backward dynamic programming. Numerical results show that the dynamic policy outperforms benchmark static policies in terms of expected loss and exhibits clear structural properties. Robustness analyses under several extended settings further confirm the effectiveness and practical applicability of the proposed framework.
AB - Cloud computing systems are vulnerable to random shocks during operation, which may lead to performance degradation, system failure, and data loss. Data backup and mission abort are two key measures for mitigating failure risk and reducing data loss. However, existing backup and mission abort strategies, largely based on periodic schedules or static abort thresholds, are often insufficiently adaptive under deteriorating system conditions and stochastic mission progress. This paper develops a state-dependent dynamic framework that jointly optimizes sequential backup and abort decisions at discrete decision epochs. At each decision epoch, the system dynamically determines whether to remain in the current operating mode, switch between data processing and data backup, or abort the mission, based on the current state defined by the processed workload, the backed-up workload, the elapsed time, and the cumulative shock count. Within this framework, a probabilistic state-transition model is established, from which recursive expressions for the mission success probability and expected loss are derived. The optimal policy and the corresponding optimal decision interval are then obtained by backward dynamic programming. Numerical results show that the dynamic policy outperforms benchmark static policies in terms of expected loss and exhibits clear structural properties. Robustness analyses under several extended settings further confirm the effectiveness and practical applicability of the proposed framework.
KW - Cloud computing systems
KW - Data backup
KW - Expected loss
KW - Mission abort
KW - Mission success probability
UR - https://www.scopus.com/pages/publications/105039518168
U2 - 10.1016/j.ress.2026.112835
DO - 10.1016/j.ress.2026.112835
M3 - Article
AN - SCOPUS:105039518168
SN - 0951-8320
VL - 276
JO - Reliability Engineering and System Safety
JF - Reliability Engineering and System Safety
M1 - 112835
ER -