TY - JOUR
T1 - Qora
T2 - Neural-Enhanced Interference-Aware Resource Provisioning for Serverless Computing
AU - Ma, Ruifeng
AU - Zhan, Yufeng
AU - Wu, Chuge
AU - Hong, Zicong
AU - Ali, Yasir
AU - Xia, Yuanqing
N1 - Publisher Copyright:
© 2004-2012 IEEE.
PY - 2025
Y1 - 2025
N2 - Serverless is an emerging cloud paradigm that offers fine-grained resource sharing through serverless functions. However, this resource sharing can cause interference, leading to performance degradation and QoS violations. Existing white box-based approaches for serverless resource provision often demand extensive expert knowledge, which is challenging to obtain due to the complexity of interference sources. This paper proposes QORA, a neural-enhanced interference-aware resource provisioning system for serverless computing. We model the resource provisioning of serverless functions as a novel combinatorial optimization problem, wherein the constraints on the queries per second are derived from neural network performance model. By leveraging neural networks to model the nonlinear performance fluctuations under various interference sources, our approach better captures the real-world behavior of serverless functions. To solve the formulated problem efficiently, rather than adopting commercial optimizer solvers like Gurobi, we propose a two-stage-VNS algorithm that searches discrete variables more efficiently and supports Sigmoid activations, avoiding introducing redundant discrete variables. Unlike pure machine learning methods lacking theoretical optimal guarantees, our approach is rigorously proven globally optimal based on optimization theory. We implement QORA on Kubernetes as a serverless system automating resource provisioning. Experimental results demonstrate that QORA reduces the QoS violation rate by 98% while reducing up to 35% resource costs compared with the state-of-the-arts.
AB - Serverless is an emerging cloud paradigm that offers fine-grained resource sharing through serverless functions. However, this resource sharing can cause interference, leading to performance degradation and QoS violations. Existing white box-based approaches for serverless resource provision often demand extensive expert knowledge, which is challenging to obtain due to the complexity of interference sources. This paper proposes QORA, a neural-enhanced interference-aware resource provisioning system for serverless computing. We model the resource provisioning of serverless functions as a novel combinatorial optimization problem, wherein the constraints on the queries per second are derived from neural network performance model. By leveraging neural networks to model the nonlinear performance fluctuations under various interference sources, our approach better captures the real-world behavior of serverless functions. To solve the formulated problem efficiently, rather than adopting commercial optimizer solvers like Gurobi, we propose a two-stage-VNS algorithm that searches discrete variables more efficiently and supports Sigmoid activations, avoiding introducing redundant discrete variables. Unlike pure machine learning methods lacking theoretical optimal guarantees, our approach is rigorously proven globally optimal based on optimization theory. We implement QORA on Kubernetes as a serverless system automating resource provisioning. Experimental results demonstrate that QORA reduces the QoS violation rate by 98% while reducing up to 35% resource costs compared with the state-of-the-arts.
KW - performance interference
KW - resource provisioning
KW - Serverless computing
UR - http://www.scopus.com/inward/record.url?scp=105003185036&partnerID=8YFLogxK
U2 - 10.1109/TASE.2025.3526197
DO - 10.1109/TASE.2025.3526197
M3 - Article
AN - SCOPUS:105003185036
SN - 1545-5955
VL - 22
SP - 10609
EP - 10624
JO - IEEE Transactions on Automation Science and Engineering
JF - IEEE Transactions on Automation Science and Engineering
ER -