TY - JOUR
T1 - DAPIC
T2 - Dynamic adjustment method of parallelism for iterative computing in Flink
AU - Ji, Hangxu
AU - Sun, Yongjiao
AU - Su, Xinran
AU - Fu, Yuwei
AU - Yuan, Ye
AU - Wang, Guoren
AU - Wang, Qi
N1 - Publisher Copyright:
© 2024 Elsevier Inc.
PY - 2025/5
Y1 - 2025/5
N2 - The Apache Flink distributed computing system demonstrates significant advantages in iterative computing. Establishing appropriate parallelism for operators is crucial to further enhancing the efficiency of Flink's iterative jobs and optimizing cluster resource utilization. However, the encapsulated user-defined functions within operators cannot be directly analyzed, posing challenges for optimizing parallelism based on computation logic. Moreover, the diverse and complex nature of hardware resources in distributed computing clusters exacerbates this issue, making it difficult to model the relationship between optimal parallelism and computational resources. To address these challenges, this paper proposes the Dynamic Parallelism Adjustment (DAPIC) framework. The DAPIC framework introduces a mechanism for dynamically adjusting operator parallelism in batch processing, leveraging an enhanced GCN + GRU model. Additionally, it incorporates a resource-aware parallelism adjustment mechanism for stream processing, considering the availability of remaining computational resources. Experimental results show that the DAPIC framework improves the efficiency of iterative jobs in batch processing by 38.94% - 138.82% and in stream processing by 28.85% - 31.08%. In heterogeneous bandwidth environments, the advantages of the DAPIC framework become even more pronounced, achieving efficiency improvements exceeding 1.5 times. Furthermore, the framework effectively reduces TaskSlot occupancy, resulting in resource savings of 24.61% - 75.78%.
AB - The Apache Flink distributed computing system demonstrates significant advantages in iterative computing. Establishing appropriate parallelism for operators is crucial to further enhancing the efficiency of Flink's iterative jobs and optimizing cluster resource utilization. However, the encapsulated user-defined functions within operators cannot be directly analyzed, posing challenges for optimizing parallelism based on computation logic. Moreover, the diverse and complex nature of hardware resources in distributed computing clusters exacerbates this issue, making it difficult to model the relationship between optimal parallelism and computational resources. To address these challenges, this paper proposes the Dynamic Parallelism Adjustment (DAPIC) framework. The DAPIC framework introduces a mechanism for dynamically adjusting operator parallelism in batch processing, leveraging an enhanced GCN + GRU model. Additionally, it incorporates a resource-aware parallelism adjustment mechanism for stream processing, considering the availability of remaining computational resources. Experimental results show that the DAPIC framework improves the efficiency of iterative jobs in batch processing by 38.94% - 138.82% and in stream processing by 28.85% - 31.08%. In heterogeneous bandwidth environments, the advantages of the DAPIC framework become even more pronounced, achieving efficiency improvements exceeding 1.5 times. Furthermore, the framework effectively reduces TaskSlot occupancy, resulting in resource savings of 24.61% - 75.78%.
KW - Flink
KW - GRU
KW - Iterative computing
KW - Parallelism
KW - Time-dependent subJobGraph
UR - http://www.scopus.com/inward/record.url?scp=85213259718&partnerID=8YFLogxK
U2 - 10.1016/j.ins.2024.121803
DO - 10.1016/j.ins.2024.121803
M3 - Article
AN - SCOPUS:85213259718
SN - 0020-0255
VL - 699
JO - Information Sciences
JF - Information Sciences
M1 - 121803
ER -