RL-ISLAP: A Reinforcement Learning Framework for Industrial-Scale Linear Assignment Problems at Alipay

Hanjie Li, Yue Ning, Yang Bao, Changsheng Li*, Boxiao Chen, Xingyu Lu, Ye Yuan, Guoren Wang

*此作品的通讯作者

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Industrial-scale linear assignment problems (LAPs) are frequently encountered in various industrial scenarios, e.g., asset allocation within the domain of credit management. However, optimization algorithms for such problems (e.g., PJ-ADMM) are highly sensitive to hyper-parameters. Existing solving systems rely on empirical parameter selection, which is challenging to achieve convergence and extremely time-consuming. Additionally, the resulting parameter rules are often inefficient. To alleviate this issue, we propose RL-ISLAP, an efficient and lightweight Reinforcement Learning framework for Industrial-Scale Linear Assignment Problems. We formulate the hyper-parameter selection for PJ-ADMM as a sequential decision problem and leverage reinforcement learning to enhance its convergence. Addressing the sparse reward challenge inherent in learning policies for such problems, we devise auxiliary rewards to provide dense signals for policy optimization, and present a rollback mechanism to prevent divergence in the solving process. Experiments on OR-Library benchmark demonstrate that our method is competitive to SOTA stand-alone solvers. Furthermore, the scale-independent design of observations enables us to transfer the acquired hyper-parameter policy to a scenario of LAPs in varying scales. On two real-world industrial-scale LAPs with up to 10 millions of decision variables, our proposed RL-ISLAP achieves solutions of comparable quality in 2/3 of the time when compared to the SOTA distributed solving system employing fine-tuned empirical parameter rules.

源语言英语
主期刊名CIKM 2024 - Proceedings of the 33rd ACM International Conference on Information and Knowledge Management
出版商Association for Computing Machinery
4661-4668
页数8
ISBN(电子版)9798400704369
DOI
出版状态已出版 - 21 10月 2024
活动33rd ACM International Conference on Information and Knowledge Management, CIKM 2024 - Boise, 美国
期限: 21 10月 202425 10月 2024

出版系列

姓名International Conference on Information and Knowledge Management, Proceedings
ISSN(印刷版)2155-0751

会议

会议33rd ACM International Conference on Information and Knowledge Management, CIKM 2024
国家/地区美国
Boise
时期21/10/2425/10/24

指纹

探究 'RL-ISLAP: A Reinforcement Learning Framework for Industrial-Scale Linear Assignment Problems at Alipay' 的科研主题。它们共同构成独一无二的指纹。

引用此

Li, H., Ning, Y., Bao, Y., Li, C., Chen, B., Lu, X., Yuan, Y., & Wang, G. (2024). RL-ISLAP: A Reinforcement Learning Framework for Industrial-Scale Linear Assignment Problems at Alipay. 在 CIKM 2024 - Proceedings of the 33rd ACM International Conference on Information and Knowledge Management (页码 4661-4668). (International Conference on Information and Knowledge Management, Proceedings). Association for Computing Machinery. https://doi.org/10.1145/3627673.3680108