TY - GEN
T1 - RL-ISLAP: A Reinforcement Learning Framework for Industrial-Scale Linear Assignment Problems
T2 - 33rd ACM International Conference on Information and Knowledge Management, CIKM 2024
AU - Li, Hanjie
AU - Ning, Yue
AU - Bao, Yang
AU - Li, Changsheng
AU - Chen, Boxiao
AU - Lu, Xingyu
AU - Yuan, Ye
AU - Wang, Guoren
N1 - Publisher Copyright:
© 2024 ACM.
PY - 2024/10/21
Y1 - 2024/10/21
AB - Industrial-scale linear assignment problems (LAPs) arise frequently in industrial scenarios, e.g., asset allocation in the domain of credit management. However, optimization algorithms for such problems (e.g., PJ-ADMM) are highly sensitive to their hyper-parameters. Existing solving systems rely on empirical parameter selection, which makes convergence difficult to achieve and is extremely time-consuming; moreover, the resulting parameter rules are often inefficient. To alleviate this issue, we propose RL-ISLAP, an efficient and lightweight Reinforcement Learning framework for Industrial-Scale Linear Assignment Problems. We formulate hyper-parameter selection for PJ-ADMM as a sequential decision problem and leverage reinforcement learning to enhance its convergence. To address the sparse-reward challenge inherent in learning policies for such problems, we devise auxiliary rewards that provide dense signals for policy optimization, and we present a rollback mechanism to prevent divergence during solving. Experiments on the OR-Library benchmark demonstrate that our method is competitive with SOTA stand-alone solvers. Furthermore, the scale-independent design of the observations enables us to transfer the learned hyper-parameter policy to LAPs of varying scales. On two real-world industrial-scale LAPs with up to 10 million decision variables, RL-ISLAP achieves solutions of comparable quality in two-thirds of the time compared with the SOTA distributed solving system employing fine-tuned empirical parameter rules.
KW - large-scale optimization
KW - linear assignment problem
KW - PJ-ADMM
KW - reinforcement learning
UR - http://www.scopus.com/inward/record.url?scp=85210041142&partnerID=8YFLogxK
DO - 10.1145/3627673.3680108
M3 - Conference contribution
AN - SCOPUS:85210041142
T3 - International Conference on Information and Knowledge Management, Proceedings
SP - 4661
EP - 4668
BT - CIKM 2024 - Proceedings of the 33rd ACM International Conference on Information and Knowledge Management
PB - Association for Computing Machinery
Y2 - 21 October 2024 through 25 October 2024
ER -