Online Sequential Decision-Making with Unknown Delays

Ping Wu, Heyan Huang, Zhengyang Liu*

*此作品的通讯作者

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

In the field of online sequential decision-making, we address the problem with delays utilizing the framework of online convex optimization (OCO), where the feedback of a decision can arrive with an unknown delay. Unlike previous research that is limited to Euclidean norm and gradient information, we propose three families of delayed algorithms based on approximate solutions to handle different types of received feedback. Our proposed algorithms are versatile and applicable to universal norms. Specifically, we introduce a family of Follow the Delayed Regularized Leader algorithms for feedback with full information on the loss function, a family of Delayed Mirror Descent algorithms for feedback with gradient information on the loss function and a family of Simplified Delayed Mirror Descent algorithms for feedback with the value information of the loss function's gradients at corresponding decision points. For each type of algorithm, we provide corresponding regret bounds under cases of general convexity and relative strong convexity, respectively. We also demonstrate the efficiency of each algorithm under different norms through concrete examples. Furthermore, our theoretical results are consistent with the current best bounds when degenerated to standard settings.

源语言英语
主期刊名WWW 2024 - Proceedings of the ACM Web Conference
出版商Association for Computing Machinery, Inc
4028-4036
页数9
ISBN(电子版)9798400701719
DOI
出版状态已出版 - 13 5月 2024
活动33rd ACM Web Conference, WWW 2024 - Singapore, 新加坡
期限: 13 5月 202417 5月 2024

出版系列

姓名WWW 2024 - Proceedings of the ACM Web Conference

会议

会议33rd ACM Web Conference, WWW 2024
国家/地区新加坡
Singapore
时期13/05/2417/05/24

指纹

探究 'Online Sequential Decision-Making with Unknown Delays' 的科研主题。它们共同构成独一无二的指纹。

引用此