TY - GEN
T1 - Online Sequential Decision-Making with Unknown Delays
AU - Wu, Ping
AU - Huang, Heyan
AU - Liu, Zhengyang
N1 - Publisher Copyright:
© 2024 ACM.
PY - 2024/5/13
Y1 - 2024/5/13
AB - In the field of online sequential decision-making, we address the problem with delays using the framework of online convex optimization (OCO), where the feedback of a decision may arrive with an unknown delay. Unlike previous research, which is limited to the Euclidean norm and gradient information, we propose three families of delayed algorithms based on approximate solutions to handle different types of received feedback. Our proposed algorithms are versatile and applicable to general norms. Specifically, we introduce a family of Follow the Delayed Regularized Leader algorithms for feedback with full information on the loss function, a family of Delayed Mirror Descent algorithms for feedback with gradient information on the loss function, and a family of Simplified Delayed Mirror Descent algorithms for feedback with the values of the loss function's gradients at the corresponding decision points. For each type of algorithm, we provide regret bounds for the cases of general convexity and relative strong convexity, respectively. We also demonstrate the efficiency of each algorithm under different norms through concrete examples. Furthermore, our theoretical results are consistent with the current best bounds when reduced to standard settings.
KW - approximate solution
KW - online convex optimization
KW - sequential decision-making
KW - unknown delays
UR - http://www.scopus.com/inward/record.url?scp=85194061571&partnerID=8YFLogxK
U2 - 10.1145/3589334.3645388
DO - 10.1145/3589334.3645388
M3 - Conference contribution
AN - SCOPUS:85194061571
T3 - WWW 2024 - Proceedings of the ACM Web Conference
SP - 4028
EP - 4036
BT - WWW 2024 - Proceedings of the ACM Web Conference
PB - Association for Computing Machinery, Inc
T2 - 33rd ACM Web Conference, WWW 2024
Y2 - 13 May 2024 through 17 May 2024
ER -