On the constrained online convex optimization with feedback delay

Heyan Huang, Ping Wu*, Haolin Lu, Zhengyang Liu

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract

We investigate the problem of online convex optimization (OCO) under feedback delay, where feedback for a decision is received after a delay, and long-term constraints, where constraints can be violated in intermediate iterations but must be satisfied over the long run. Existing approaches are primarily limited to fixed delay settings and general convex loss functions. In this paper, we employ a stricter metric based on cumulative constraint violations. We first propose a novel algorithm tailored for the fixed d-slot delay setting, achieving a regret bound of O(dT) and a cumulative constraint violation of O (T[Formula presented]), demonstrating superior performance compared to existing results. Moreover, when the loss functions are strongly convex, the regret and violation bounds can be further reduced to O (dlnT) and O (dlnT), respectively. Additionally, we extend our algorithm to the more realistic re-indexed delay setting, achieving O(dT) regret and O(T[Formula presented]) cumulative constraint violation. Under strong convexity, these bounds are further improved to O(dˆlnT) and O(dˆlnT), where dˆ=maxt∈[T]dt denotes the maximum delay.

Original languageEnglish
Article number127871
JournalExpert Systems with Applications
Volume287
DOIs
Publication statusPublished - 25 Aug 2025
Externally publishedYes

Keywords

  • Constraint violation
  • Feedback delay
  • Long-term constraint
  • Online convex optimization
  • Regret bound

Fingerprint

Dive into the research topics of 'On the constrained online convex optimization with feedback delay'. Together they form a unique fingerprint.

Cite this