Adaptive online convex optimization with unknown feedback delay

Research output: Contribution to journalArticlepeer-review

Abstract

Online Convex Optimization (OCO) with unknown feedback delay presents a considerable challenge, particularly when the delay period is not available a priori. In this paper, we propose an adaptive delayed mirror descent (ADMD) algorithm to address this issue, which incorporates a virtual iterate sequence and a learning rate based on the cumulative missed feedback instances. This method improves regret bounds and eliminates the need for prior knowledge of the delay period. Furthermore, we transform the ADMD algorithm into adaptive delayed dual averaging (ADDA) using lazy gradient descent, establishing a connection between these two frameworks. To further enhance the algorithm's adaptability, we introduce a novel delayed doubling trick. Through extensive experiments, we demonstrate the efficacy of our approach, showing superior performance compared to existing algorithms.

Original languageEnglish
Article number129269
JournalExpert Systems with Applications
Volume297
DOIs
Publication statusPublished - 1 Feb 2026
Externally publishedYes

Keywords

  • Dual averaging
  • Learning rate
  • Mirror descent
  • Online convex optimization
  • Unknown feedback delay

Fingerprint

Dive into the research topics of 'Adaptive online convex optimization with unknown feedback delay'. Together they form a unique fingerprint.

Cite this