Stochastic Frank-Wolfe Algorithm for Constrained Bilevel Optimization With Improved Per-Iteration Complexity

Jie Hou, Xianlin Zeng*, Shisheng Cui, Xia Jiang, Jian Sun

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract

Bilevel optimization, where one optimization problem is inherently nested within another, has gained significant attention due to its extensive applications in machine learning, such as hyperparameter optimization and meta-learning. Most existing algorithms are designed to address unconstrained bilevel optimization problems, with few are capable of effectively tackling complex constrained settings. To address this gap, we propose a novel, fully single-loop stochastic Frank-Wolfe algorithm. This algorithm incorporates a Hessian-inverse-vector approximation technique, momentum-based gradient tracking, and a Frank-Wolfe update. Our proposed algorithm improves per-iteration complexity and achieves lower sample complexity compared to existing Frank-Wolfe algorithms for bilevel optimization. We also conduct numerical simulations to demonstrate the efficacy of our algorithm compared to state-of-the-art methods.

Original languageEnglish
Pages (from-to)3237-3252
Number of pages16
JournalIEEE Transactions on Signal Processing
Volume73
DOIs
Publication statusPublished - 2025

Keywords

  • bilevel optimization
  • hypergradient approximation
  • Stochastic Frank-Wolfe algorithm
  • variance reduction technique

Fingerprint

Dive into the research topics of 'Stochastic Frank-Wolfe Algorithm for Constrained Bilevel Optimization With Improved Per-Iteration Complexity'. Together they form a unique fingerprint.

Cite this