Nearly optimal stochastic approximation for online principal subspace estimation

Xin Liang; Zhen Chen Guo; Li Wang; Ren Cang Li; Wen Wei Lin

doi:10.1007/s11425-021-1972-5

Nearly optimal stochastic approximation for online principal subspace estimation

Xin Liang, Zhen Chen Guo, Li Wang, Ren Cang Li^*, Wen Wei Lin

^*Corresponding author for this work

Research output: Contribution to journal › Article › peer-review

2 Citations (Scopus)

Abstract

Principal component analysis (PCA) has been widely used in analyzing high-dimensional data. It converts a set of observed data points of possibly correlated variables into a set of linearly uncorrelated variables via an orthogonal transformation. To handle streaming data and reduce the complexities of PCA, (subspace) online PCA iterations were proposed to iteratively update the orthogonal transformation by taking one observed data point at a time. Existing works on the convergence of (subspace) online PCA iterations mostly focus on the case where the samples are almost surely uniformly bounded. In this paper, we analyze the convergence of a subspace online PCA iteration under more practical assumption and obtain a nearly optimal finite-sample error bound. Our convergence rate almost matches the minimax information lower bound. We prove that the convergence is nearly global in the sense that the subspace online PCA iteration is convergent with high probability for random initial guesses. This work also leads to a simpler proof of the recent work on analyzing online PCA for the first principal component only.

Original language	English
Pages (from-to)	1087-1122
Number of pages	36
Journal	Science China Mathematics
Volume	66
Issue number	5
DOIs	https://doi.org/10.1007/s11425-021-1972-5
Publication status	Published - May 2023
Externally published	Yes

Keywords

62H12
62H25
65F99
68W27
finite-sample analysis
high-dimensional data
online algorithm
principal component analysis
principal component subspace
stochastic approximation

Access to Document

10.1007/s11425-021-1972-5

Cite this

Liang, X., Guo, Z. C., Wang, L., Li, R. C., & Lin, W. W. (2023). Nearly optimal stochastic approximation for online principal subspace estimation. Science China Mathematics, 66(5), 1087-1122. https://doi.org/10.1007/s11425-021-1972-5

@article{9546a6878de043658e9f3800303aef8d,

title = "Nearly optimal stochastic approximation for online principal subspace estimation",

abstract = "Principal component analysis (PCA) has been widely used in analyzing high-dimensional data. It converts a set of observed data points of possibly correlated variables into a set of linearly uncorrelated variables via an orthogonal transformation. To handle streaming data and reduce the complexities of PCA, (subspace) online PCA iterations were proposed to iteratively update the orthogonal transformation by taking one observed data point at a time. Existing works on the convergence of (subspace) online PCA iterations mostly focus on the case where the samples are almost surely uniformly bounded. In this paper, we analyze the convergence of a subspace online PCA iteration under more practical assumption and obtain a nearly optimal finite-sample error bound. Our convergence rate almost matches the minimax information lower bound. We prove that the convergence is nearly global in the sense that the subspace online PCA iteration is convergent with high probability for random initial guesses. This work also leads to a simpler proof of the recent work on analyzing online PCA for the first principal component only.",

keywords = "62H12, 62H25, 65F99, 68W27, finite-sample analysis, high-dimensional data, online algorithm, principal component analysis, principal component subspace, stochastic approximation",

author = "Xin Liang and Guo, {Zhen Chen} and Li Wang and Li, {Ren Cang} and Lin, {Wen Wei}",

note = "Publisher Copyright: {\textcopyright} 2022, Science China Press and Springer-Verlag GmbH Germany, part of Springer Nature.",

year = "2023",

month = may,

doi = "10.1007/s11425-021-1972-5",

language = "English",

volume = "66",

pages = "1087--1122",

journal = "Science China Mathematics",

issn = "1674-7283",

publisher = "Science China Press",

number = "5",

}

TY - JOUR

T1 - Nearly optimal stochastic approximation for online principal subspace estimation

AU - Liang, Xin

AU - Guo, Zhen Chen

AU - Wang, Li

AU - Li, Ren Cang

AU - Lin, Wen Wei

PY - 2023/5

Y1 - 2023/5

N2 - Principal component analysis (PCA) has been widely used in analyzing high-dimensional data. It converts a set of observed data points of possibly correlated variables into a set of linearly uncorrelated variables via an orthogonal transformation. To handle streaming data and reduce the complexities of PCA, (subspace) online PCA iterations were proposed to iteratively update the orthogonal transformation by taking one observed data point at a time. Existing works on the convergence of (subspace) online PCA iterations mostly focus on the case where the samples are almost surely uniformly bounded. In this paper, we analyze the convergence of a subspace online PCA iteration under more practical assumption and obtain a nearly optimal finite-sample error bound. Our convergence rate almost matches the minimax information lower bound. We prove that the convergence is nearly global in the sense that the subspace online PCA iteration is convergent with high probability for random initial guesses. This work also leads to a simpler proof of the recent work on analyzing online PCA for the first principal component only.

AB - Principal component analysis (PCA) has been widely used in analyzing high-dimensional data. It converts a set of observed data points of possibly correlated variables into a set of linearly uncorrelated variables via an orthogonal transformation. To handle streaming data and reduce the complexities of PCA, (subspace) online PCA iterations were proposed to iteratively update the orthogonal transformation by taking one observed data point at a time. Existing works on the convergence of (subspace) online PCA iterations mostly focus on the case where the samples are almost surely uniformly bounded. In this paper, we analyze the convergence of a subspace online PCA iteration under more practical assumption and obtain a nearly optimal finite-sample error bound. Our convergence rate almost matches the minimax information lower bound. We prove that the convergence is nearly global in the sense that the subspace online PCA iteration is convergent with high probability for random initial guesses. This work also leads to a simpler proof of the recent work on analyzing online PCA for the first principal component only.

KW - 62H12

KW - 62H25

KW - 65F99

KW - 68W27

KW - finite-sample analysis

KW - high-dimensional data

KW - online algorithm

KW - principal component analysis

KW - principal component subspace

KW - stochastic approximation

UR - http://www.scopus.com/inward/record.url?scp=85137520826&partnerID=8YFLogxK

U2 - 10.1007/s11425-021-1972-5

DO - 10.1007/s11425-021-1972-5

M3 - Article

AN - SCOPUS:85137520826

SN - 1674-7283

VL - 66

SP - 1087

EP - 1122

JO - Science China Mathematics

JF - Science China Mathematics

IS - 5

ER -

Nearly optimal stochastic approximation for online principal subspace estimation

Abstract

Keywords

Access to Document

Other files and links

Fingerprint

Cite this