Abstract
We study the problem of estimating a sparse vector β ∈ Rp from the response variables y = Xβ + w, where w ∼ N(0, σw 2 In×n), under the following high-dimensional asymptotic regime: given a fixed number δ, p → ∞, while n/p → δ. We consider the popular class of q-regularized least squares (LQLS), a.k.a. bridge estimators, given by the optimization problem β(λ, q) ∈ arg min 1 2 y − Xβ2 2 + λβq q, β and characterize the almost sure limit of p 1 β(λ, q)− β2 2, and call it asymptotic mean square error (AMSE). The expression we derive for this limit does not have explicit forms, and hence is not useful in comparing LQLS for different values of q, or providing information in evaluating the effect of δ or sparsity level of β. To simplify the expression, researchers have considered the ideal “error-free” regime, that is, w = 0, and have characterized the values of δ for which AMSE is zero. This is known as the phase transition analysis. In this paper, we first perform the phase transition analysis of LQLS. Our results reveal some of the limitations and misleading features of the phase transition analysis. To overcome these limitations, we propose the small error analysis of LQLS. Our new analysis framework not only sheds light on the results of the phase transition analysis, but also describes when phase transition analysis is reliable, and presents a more accurate comparison among different regularizers.
Original language | English |
---|---|
Pages (from-to) | 3099-3129 |
Number of pages | 31 |
Journal | Annals of Statistics |
Volume | 46 |
Issue number | 6A |
DOIs | |
Publication status | Published - 2018 |
Externally published | Yes |
Keywords
- Asymptotic mean square error
- Bridge regression
- Comparison of estimators
- Optimal tuning
- Phase transition
- Second-order term
- Small error regime