TY - JOUR
T1 - Smoothing Splines Approximation Using Hilbert Curve Basis Selection
AU - Meng, Cheng
AU - Yu, Jun
AU - Chen, Yongkai
AU - Zhong, Wenxuan
AU - Ma, Ping
N1 - Publisher Copyright:
© 2022 American Statistical Association, Institute of Mathematical Statistics, and Interface Foundation of North America.
PY - 2022
Y1 - 2022
N2 - Smoothing splines have been used pervasively in nonparametric regressions. However, the computational burden of smoothing splines is significant when the sample size n is large. When the number of predictors (Formula presented.), the computational cost for smoothing splines is at the order of (Formula presented.) using the standard approach. Many methods have been developed to approximate smoothing spline estimators by using q basis functions instead of n ones, resulting in a computational cost of the order (Formula presented.). These methods are called the basis selection methods. Despite algorithmic benefits, most of the basis selection methods require the assumption that the sample is uniformly distributed on a hypercube. These methods may have deteriorating performance when such an assumption is not met. To overcome the obstacle, we develop an efficient algorithm that is adaptive to the unknown probability density function of the predictors. Theoretically, we show the proposed estimator has the same convergence rate as the full-basis estimator when q is roughly at the order of (Formula presented.), where (Formula presented.) and (Formula presented.) are some constants depend on the type of the spline. Numerical studies on various synthetic datasets demonstrate the superior performance of the proposed estimator in comparison with mainstream competitors. Supplementary files for this article are available online.
AB - Smoothing splines have been used pervasively in nonparametric regressions. However, the computational burden of smoothing splines is significant when the sample size n is large. When the number of predictors (Formula presented.), the computational cost for smoothing splines is at the order of (Formula presented.) using the standard approach. Many methods have been developed to approximate smoothing spline estimators by using q basis functions instead of n ones, resulting in a computational cost of the order (Formula presented.). These methods are called the basis selection methods. Despite algorithmic benefits, most of the basis selection methods require the assumption that the sample is uniformly distributed on a hypercube. These methods may have deteriorating performance when such an assumption is not met. To overcome the obstacle, we develop an efficient algorithm that is adaptive to the unknown probability density function of the predictors. Theoretically, we show the proposed estimator has the same convergence rate as the full-basis estimator when q is roughly at the order of (Formula presented.), where (Formula presented.) and (Formula presented.) are some constants depend on the type of the spline. Numerical studies on various synthetic datasets demonstrate the superior performance of the proposed estimator in comparison with mainstream competitors. Supplementary files for this article are available online.
KW - Nonparametric regression
KW - Penalized least squares
KW - Space-filling curve
KW - Subsampling
UR - http://www.scopus.com/inward/record.url?scp=85122805668&partnerID=8YFLogxK
U2 - 10.1080/10618600.2021.2002161
DO - 10.1080/10618600.2021.2002161
M3 - Article
AN - SCOPUS:85122805668
SN - 1061-8600
VL - 31
SP - 802
EP - 812
JO - Journal of Computational and Graphical Statistics
JF - Journal of Computational and Graphical Statistics
IS - 3
ER -