TY - GEN
T1 - A Unified Framework and a Case Study for Hyperparameter Selection in Machine Learning via Bilevel Optimization
AU - Li, Zhen
AU - Qian, Yaru
AU - Li, Qingna
N1 - Publisher Copyright:
© 2022 IEEE.
PY - 2022
Y1 - 2022
N2 - Support vector classification (SVC) is a classical and widely used method for classification problems. A regularization parameter, which significantly affects the classification performance, has to be chosen. In this paper, we propose a unified framework for hyperparameter selection in machine learning via bilevel optimization. Specially, we study the case for l2-loss SVC in which the upper-level objective function is the hinge loss function, and the lower-level problems are l2-loss SVC models. Following the same way as in our recent work, we reformulate the bilevel optimization model as a mathematical program with equilibrium constraints (MPEC). Similarly to that in our recent work, the MPEC-tailored version of the Mangasarian-Fromovitz constraint qualification (MPEC-MFCQ) automatically holds at each feasible point of the MPEC. Extensive numerical results verify that the global relaxation cross-validation algorithm for l2-loss SVC (GRCV-l2) enjoys superior generalization performance over almost all the datasets used in this paper.
AB - Support vector classification (SVC) is a classical and widely used method for classification problems. A regularization parameter, which significantly affects the classification performance, has to be chosen. In this paper, we propose a unified framework for hyperparameter selection in machine learning via bilevel optimization. Specially, we study the case for l2-loss SVC in which the upper-level objective function is the hinge loss function, and the lower-level problems are l2-loss SVC models. Following the same way as in our recent work, we reformulate the bilevel optimization model as a mathematical program with equilibrium constraints (MPEC). Similarly to that in our recent work, the MPEC-tailored version of the Mangasarian-Fromovitz constraint qualification (MPEC-MFCQ) automatically holds at each feasible point of the MPEC. Extensive numerical results verify that the global relaxation cross-validation algorithm for l2-loss SVC (GRCV-l2) enjoys superior generalization performance over almost all the datasets used in this paper.
KW - Bilevel optimization
KW - C-stationarity
KW - Hyperparameter selection
KW - Mathematical program with equilibrium constraints
KW - Support vector classification
UR - http://www.scopus.com/inward/record.url?scp=85143152574&partnerID=8YFLogxK
U2 - 10.1109/DSIT55514.2022.9943929
DO - 10.1109/DSIT55514.2022.9943929
M3 - Conference contribution
AN - SCOPUS:85143152574
T3 - 2022 5th International Conference on Data Science and Information Technology, DSIT 2022 - Proceedings
BT - 2022 5th International Conference on Data Science and Information Technology, DSIT 2022 - Proceedings
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 5th International Conference on Data Science and Information Technology, DSIT 2022
Y2 - 22 July 2022 through 24 July 2022
ER -