Abstract
Cross-validation (CV) is a widely adopted approach for selecting the optimal model. However, computing the empirical cross-validation error (CVE) is costly because it requires training the learner multiple times. In this paper, we develop a novel approximation theory of CVE and present an approximate CV approach based on the Bouligand influence function (BIF) for kernel-based algorithms. We first express the BIF and higher-order BIFs in Taylor expansions and approximate CV via these expansions. We then derive an upper bound on the discrepancy between the original and the approximate CV. Furthermore, we provide a novel method for computing the BIF for a general distribution, and evaluate the BIF criterion on the sample distribution to approximate CV. The proposed approximate CV requires training on the full data set only once and is suitable for a wide variety of kernel-based algorithms. Experimental results demonstrate that the proposed approximate CV is sound and effective.
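The central practical point of the abstract is that the approximate CV needs only a single training run on the full sample. As a rough, self-contained illustration of that single-pass idea for kernel ridge regression, the sketch below uses the standard leverage-based leave-one-out shortcut for linear smoothers rather than the paper's BIF/Taylor-expansion machinery; the function names, the RBF kernel, and the regularization grid are purely illustrative assumptions.

```python
# Minimal sketch (NumPy only) of "train once, approximate CV" for kernel ridge
# regression (KRR). This illustrates a first-order, influence-style leave-one-out
# approximation, NOT the paper's Bouligand-influence-function derivation.
import numpy as np

def rbf_kernel(X, Z, gamma=1.0):
    """Gaussian (RBF) kernel matrix between the rows of X and Z."""
    sq = np.sum(X**2, 1)[:, None] + np.sum(Z**2, 1)[None, :] - 2 * X @ Z.T
    return np.exp(-gamma * sq)

def approx_loo_cv_error(X, y, lam=1e-2, gamma=1.0):
    """Approximate leave-one-out CV error from a single fit on the full data.

    The full-sample KRR fit gives predictions y_hat = H y with
    H = K (K + n*lam*I)^{-1}.  A first-order expansion in the weight of the
    held-out point yields the classic linear-smoother shortcut
        y_hat_{-i}(x_i) ~= (y_hat_i - H_ii * y_i) / (1 - H_ii),
    so no per-fold retraining is needed.
    """
    n = X.shape[0]
    K = rbf_kernel(X, X, gamma)
    H = K @ np.linalg.solve(K + n * lam * np.eye(n), np.eye(n))
    y_hat = H @ y
    h = np.diag(H)
    loo_pred = (y_hat - h * y) / (1.0 - h)      # approximate held-out predictions
    return np.mean((y - loo_pred) ** 2)          # approximate CVE under squared loss

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    X = rng.normal(size=(200, 3))
    y = np.sin(X[:, 0]) + 0.1 * rng.normal(size=200)
    # Model selection: pick the regularization with the smallest approximate CVE,
    # using one full-sample solve per candidate instead of one fit per fold.
    for lam in [1e-3, 1e-2, 1e-1]:
        print(lam, approx_loo_cv_error(X, y, lam=lam))
```

The design mirrors the abstract's claim at a much smaller scale: the CV estimate is read off from quantities computed during the single full-sample fit (here the smoother matrix H), which is what makes the procedure cheap relative to retraining for every fold.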
Original language | English
---|---
Article number | 8611136
Pages (from-to) | 1083-1096
Number of pages | 14
Journal | IEEE Transactions on Pattern Analysis and Machine Intelligence
Volume | 42
Issue number | 5
DOI |
Publication status | Published - 1 May 2020
Externally published | Yes