跳到主要导航 跳到搜索 跳到主要内容

An Innovative Subsampling Approach for Efficient SVM Training with Large Datasets

  • Beijing Institute of Technology
  • Institute of Statistics and Big Data

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Support vector machines (SVMs) are widely recognized for their effectiveness in handling classification problems, owing to their solid theoretical foundation and excellent generalization performance. However, despite these advantages, SVMs have a significant drawback in the form of high computational time, which increases with the size of the training dataset. To address this limitation, this article presents a novel adaptive sequential subsampling method designed to accelerate the training process of SVMs. The proposed method consists of two stages. In the first stage, a space-filling design is employed to group samples into cells. Then, an initial pilot SVM model is trained by utilizing the centroids and corresponding labels of these cells. In the second stage, an adaptive sequential stratified sampling method, based on the distance between each cell and the hyperplane, is employed to select informative samples, thereby enhancing the SVM model. Numerical studies show that our approach achieves classification accuracy that is comparable to or even better than that of basic SVM, while requiring only approximately 1% of the CPU time. Consequently, our algorithm is a more efficient choice for large-scale data applications.

源语言英语
主期刊名2025 4th International Conference on Image Processing, Computer Vision and Machine Learning, ICICML 2025
出版商Institute of Electrical and Electronics Engineers Inc.
1798-1807
页数10
ISBN(电子版)9798331565817
DOI
出版状态已出版 - 2025
已对外发布
活动2025 4th International Conference on Image Processing, Computer Vision and Machine Learning, ICICML 2025 - Chongqing, 中国
期限: 21 11月 202523 11月 2025

出版系列

姓名2025 4th International Conference on Image Processing, Computer Vision and Machine Learning, ICICML 2025

会议

会议2025 4th International Conference on Image Processing, Computer Vision and Machine Learning, ICICML 2025
国家/地区中国
Chongqing
时期21/11/2523/11/25

指纹

探究 'An Innovative Subsampling Approach for Efficient SVM Training with Large Datasets' 的科研主题。它们共同构成独一无二的指纹。

引用此