基于自适应密度聚类的多准则主动学习方法

Zhonghai He*, Wenhan Zhu, Xuwang Chen, Xiaofang Zhang

*此作品的通讯作者

科研成果: 期刊稿件文章同行评审

摘要

Active learning proves instrumental in training superior machine learning models while minimizing labeling costs. The combination of RD and QBC algorithms effectively addresses issues associated with considering only a single criterion. However, the K-means clustering upon which RD is based may include outliers, leading to a decrease in model performance, and QBC requires maintaining multiple models and indirectly provides sample information. To address these issues, we propose an adaptive density clustering-based Gaussian process regression (ADC-GPR) algorithm, which efficiently selects samples by first clustering and then utilizing uncertainty directly. The ADC clustering in this algorithm is not only robust against outliers but also adapts to the distribution characteristics of the dataset, providing representative sample points and their corresponding clusters for subsequent AL. This method ensures both representativeness and diversity in unsupervised selection and considers informativeness, representativeness, and diversity in supervised selection. The experimental results demonstrate that compared to the RS, KS, and RD-GPR algorithms, the ADC-GPR algorithm exhibits an average performance improvement of 37. 3%, 8%, and 2. 8% respectively, with the same number of sampling iterations. Furthermore, the ADC-GPR algorithm demonstrates higher selection efficiency.

投稿的翻译标题A multi-criteria active learning method based on adaptive density clustering
源语言繁体中文
页(从-至)179-187
页数9
期刊Yi Qi Yi Biao Xue Bao/Chinese Journal of Scientific Instrument
45
3
DOI
出版状态已出版 - 3月 2024

关键词

  • active learning
  • adaptive density clustering
  • Gaussian process regression
  • multi-criteria fusion
  • outlier robustness

指纹

探究 '基于自适应密度聚类的多准则主动学习方法' 的科研主题。它们共同构成独一无二的指纹。

引用此