基于自适应密度聚类的多准则主动学习方法

Translated title of the contribution: A multi-criteria active learning method based on adaptive density clustering

Zhonghai He*, Wenhan Zhu, Xuwang Chen, Xiaofang Zhang

*Corresponding author for this work

Research output: Contribution to journal › Article › peer-review

Abstract

Active learning (AL) helps train high-performing machine learning models while minimizing labeling costs. Combining the RD and QBC algorithms addresses the limitations of selecting samples by a single criterion. However, the K-means clustering on which RD is based may include outliers, which degrades model performance, and QBC requires maintaining multiple models and provides sample information only indirectly. To address these issues, we propose an adaptive density clustering-based Gaussian process regression (ADC-GPR) algorithm, which selects samples efficiently by first clustering and then using uncertainty directly. The ADC clustering in this algorithm is robust against outliers and adapts to the distribution of the dataset, providing representative sample points and their corresponding clusters for subsequent AL. The method ensures both representativeness and diversity in unsupervised selection, and considers informativeness, representativeness, and diversity in supervised selection. Experimental results show that, for the same number of sampling iterations, the ADC-GPR algorithm improves average performance by 37.3%, 8%, and 2.8% over the RS, KS, and RD-GPR algorithms, respectively. The ADC-GPR algorithm also demonstrates higher selection efficiency.
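
The cluster-then-uncertainty idea described in the abstract can be illustrated with a minimal sketch. This is not the authors' ADC-GPR implementation: the abstract does not specify the ADC procedure, so a simple density-based grouping with a data-derived radius stands in for it, and the function names (`density_clusters`, `gp_variance`, `select_next`), the RBF kernel, and all parameter values are assumptions made for illustration.

```python
import numpy as np

def rbf(A, B, length=1.0):
    # Squared-exponential kernel between the rows of A and the rows of B.
    d2 = ((A[:, None, :] - B[None, :, :]) ** 2).sum(-1)
    return np.exp(-0.5 * d2 / length**2)

def density_clusters(X, k=5):
    # Stand-in for ADC (assumption): the radius is derived from the data,
    # and points that never join a cluster keep the outlier label -1.
    D = np.sqrt(((X[:, None, :] - X[None, :, :]) ** 2).sum(-1))
    eps = np.median(np.sort(D, axis=1)[:, k])  # median k-th neighbour distance
    density = (D <= eps).sum(1) - 1            # neighbours within eps, self excluded
    labels = np.full(len(X), -1)
    reps = []
    for i in np.argsort(-density):             # visit densest points first
        if labels[i] != -1 or density[i] < k:
            continue                           # already assigned, or too sparse
        members = (D[i] <= eps) & (labels == -1)
        labels[members] = len(reps)
        reps.append(i)                         # i represents the new cluster
    return labels, reps

def gp_variance(X_lab, X_pool, noise=1e-4):
    # GP posterior variance. With fixed kernel hyperparameters it depends
    # only on the inputs; a full GPR would also fit hyperparameters to labels.
    K = rbf(X_lab, X_lab) + noise * np.eye(len(X_lab))
    Ks = rbf(X_pool, X_lab)
    v = np.linalg.solve(K, Ks.T)
    return 1.0 - (Ks * v.T).sum(1)

def select_next(X, labels, labeled):
    # Candidates: unlabeled points that are not outliers.
    cand = labels >= 0
    cand[labeled] = False
    # Diversity: restrict to the clusters with the fewest labeled samples.
    counts = np.bincount(labels[labeled], minlength=labels.max() + 1)
    open_clusters = np.unique(labels[cand])
    least = open_clusters[counts[open_clusters] == counts[open_clusters].min()]
    cand &= np.isin(labels, least)
    # Informativeness: highest predictive variance among the candidates.
    var = np.where(cand, gp_variance(X[labeled], X), -np.inf)
    return int(np.argmax(var))

rng = np.random.default_rng(0)
X = np.vstack([rng.normal(0, 0.3, (30, 2)),
               rng.normal(3, 0.3, (30, 2)),
               [[10.0, 10.0]]])                # last row is an isolated outlier
labels, reps = density_clusters(X)
labeled = list(reps)                           # unsupervised start: cluster representatives
for _ in range(5):
    labeled.append(select_next(X, labels, labeled))
```

In this sketch the isolated point is never queried because it belongs to no cluster, which mirrors the robustness to outliers that the abstract attributes to ADC. The representatives serve as the unsupervised initial selection, and later queries combine cluster coverage with GPR uncertainty.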

Original language: Chinese (Traditional)
Pages (from-to): 179-187
Number of pages: 9
Journal: Yi Qi Yi Biao Xue Bao/Chinese Journal of Scientific Instrument
Volume: 45
Issue number: 3
Publication status: Published - Mar 2024
