Comprehensive Anomaly Score Rank Based Unsupervised Sample Selection Method

  • Zhongjiang He
  • , Zhonghai He*
  • , Xiaofang Zhang
  • *Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract

The process of selecting representative samples is crucial for establishing an accurate calibration model. To enhance the representativeness of the samples, a method for sample selection, utilizing the degree of anomaly as the evaluation criterion, is proposed. Initially, anomaly scores corresponding to various detection methods are obtained to ensure a comprehensive evaluation. These scores are then normalized by the confidence lower limit to establish a consistent scoring criterion. Subsequently, the weights of different detection methods are determined through eigenvector centrality analysis of a graph, where the methods serve as nodes and the similarity acts as weighted edges. Finally, the comprehensive anomaly scores are computed as the sum of weighted scores and are subsequently sorted. Representative samples are selected using a uniformly spaced sampling approach, with the spacing determined by a predefined and provided sample number. The efficacy of the method is validated across different sample sets.

Original languageEnglish
Article numbere70028
JournalJournal of Chemometrics
Volume39
Issue number4
DOIs
Publication statusPublished - Apr 2025
Externally publishedYes

Keywords

  • anomaly score
  • comprehensive representation score
  • graph centrality
  • index combination
  • unsupervised sample selection

Fingerprint

Dive into the research topics of 'Comprehensive Anomaly Score Rank Based Unsupervised Sample Selection Method'. Together they form a unique fingerprint.

Cite this