Abstract
The process of selecting representative samples is crucial for establishing an accurate calibration model. To enhance the representativeness of the samples, a method for sample selection, utilizing the degree of anomaly as the evaluation criterion, is proposed. Initially, anomaly scores corresponding to various detection methods are obtained to ensure a comprehensive evaluation. These scores are then normalized by the confidence lower limit to establish a consistent scoring criterion. Subsequently, the weights of different detection methods are determined through eigenvector centrality analysis of a graph, where the methods serve as nodes and the similarity acts as weighted edges. Finally, the comprehensive anomaly scores are computed as the sum of weighted scores and are subsequently sorted. Representative samples are selected using a uniformly spaced sampling approach, with the spacing determined by a predefined and provided sample number. The efficacy of the method is validated across different sample sets.
| Original language | English |
|---|---|
| Article number | e70028 |
| Journal | Journal of Chemometrics |
| Volume | 39 |
| Issue number | 4 |
| DOIs | |
| Publication status | Published - Apr 2025 |
| Externally published | Yes |
Keywords
- anomaly score
- comprehensive representation score
- graph centrality
- index combination
- unsupervised sample selection