TY - JOUR
T1 - Size stable batch mode model updating method
AU - He, Zhonghai
AU - Chen, Xuwang
AU - Feng, Zhanbo
AU - Zhang, Xiaofang
N1 - Publisher Copyright:
© 2024 Elsevier B.V.
PY - 2024/9
Y1 - 2024/9
N2 - In near-infrared spectroscopy analysis, the predictive performance of models may degrade due to variations in the measurement environment and sample properties. Maintenance or updates to the model are essential to uphold prediction accuracy. The state-of-the-art approach for model updates involves labeling new samples and incorporating them into the calibration set. However, this strategy leads to an escalating size of the calibration dataset, and the low ratio of new samples renders the model update process inefficient. To enhance update efficiency, a size-stable updating strategy is presented which involves the simultaneous addition of new samples and removal of old samples. This is achieved by utilizing the new set as a basis for identifying and deleting significantly different old labeled samples. To improve labeling efficiency, a batch mode update is employed. Initially, calibration samples are clustered to yield multiple clusters with comparable sizes to the new update set. Subsequently, differences between multiple old sample clusters and the new set are calculated. The old sample cluster that most different are removed, accompanied by the addition of the new sample set to maintain calibration set size stability. In calculating set similarity, a multi-indices fusion method is employed to ensure accurate judgments. The efficacy of this method is validated through simulated and real data.
AB - In near-infrared spectroscopy analysis, the predictive performance of models may degrade due to variations in the measurement environment and sample properties. Maintenance or updates to the model are essential to uphold prediction accuracy. The state-of-the-art approach for model updates involves labeling new samples and incorporating them into the calibration set. However, this strategy leads to an escalating size of the calibration dataset, and the low ratio of new samples renders the model update process inefficient. To enhance update efficiency, a size-stable updating strategy is presented which involves the simultaneous addition of new samples and removal of old samples. This is achieved by utilizing the new set as a basis for identifying and deleting significantly different old labeled samples. To improve labeling efficiency, a batch mode update is employed. Initially, calibration samples are clustered to yield multiple clusters with comparable sizes to the new update set. Subsequently, differences between multiple old sample clusters and the new set are calculated. The old sample cluster that most different are removed, accompanied by the addition of the new sample set to maintain calibration set size stability. In calculating set similarity, a multi-indices fusion method is employed to ensure accurate judgments. The efficacy of this method is validated through simulated and real data.
KW - Batch reduction processing
KW - Model update
KW - Multi-indices fusion
KW - Similarity analysis
UR - http://www.scopus.com/inward/record.url?scp=85197741459&partnerID=8YFLogxK
U2 - 10.1016/j.vibspec.2024.103717
DO - 10.1016/j.vibspec.2024.103717
M3 - Article
AN - SCOPUS:85197741459
SN - 0924-2031
VL - 134
JO - Vibrational Spectroscopy
JF - Vibrational Spectroscopy
M1 - 103717
ER -