TY - JOUR
T1 - Similarity based spectral data fusion physical parameter regression modeling method
AU - He, Zhonghai
AU - Zhang, Haoxiang
AU - Zhang, Yi
AU - Zhang, Xiaofang
N1 - Publisher Copyright:
© 2025 Elsevier B.V.
PY - 2025/7
Y1 - 2025/7
N2 - Spectroscopy are widely used in routine concentration measurement. However, when spectral measurement is carried out in process industry, the measurement environment often changes thus the prediction accuracy of the regression model is spoiled. Existing studies regard the measurement environment change as noise, but in fact, the measurement environment also contains useful information. In this paper, a modeling method is proposed to augment the measured environmental parameters (physical quantities) into the calibration modeling to improve the prediction accuracy. To solve the problem of physical quantity parameters being overridden caused by direct variable extension method, we use the data fusion method based on sample similarity. Gaussian kernel function is used to calculate the similarity matrix of spectral and physical quantities respectively. Then fusion matrix is obtained by weighting combination. Finally, the regression model of fusion matrix and concentration is established by standard PLS modeling method. A regression model is established for the data collected during the fermentation process. The results showed that the prediction performance of the model could be improved by nearly 10 % by adding physical quantity information.
AB - Spectroscopy are widely used in routine concentration measurement. However, when spectral measurement is carried out in process industry, the measurement environment often changes thus the prediction accuracy of the regression model is spoiled. Existing studies regard the measurement environment change as noise, but in fact, the measurement environment also contains useful information. In this paper, a modeling method is proposed to augment the measured environmental parameters (physical quantities) into the calibration modeling to improve the prediction accuracy. To solve the problem of physical quantity parameters being overridden caused by direct variable extension method, we use the data fusion method based on sample similarity. Gaussian kernel function is used to calculate the similarity matrix of spectral and physical quantities respectively. Then fusion matrix is obtained by weighting combination. Finally, the regression model of fusion matrix and concentration is established by standard PLS modeling method. A regression model is established for the data collected during the fermentation process. The results showed that the prediction performance of the model could be improved by nearly 10 % by adding physical quantity information.
KW - Data fusion
KW - Kernel width determination
KW - Physical quantity modeling
KW - Similarity method
UR - http://www.scopus.com/inward/record.url?scp=105005837916&partnerID=8YFLogxK
U2 - 10.1016/j.vibspec.2025.103812
DO - 10.1016/j.vibspec.2025.103812
M3 - Article
AN - SCOPUS:105005837916
SN - 0924-2031
VL - 139
JO - Vibrational Spectroscopy
JF - Vibrational Spectroscopy
M1 - 103812
ER -