Comparative study of machine-and deep-learning based classification algorithms for biomedical Raman spectroscopy (RS): case study of RS based pathogenic microbe identification

Sisi Guo, Ruoyu Zhang, Tao Wang*, Jianfeng Wang*

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

1 Citation (Scopus)

Abstract

One key aspect pushing the frontiers of biomedical RS is dedicated machine- or deep- learning (ML or DL) algorithms. Yet, systematic comparative study between ML and DL algorithms has not been conducted for biomedical RS, largely due to the limited availability of open-source and large Raman spectra dataset. Therefore we compared typical ML partial least square-discriminant analysis (PLS-DA) and DL one dimensional convolution neural network (1D-CNN) based pathogenic microbe identification on 12,000 Raman spectra from six species of microbe (i.e., K. aerogenes (Klebsiella aerogenes), C. albicans (Candida albicans), C. glabrata (Candida glabrata), Group A Strep. (Group A Streptococcus), E. coli1 (Escherichia coli1), E. coli2 (Escherichia coli2)) when 100%, 75%, 50% and 25% of the 12,000 Raman spectra were retained. The total Raman dataset was analyzed with 80% split for training and 20% for testing. The 100% retained testing dataset accuracy, area under curve (AUC) of the receiver operating characteristic (ROC) curve were 95.25% and 0.997 for 1D-CNN, which are higher than those (89.42% and 0.979) of PLS-DA. Yet, PLS-DA outperforms 1D-CNN for 75%, 50% and 25% retained testing dataset. The resultant accuracies and AUCs demonstrated the performance reliance of PLS-DA and 1D-CNN on Raman spectra number. Besides, both loadings on the latent variables of PLS-DA and the saliency maps of 1D-CNN largely captured Raman peaks arising from DNA and proteins with comparable interpretability. The results of the current work indicated that both ML and DL algorithms should be explored for application-wise Raman spectra identification to select whichever with higher accuracies and AUCs. Graphical abstract: (Figure presented.)

Original languageEnglish
Pages (from-to)2101-2109
Number of pages9
JournalAnalytical Sciences
Volume40
Issue number12
DOIs
Publication statusPublished - Dec 2024

Keywords

  • Comparative study
  • Deep learning
  • Machine learning
  • Microbe classification
  • Raman spectroscopy

Fingerprint

Dive into the research topics of 'Comparative study of machine-and deep-learning based classification algorithms for biomedical Raman spectroscopy (RS): case study of RS based pathogenic microbe identification'. Together they form a unique fingerprint.

Cite this