Analysis of Machine Learning Models for Stroke Prediction with Emphasis on Hyperparameter Tuning Techniques

Sakib Hasan, Alamgir Islam, Tanjin Islam, Hongbin Ma*

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Stroke remains a significant global cause of death and disability, necessitating early and accurate prediction models for prompt intervention. This study contrasts the performance of Support Vector Machine (SVM) and Random Forest (RF) models to enhance stroke prediction approaches. Emphasizing the critical role of hyper parameter adjustment in improving model efficiency, two tuning methods—Grid Search Cross-Validation (GS-CV) and Randomized Search Cross-Validation (RS-CV)—are investigated. Data prepossessing utilizes a data set from the Medical Clinic of Bangladesh, comprising 5,110 patient records. Imbalanced data is addressed through the Synthetic Minority Over-sampling Technique (SMOTE). Despite being good at predicting accuracy, SVM with RS-CV tuning is more accurate, achieving a 96% accuracy than RF with GS-CV tuning that achieves 92% accuracy. Such outcomes highlight the significance of choosing proper hyperparameter tuning techniques and ML models for stroke prediction. They also imply an outlet for use in healthcare contexts concerning early identification and prophylactic steps. This comparison study adds to the current debate about machine learning in medical prediction, focusing on the methodological aspects critical to constructing reliable and effective predictive systems.

Original languageEnglish
Title of host publicationComputational Intelligence and Industrial Applications - 11th International Symposium, ISCIIA 2024, Proceedings
EditorsBin Xin, Hongbin Ma, Jinhua She, Weihua Cao
PublisherSpringer Science and Business Media Deutschland GmbH
Pages1-9
Number of pages9
ISBN (Print)9789819647552
DOIs
Publication statusPublished - 2025
Externally publishedYes
Event11th International Symposium on Computational Intelligence and Industrial Applications, ISCIIA 2024 - Beijing, China
Duration: 1 Nov 20245 Nov 2024

Publication series

NameCommunications in Computer and Information Science
Volume2466 CCIS
ISSN (Print)1865-0929
ISSN (Electronic)1865-0937

Conference

Conference11th International Symposium on Computational Intelligence and Industrial Applications, ISCIIA 2024
Country/TerritoryChina
CityBeijing
Period1/11/245/11/24

Keywords

  • Data Prepossessing
  • Grid Search Cross-Validation
  • Health care Technology
  • Hyper parameter Tuning
  • Machine Learning
  • Random Forest
  • Randomized Search Cross-Validation
  • Stroke Prediction
  • Support Vector Machine
  • Synthetic Minority Over-sampling Technique (SMOTE)

Cite this