A Comparative Study of non-deep Learning, Deep Learning, and Ensemble Learning Methods for Sunspot Number Prediction

Yuchen Dang, Ziqi Chen, Heng Li, Hai Shu*

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

23 Citations (Scopus)

Abstract

Solar activity has significant impacts on human activities and health. One most commonly used measure of solar activity is the sunspot number. This paper compares three important non-deep learning models, four popular deep learning models, and their five ensemble models in forecasting sunspot numbers. In particular, we propose an ensemble model called XGBoost-DL, which uses XGBoost as a two-level nonlinear ensemble method to combine the deep learning models. Our XGBoost-DL achieves the best forecasting performance (RMSE (Formula presented.) and MAE (Formula presented.)) in the comparison, outperforming the best non-deep learning model SARIMA (RMSE (Formula presented.) and MAE (Formula presented.)), the best deep learning model Informer (RMSE (Formula presented.) and MAE (Formula presented.)) and the NASA’s forecast (RMSE (Formula presented.) and MAE (Formula presented.)). Our XGBoost-DL forecasts a peak sunspot number of 133.47 in May 2025 for Solar Cycle 25 and 164.62 in November 2035 for Solar Cycle 26, similar to but later than the NASA’s at 137.7 in October 2024 and 161.2 in December 2034. An open-source Python package of our XGBoost-DL for the sunspot number prediction is available at https://github.com/yd1008/ts_ensemble_sunspot.

Original languageEnglish
Article number2074129
JournalApplied Artificial Intelligence
Volume36
Issue number1
DOIs
Publication statusPublished - 2022
Externally publishedYes

Fingerprint

Dive into the research topics of 'A Comparative Study of non-deep Learning, Deep Learning, and Ensemble Learning Methods for Sunspot Number Prediction'. Together they form a unique fingerprint.

Cite this