TY - JOUR
T1 - Leveraging explainable artificial intelligence in understanding public transportation usage rates for sustainable development
AU - Sariyer, Gorkem
AU - Mangla, Sachin Kumar
AU - Sozen, Mert Erkan
AU - Li, Guo
AU - Kazancoglu, Yigit
N1 - Publisher Copyright:
© 2024 Elsevier Ltd
PY - 2024/9
Y1 - 2024/9
N2 - Public transportation usage prediction is valuable for the sustainable development of transportation systems, particularly in crowded megacities. Machine learning technologies are of great interest for predicting public transportation usage. While these technologies outperform many other techniques, they suffer from limited interpretability. Explainable artificial intelligence (XAI) tools and techniques that offer post-hoc explanations of the obtained predictions are gaining popularity. This paper proposes an advanced tree-based ensemble algorithm for public transportation usage rate prediction. We aim to explain the predictions both with the most widely used technique of XAI, Shapley additive explanation (SHAP) and in the light of the rules presented. To predict the total public transportation usage, the proposed model combines all types of public transportation, categorized as ferry, railway, and bus, unlike most existing studies focusing on a single kind of public transport. Besides the sort of transportation, the day of the week, whether the day is special, and the daily ratio of passenger types were identified as model features for predicting the daily usage of each type of public transportation. We tested the proposed model using an open data set from Izmir City, Turkey. While the model had superior prediction performance, the explanations showed that the type of public transportation, weekday, and the ratio of full-fare passengers have the highest SHAP values, and the model features have many interactions. We also validated our results using an online data set showing Google search trends.
AB - Public transportation usage prediction is valuable for the sustainable development of transportation systems, particularly in crowded megacities. Machine learning technologies are of great interest for predicting public transportation usage. While these technologies outperform many other techniques, they suffer from limited interpretability. Explainable artificial intelligence (XAI) tools and techniques that offer post-hoc explanations of the obtained predictions are gaining popularity. This paper proposes an advanced tree-based ensemble algorithm for public transportation usage rate prediction. We aim to explain the predictions both with the most widely used technique of XAI, Shapley additive explanation (SHAP) and in the light of the rules presented. To predict the total public transportation usage, the proposed model combines all types of public transportation, categorized as ferry, railway, and bus, unlike most existing studies focusing on a single kind of public transport. Besides the sort of transportation, the day of the week, whether the day is special, and the daily ratio of passenger types were identified as model features for predicting the daily usage of each type of public transportation. We tested the proposed model using an open data set from Izmir City, Turkey. While the model had superior prediction performance, the explanations showed that the type of public transportation, weekday, and the ratio of full-fare passengers have the highest SHAP values, and the model features have many interactions. We also validated our results using an online data set showing Google search trends.
KW - Machine learning
KW - Public transportation usage
KW - Rule-based explanation
KW - SHAP
KW - XAI
UR - http://www.scopus.com/inward/record.url?scp=85192144152&partnerID=8YFLogxK
U2 - 10.1016/j.omega.2024.103105
DO - 10.1016/j.omega.2024.103105
M3 - Article
AN - SCOPUS:85192144152
SN - 0305-0483
VL - 127
JO - Omega (United Kingdom)
JF - Omega (United Kingdom)
M1 - 103105
ER -