TY - JOUR
T1 - Design and experiment verification of a novel analysis framework for recognition of driver injury patterns
T2 - From a multi-class classification perspective
AU - Zhu, Mengtao
AU - Li, Yunjie
AU - Wang, Yinhai
N1 - Publisher Copyright:
© 2018 Elsevier Ltd
PY - 2018/11
Y1 - 2018/11
N2 - Detecting driver injury patterns is a typical classification problem. Crash data sets are highly skewed where fatalities and severe injuries are often less represented compared to other events. The severity prediction performance of the existing models is poor due to the highly imbalanced samples of different severity levels within a given dataset. This paper proposes a machine learning based analysis framework from a multi-class classification perspective for accurate recognition of the driver injury patterns. The proposed framework includes preprocessing, classification, evaluation and application of a given dataset. This framework is verified based on the three years single-vehicle ROR (run-off-road) crash records collected in Washington State from 2011 to 2013. At first, thirteen most important safety-related variables are recognized through random forests. Then, the four driver's injury severity levels viz., fatal/serious injury, evident injury, possible injury, and no injury are predicted by integrating the decomposed binary neural network models to achieve better performance. Finally, a sensitivity analysis is carried out to interpret variables’ impacts on the decomposed injury severity levels. The study shows that lack of restraint, female drivers, truck usage, driver impairment, driver distraction, vehicle overturn (rollover), dawn/dusk, and overtaking are the leading factors contributing to the driver fatalities or severe injuries in a single-vehicle ROR crash. Most of the findings are consistent with the previous studies. The experimental results validate the effectiveness of the proposed framework which can be further applied for pattern recognition in traffic safety research.
AB - Detecting driver injury patterns is a typical classification problem. Crash data sets are highly skewed where fatalities and severe injuries are often less represented compared to other events. The severity prediction performance of the existing models is poor due to the highly imbalanced samples of different severity levels within a given dataset. This paper proposes a machine learning based analysis framework from a multi-class classification perspective for accurate recognition of the driver injury patterns. The proposed framework includes preprocessing, classification, evaluation and application of a given dataset. This framework is verified based on the three years single-vehicle ROR (run-off-road) crash records collected in Washington State from 2011 to 2013. At first, thirteen most important safety-related variables are recognized through random forests. Then, the four driver's injury severity levels viz., fatal/serious injury, evident injury, possible injury, and no injury are predicted by integrating the decomposed binary neural network models to achieve better performance. Finally, a sensitivity analysis is carried out to interpret variables’ impacts on the decomposed injury severity levels. The study shows that lack of restraint, female drivers, truck usage, driver impairment, driver distraction, vehicle overturn (rollover), dawn/dusk, and overtaking are the leading factors contributing to the driver fatalities or severe injuries in a single-vehicle ROR crash. Most of the findings are consistent with the previous studies. The experimental results validate the effectiveness of the proposed framework which can be further applied for pattern recognition in traffic safety research.
KW - Injury severity pattern
KW - Machine learning
KW - Multi-class imbalanced learning framework
KW - Run-off-road crash
KW - Sensitivity analysis
KW - Traffic safety
UR - http://www.scopus.com/inward/record.url?scp=85051677244&partnerID=8YFLogxK
U2 - 10.1016/j.aap.2018.08.011
DO - 10.1016/j.aap.2018.08.011
M3 - Article
C2 - 30138770
AN - SCOPUS:85051677244
SN - 0001-4575
VL - 120
SP - 152
EP - 164
JO - Accident Analysis and Prevention
JF - Accident Analysis and Prevention
ER -