Road traffic accidents are a significant global health concern with far-reaching economic consequences. In an innovative bid to address this issue, our study predicts accident severity in Rome, leveraging a comprehensive dataset from 2006 to 2022, a first in the literature. We analyzed multiple factors, including weather conditions, road and vehicle conditions, types of accidents, and time-related aspects. Distinguishing our study, we applied one-hot encoding to categorical variables, demonstrating superior model performance over traditional label encoding. Additionally, we employed the Synthetic Minority Over-sampling Technique (SMOTE) to handle data imbalance and provided a detailed analysis of its impacts on model performance. Our key innovation lies in implementing conformal prediction to quantify prediction uncertainty. Given the prevalent skewness in traffic accident datasets, this technique enhances decisionmaking reliability and precision. We deployed a series of machine learning models, with the Extreme Gradient Boost (XGBoost) algorithm outperforming others in predicting injury severity, boasting a remarkable 77% accuracy rate. Implementing SHapley Additive exPlanations (SHAP) ensured model interpretability, underscoring the type of vehicles involved, the nature of the accident, and road shape as the most influential factors. In conclusion, our approach combining XGBoost, one-hot encoding, SMOTE, and conformal prediction provides a comprehensive, efficient, and transparent method for predicting road accident severity. Incorporating conformal prediction offers critical insights into model uncertainty, significantly aiding decision-making in road traffic safety. Additionally, SHAP analysis highlights the key factors contributing to accident severity, guiding focused preventive strategies.
Predictive analytics for road traffic accidents: exploring severity through conformal prediction / Eldafrawi, Mohamed; Varghese, KEN KOSHY; Afsari, Marzieh; Babapourdijojin, Mahnaz; Gentile, Guido. - (2024). (Intervento presentato al convegno 2024 TRB Annual Meeting tenutosi a Washington DC, USA).
Predictive analytics for road traffic accidents: exploring severity through conformal prediction
Mohamed Eldafrawi
;Ken Koshy Varghese
;Marzieh Afsari
;Mahnaz Babapourdijojin
;Guido Gentile
2024
Abstract
Road traffic accidents are a significant global health concern with far-reaching economic consequences. In an innovative bid to address this issue, our study predicts accident severity in Rome, leveraging a comprehensive dataset from 2006 to 2022, a first in the literature. We analyzed multiple factors, including weather conditions, road and vehicle conditions, types of accidents, and time-related aspects. Distinguishing our study, we applied one-hot encoding to categorical variables, demonstrating superior model performance over traditional label encoding. Additionally, we employed the Synthetic Minority Over-sampling Technique (SMOTE) to handle data imbalance and provided a detailed analysis of its impacts on model performance. Our key innovation lies in implementing conformal prediction to quantify prediction uncertainty. Given the prevalent skewness in traffic accident datasets, this technique enhances decisionmaking reliability and precision. We deployed a series of machine learning models, with the Extreme Gradient Boost (XGBoost) algorithm outperforming others in predicting injury severity, boasting a remarkable 77% accuracy rate. Implementing SHapley Additive exPlanations (SHAP) ensured model interpretability, underscoring the type of vehicles involved, the nature of the accident, and road shape as the most influential factors. In conclusion, our approach combining XGBoost, one-hot encoding, SMOTE, and conformal prediction provides a comprehensive, efficient, and transparent method for predicting road accident severity. Incorporating conformal prediction offers critical insights into model uncertainty, significantly aiding decision-making in road traffic safety. Additionally, SHAP analysis highlights the key factors contributing to accident severity, guiding focused preventive strategies.File | Dimensione | Formato | |
---|---|---|---|
Eldafrawi_Predictive-analytics-road_2024.pdf
solo gestori archivio
Note: contributo
Tipologia:
Documento in Pre-print (manoscritto inviato all'editore, precedente alla peer review)
Licenza:
Tutti i diritti riservati (All rights reserved)
Dimensione
2.61 MB
Formato
Adobe PDF
|
2.61 MB | Adobe PDF | Contatta l'autore |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.