Road traffic accidents are a significant global health concern with far-reaching economic consequences. In an innovative bid to address this issue, our study predicts accident severity in Rome, leveraging a comprehensive dataset from 2006 to 2022, a first in the literature. We analyzed multiple factors, including weather conditions, road and vehicle conditions, types of accidents, and time-related aspects. Distinguishing our study, we applied one-hot encoding to categorical variables, demonstrating superior model performance over traditional label encoding. Additionally, we employed the Synthetic Minority Over-sampling Technique (SMOTE) to handle data imbalance and provided a detailed analysis of its impacts on model performance. Our key innovation lies in implementing conformal prediction to quantify prediction uncertainty. Given the prevalent skewness in traffic accident datasets, this technique enhances decisionmaking reliability and precision. We deployed a series of machine learning models, with the Extreme Gradient Boost (XGBoost) algorithm outperforming others in predicting injury severity, boasting a remarkable 77% accuracy rate. Implementing SHapley Additive exPlanations (SHAP) ensured model interpretability, underscoring the type of vehicles involved, the nature of the accident, and road shape as the most influential factors. In conclusion, our approach combining XGBoost, one-hot encoding, SMOTE, and conformal prediction provides a comprehensive, efficient, and transparent method for predicting road accident severity. Incorporating conformal prediction offers critical insights into model uncertainty, significantly aiding decision-making in road traffic safety. Additionally, SHAP analysis highlights the key factors contributing to accident severity, guiding focused preventive strategies.

Predictive analytics for road traffic accidents: exploring severity through conformal prediction / Eldafrawi, Mohamed; Varghese, KEN KOSHY; Afsari, Marzieh; Babapourdijojin, Mahnaz; Gentile, Guido. - (2024). (Intervento presentato al convegno 2024 TRB Annual Meeting tenutosi a Washington DC, USA).

Predictive analytics for road traffic accidents: exploring severity through conformal prediction

Mohamed Eldafrawi
;
Ken Koshy Varghese
;
Marzieh Afsari
;
Mahnaz Babapourdijojin
;
Guido Gentile
2024

Abstract

Road traffic accidents are a significant global health concern with far-reaching economic consequences. In an innovative bid to address this issue, our study predicts accident severity in Rome, leveraging a comprehensive dataset from 2006 to 2022, a first in the literature. We analyzed multiple factors, including weather conditions, road and vehicle conditions, types of accidents, and time-related aspects. Distinguishing our study, we applied one-hot encoding to categorical variables, demonstrating superior model performance over traditional label encoding. Additionally, we employed the Synthetic Minority Over-sampling Technique (SMOTE) to handle data imbalance and provided a detailed analysis of its impacts on model performance. Our key innovation lies in implementing conformal prediction to quantify prediction uncertainty. Given the prevalent skewness in traffic accident datasets, this technique enhances decisionmaking reliability and precision. We deployed a series of machine learning models, with the Extreme Gradient Boost (XGBoost) algorithm outperforming others in predicting injury severity, boasting a remarkable 77% accuracy rate. Implementing SHapley Additive exPlanations (SHAP) ensured model interpretability, underscoring the type of vehicles involved, the nature of the accident, and road shape as the most influential factors. In conclusion, our approach combining XGBoost, one-hot encoding, SMOTE, and conformal prediction provides a comprehensive, efficient, and transparent method for predicting road accident severity. Incorporating conformal prediction offers critical insights into model uncertainty, significantly aiding decision-making in road traffic safety. Additionally, SHAP analysis highlights the key factors contributing to accident severity, guiding focused preventive strategies.
2024
2024 TRB Annual Meeting
road accident severity; extreme gradient boosting; onehot encoding; synthetic minority oversampling technique; conformal prediction; shap
04 Pubblicazione in atti di convegno::04b Atto di convegno in volume
Predictive analytics for road traffic accidents: exploring severity through conformal prediction / Eldafrawi, Mohamed; Varghese, KEN KOSHY; Afsari, Marzieh; Babapourdijojin, Mahnaz; Gentile, Guido. - (2024). (Intervento presentato al convegno 2024 TRB Annual Meeting tenutosi a Washington DC, USA).
File allegati a questo prodotto
File Dimensione Formato  
Eldafrawi_Predictive-analytics-road_2024.pdf

solo gestori archivio

Note: contributo
Tipologia: Documento in Pre-print (manoscritto inviato all'editore, precedente alla peer review)
Licenza: Tutti i diritti riservati (All rights reserved)
Dimensione 2.61 MB
Formato Adobe PDF
2.61 MB Adobe PDF   Contatta l'autore

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11573/1690432
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact