Bike-sharing systems (BSS) are being studied for their potential to enhance urban accessibility and sustainable mobility. Despite its popularity, effectively managing BSS encounters chal-lenges due to demand-supply imbalances. The significance of BSS lies in accurately predicting bike demand at various stations, a task we approach using regression methods such as Ran-dom Forest (RF), eXtreme Gradient Boosting (XGBoost), Regularized Linear Regression (Ridge), and Least Absolute Shrinkage and Selection Operator (LASSO), for short-term pre-dictions. The study utilizes data from Los Angeles on bicycle-sharing and weather conditions, alongside the p-median method for station clustering. The results demonstrate that combining both datasets yields accurate predictions at the city level, with an error rate of 0.1%, and at the station level at 18%. Notably, RF emerges as the most accurate method among the regression models examined.
Using Machine Learning to Assess the Relevance of Weather Conditions for Short-Term Demand Predictions in Bike-Sharing Systems / Afsari, Marzieh; Dastmard, Mousaalreza; Gentile, Guido. - (2024). (Intervento presentato al convegno 12th Symposium of the European Association for Research in Transportation tenutosi a Finland).
Using Machine Learning to Assess the Relevance of Weather Conditions for Short-Term Demand Predictions in Bike-Sharing Systems
Marzieh Afsari;Mousaalreza Dastmard;Guido Gentile
2024
Abstract
Bike-sharing systems (BSS) are being studied for their potential to enhance urban accessibility and sustainable mobility. Despite its popularity, effectively managing BSS encounters chal-lenges due to demand-supply imbalances. The significance of BSS lies in accurately predicting bike demand at various stations, a task we approach using regression methods such as Ran-dom Forest (RF), eXtreme Gradient Boosting (XGBoost), Regularized Linear Regression (Ridge), and Least Absolute Shrinkage and Selection Operator (LASSO), for short-term pre-dictions. The study utilizes data from Los Angeles on bicycle-sharing and weather conditions, alongside the p-median method for station clustering. The results demonstrate that combining both datasets yields accurate predictions at the city level, with an error rate of 0.1%, and at the station level at 18%. Notably, RF emerges as the most accurate method among the regression models examined.I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.


