Hydrogen holds significant potential for decarbonizing various industries, including energy and mobility. However, the limited availability of accident data poses a significant challenge to effective safety risk analysis and assessment. This study leverages large language models to address the critical task of filling gaps in the Hydrogen Incidents and Accidents Database (HIAD) 2.1, a prominent repository of hydrogen-related unwanted events. A three-step Artificial Intelligence-driven algorithm is proposed: (i) a preprocessing phase to standardize and prepare an event description, (ii) a processing phase utilizing OpenAI's sentence embedding technology to extract semantic relationships, and (iii) an enhancement phase employing trained multilayer perceptrons to impute missing data. The algorithm demonstrates promising results in predicting categorical entries and is applied to enhance the entire database, with a specific focus on the 2019 fueling station fire in Sandvika (Norway). This case study highlights the proposed algorithm's potential to improve our understanding of hydrogen-related incidents and contribute to enhanced risk management strategies.

Enhancement of a Hydrogen Incident and Accident Database Using Large Language Models / Tabella, Gianluca; De Fazio, Ivan; Ayalew Belay, Mohammed; Stefana, Elena; Cozzani, Valerio; Paltrinieri, Nicola; Bucelli, Marta. - (2025), pp. 1185-1192. ( 35th European Safety and Reliability & 33rd Society for Risk Analysis Europe Conference Stavanger, Norway ) [10.3850/978-981-94-3281-3_ESREL-SRA-E2025-P4039-cd].

Enhancement of a Hydrogen Incident and Accident Database Using Large Language Models

Elena Stefana;Nicola Paltrinieri;
2025

Abstract

Hydrogen holds significant potential for decarbonizing various industries, including energy and mobility. However, the limited availability of accident data poses a significant challenge to effective safety risk analysis and assessment. This study leverages large language models to address the critical task of filling gaps in the Hydrogen Incidents and Accidents Database (HIAD) 2.1, a prominent repository of hydrogen-related unwanted events. A three-step Artificial Intelligence-driven algorithm is proposed: (i) a preprocessing phase to standardize and prepare an event description, (ii) a processing phase utilizing OpenAI's sentence embedding technology to extract semantic relationships, and (iii) an enhancement phase employing trained multilayer perceptrons to impute missing data. The algorithm demonstrates promising results in predicting categorical entries and is applied to enhance the entire database, with a specific focus on the 2019 fueling station fire in Sandvika (Norway). This case study highlights the proposed algorithm's potential to improve our understanding of hydrogen-related incidents and contribute to enhanced risk management strategies.
2025
35th European Safety and Reliability & 33rd Society for Risk Analysis Europe Conference
Hydrogen; HIAD; Safety; Large language model; Artificial intelligence; Deep learning; Machine learning
04 Pubblicazione in atti di convegno::04b Atto di convegno in volume
Enhancement of a Hydrogen Incident and Accident Database Using Large Language Models / Tabella, Gianluca; De Fazio, Ivan; Ayalew Belay, Mohammed; Stefana, Elena; Cozzani, Valerio; Paltrinieri, Nicola; Bucelli, Marta. - (2025), pp. 1185-1192. ( 35th European Safety and Reliability & 33rd Society for Risk Analysis Europe Conference Stavanger, Norway ) [10.3850/978-981-94-3281-3_ESREL-SRA-E2025-P4039-cd].
File allegati a questo prodotto
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11573/1764797
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact