Big Data pipelines are essential for leveraging Dark Data, i.e., data collected but not used and turned into value. However, tapping their potential requires going beyond the current approaches and frameworks for managing their life-cycle. In this paper, we present the challenges associated to the achievement of the Pipeline Discovery task, which aims to learn the structure of a Big Data pipeline by extracting, processing and interpreting huge amounts of event data produced by several data sources. Then, we discuss how traditional Process Mining solutions can be potentially employed and customized to overcome such challenges, outlining a research agenda for future work in this area.
Big Data Pipeline Discovery through Process Mining: Challenges and Research Directions / Agostinelli, Simone; Benvenuti, Dario; DE LUZI, Francesca; Marrella, Andrea. - 2952:(2021), pp. 50-55. (Intervento presentato al convegno 1st Italian Forum on Business Process Management, ITBPM 2021 tenutosi a Rome; Italy).
Big Data Pipeline Discovery through Process Mining: Challenges and Research Directions
Simone Agostinelli
;Dario Benvenuti
;Francesca De Luzi
;Andrea Marrella
2021
Abstract
Big Data pipelines are essential for leveraging Dark Data, i.e., data collected but not used and turned into value. However, tapping their potential requires going beyond the current approaches and frameworks for managing their life-cycle. In this paper, we present the challenges associated to the achievement of the Pipeline Discovery task, which aims to learn the structure of a Big Data pipeline by extracting, processing and interpreting huge amounts of event data produced by several data sources. Then, we discuss how traditional Process Mining solutions can be potentially employed and customized to overcome such challenges, outlining a research agenda for future work in this area.File | Dimensione | Formato | |
---|---|---|---|
Agostinelli_Big-data_2021.pdf
accesso aperto
Note: https://ceur-ws.org/Vol-2952/paper_298a.pdf
Tipologia:
Versione editoriale (versione pubblicata con il layout dell'editore)
Licenza:
Creative commons
Dimensione
346.57 kB
Formato
Adobe PDF
|
346.57 kB | Adobe PDF |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.