Process Mining is a discipline that sits between data mining and business process management. The starting point of process mining is an event log, which is analyzed to extract useful insights and recurrent patterns about how processes are executed within organizations. However, often its concrete application is hampered by the considerable preparation effort that needs to be conducted by human experts to collect the required data for building a suitable event log. Instead, event logs need to be extracted from different and heterogeneous data sources, often using customized extraction scripts whose implementation requires both technical and domain expertise. While this is recognized as a relevant issue in the process mining community, literature solutions tend to be ad-hoc for particular application contexts, or not enough structured to be easily applied in practice. In this paper, we tackle this issue by proposing an interactive and general-purpose approach to support organizations in generating simulated event logs that can be employed to discover the structure of the data pipelines executed within a business process. A data pipeline is a composite workflow for processing data that is enacted as part of process execution. To assess the practical applicability of the approach, we show the results of a preliminary evaluation performed in a digital marketing scenario in the range of the recently funded H2020 DataCloud project.
An Interactive Approach to Support Event Log Generation for Data Pipeline Discovery / Benvenuti, D.; Falleroni, L.; Marrella, A.; Perales, F.. - (2022), pp. 1172-1177. (Intervento presentato al convegno International Computer Software and Applications Conference tenutosi a Los Alamitos; USA) [10.1109/COMPSAC54236.2022.00184].
An Interactive Approach to Support Event Log Generation for Data Pipeline Discovery
Benvenuti D.
;Marrella A.
;
2022
Abstract
Process Mining is a discipline that sits between data mining and business process management. The starting point of process mining is an event log, which is analyzed to extract useful insights and recurrent patterns about how processes are executed within organizations. However, often its concrete application is hampered by the considerable preparation effort that needs to be conducted by human experts to collect the required data for building a suitable event log. Instead, event logs need to be extracted from different and heterogeneous data sources, often using customized extraction scripts whose implementation requires both technical and domain expertise. While this is recognized as a relevant issue in the process mining community, literature solutions tend to be ad-hoc for particular application contexts, or not enough structured to be easily applied in practice. In this paper, we tackle this issue by proposing an interactive and general-purpose approach to support organizations in generating simulated event logs that can be employed to discover the structure of the data pipelines executed within a business process. A data pipeline is a composite workflow for processing data that is enacted as part of process execution. To assess the practical applicability of the approach, we show the results of a preliminary evaluation performed in a digital marketing scenario in the range of the recently funded H2020 DataCloud project.File | Dimensione | Formato | |
---|---|---|---|
Benvenuti_An-Interactive_2022.pdf
solo gestori archivio
Tipologia:
Versione editoriale (versione pubblicata con il layout dell'editore)
Licenza:
Tutti i diritti riservati (All rights reserved)
Dimensione
432.45 kB
Formato
Adobe PDF
|
432.45 kB | Adobe PDF | Contatta l'autore |
Benvneuti_preprint_An-interactive_2022.pdf
accesso aperto
Note: DOI: 10.1109/COMPSAC54236.2022.00184
Tipologia:
Documento in Pre-print (manoscritto inviato all'editore, precedente alla peer review)
Licenza:
Tutti i diritti riservati (All rights reserved)
Dimensione
969.03 kB
Formato
Adobe PDF
|
969.03 kB | Adobe PDF |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.