Process mining aims to understand the actual behavior and performance of business processes from event logs recorded by IT systems. A key requirement is that every event in the log must be associated with a unique case identifier (e.g., the order ID in an order-to-cash process). In reality, however, this case ID may not always be present, especially when logs are acquired from different systems or when such systems have not been explicitly designed to offer process-tracking capabilities. Existing techniques for correlating events have worked with assumptions to make the problem tractable: some assume the generative processes to be acyclic while others require heuristic information or user input. In this paper, we lift these assumptions by presenting a novel technique called EC-SA based on probabilistic optimization. Given as input a sequence of timestamped events (the log without case IDs) and a process model describing the underlying business process, our approach returns an event log in which every event is mapped to a case identifier. The approach minimises the misalignment between the generated log and the input process model, and the variance between activity durations across cases. The experiments conducted on a variety of real-life datasets show the advantages of our approach over the state of the art.

A probabilistic approach to event-case correlation for process mining / Bayomie, D.; Di Ciccio, C.; La Rosa, M.; Mendling, J.. - 11788:(2019), pp. 136-152. (Intervento presentato al convegno 38th International Conference on Conceptual Modeling, ER 2019 tenutosi a Salvador; Brazil) [10.1007/978-3-030-33223-5_12].

A probabilistic approach to event-case correlation for process mining

Di Ciccio C.;
2019

Abstract

Process mining aims to understand the actual behavior and performance of business processes from event logs recorded by IT systems. A key requirement is that every event in the log must be associated with a unique case identifier (e.g., the order ID in an order-to-cash process). In reality, however, this case ID may not always be present, especially when logs are acquired from different systems or when such systems have not been explicitly designed to offer process-tracking capabilities. Existing techniques for correlating events have worked with assumptions to make the problem tractable: some assume the generative processes to be acyclic while others require heuristic information or user input. In this paper, we lift these assumptions by presenting a novel technique called EC-SA based on probabilistic optimization. Given as input a sequence of timestamped events (the log without case IDs) and a process model describing the underlying business process, our approach returns an event log in which every event is mapped to a case identifier. The approach minimises the misalignment between the generated log and the input process model, and the variance between activity durations across cases. The experiments conducted on a variety of real-life datasets show the advantages of our approach over the state of the art.
2019
38th International Conference on Conceptual Modeling, ER 2019
event correlation; process mining; simulated annealing
04 Pubblicazione in atti di convegno::04b Atto di convegno in volume
A probabilistic approach to event-case correlation for process mining / Bayomie, D.; Di Ciccio, C.; La Rosa, M.; Mendling, J.. - 11788:(2019), pp. 136-152. (Intervento presentato al convegno 38th International Conference on Conceptual Modeling, ER 2019 tenutosi a Salvador; Brazil) [10.1007/978-3-030-33223-5_12].
File allegati a questo prodotto
File Dimensione Formato  
Bayomie_Probabilistic-Approach_2019.pdf

solo gestori archivio

Tipologia: Versione editoriale (versione pubblicata con il layout dell'editore)
Licenza: Tutti i diritti riservati (All rights reserved)
Dimensione 1.06 MB
Formato Adobe PDF
1.06 MB Adobe PDF   Contatta l'autore
Bayomie_postprint_Probabilistic-Approach_2019.pdf

accesso aperto

Tipologia: Documento in Post-print (versione successiva alla peer review e accettata per la pubblicazione)
Licenza: Tutti i diritti riservati (All rights reserved)
Dimensione 2.12 MB
Formato Adobe PDF
2.12 MB Adobe PDF

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11573/1362067
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 13
  • ???jsp.display-item.citation.isi??? 16
social impact