Process mining allows analysts to exploit logs of historical executions of business processes to extract insights regarding the actual performance of these processes. One of the most widely studied process mining operations is automated process discovery. An automated process discovery method takes as input an event log, and produces as output a business process model that captures the control-flow relations between tasks that are observed in or implied by the event log. Various automated process discovery methods have been proposed in the past two decades, striking different tradeoffs between scalability, accuracy and complexity of the resulting models. However, these methods have been evaluated in an ad-hoc manner, employing different datasets, experimental setups, evaluation measures and baselines, often leading to incomparable conclusions and sometimes unreproducible results due to the use of closed datasets. This article provides a systematic review and comparative evaluation of automated process discovery methods, using an open-source benchmark and covering twelve publicly-available real-life event logs, twelve proprietary real-life event logs, and nine quality metrics. The results highlight gaps and unexplored tradeoffs in the field, including the lack of scalability of some methods and a strong divergence in their performance with respect to the different quality metrics used.

Automated discovery of process models from event logs: review and benchmark / Augusto, Adriano; Conforti, Raffaele; Dumas, Marlon; La Rosa, Marcello; Maggi, Fabrizio M.; Marrella, Andrea; Mecella, Massimo; Soo, Allar. - In: IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING. - ISSN 1041-4347. - ELETTRONICO. - 31:4(2019), pp. 686-705. [10.1109/TKDE.2018.2841877]

Automated discovery of process models from event logs: review and benchmark

Fabrizio M. Maggi;ANDREA MARRELLA
;
Massimo Mecella;
2019

Abstract

Process mining allows analysts to exploit logs of historical executions of business processes to extract insights regarding the actual performance of these processes. One of the most widely studied process mining operations is automated process discovery. An automated process discovery method takes as input an event log, and produces as output a business process model that captures the control-flow relations between tasks that are observed in or implied by the event log. Various automated process discovery methods have been proposed in the past two decades, striking different tradeoffs between scalability, accuracy and complexity of the resulting models. However, these methods have been evaluated in an ad-hoc manner, employing different datasets, experimental setups, evaluation measures and baselines, often leading to incomparable conclusions and sometimes unreproducible results due to the use of closed datasets. This article provides a systematic review and comparative evaluation of automated process discovery methods, using an open-source benchmark and covering twelve publicly-available real-life event logs, twelve proprietary real-life event logs, and nine quality metrics. The results highlight gaps and unexplored tradeoffs in the field, including the lack of scalability of some methods and a strong divergence in their performance with respect to the different quality metrics used.
2019
automated process discovery; benchmark; Benchmark testing; Data mining; Data models; Process control; Process mining; survey; Systematics; Task analysis; Information Systems; Computer Science Applications1707 Computer Vision and Pattern Recognition; Computational Theory and Mathematics
01 Pubblicazione su rivista::01a Articolo in rivista
Automated discovery of process models from event logs: review and benchmark / Augusto, Adriano; Conforti, Raffaele; Dumas, Marlon; La Rosa, Marcello; Maggi, Fabrizio M.; Marrella, Andrea; Mecella, Massimo; Soo, Allar. - In: IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING. - ISSN 1041-4347. - ELETTRONICO. - 31:4(2019), pp. 686-705. [10.1109/TKDE.2018.2841877]
File allegati a questo prodotto
File Dimensione Formato  
Augusto_Preprint-automated-discovery_2019.pdf

accesso aperto

Note: https://ieeexplore.ieee.org/document/8368306/
Tipologia: Documento in Pre-print (manoscritto inviato all'editore, precedente alla peer review)
Licenza: Tutti i diritti riservati (All rights reserved)
Dimensione 7.45 MB
Formato Adobe PDF
7.45 MB Adobe PDF
Augusto_Automated-discovery_2019.pdf

solo gestori archivio

Tipologia: Versione editoriale (versione pubblicata con il layout dell'editore)
Licenza: Tutti i diritti riservati (All rights reserved)
Dimensione 1.01 MB
Formato Adobe PDF
1.01 MB Adobe PDF   Contatta l'autore

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11573/1122758
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 268
  • ???jsp.display-item.citation.isi??? 178
social impact