The aim of process discovery is to build a process model from an event log without prior information about the process. The discovery of declarative process models is useful when a process works in an unpredictable and unstable environment since several allowed paths can be represented as a compact set of rules. One of the tools available in the literature for discovering declarative models from logs is the Declare Miner, a plug-in of the process mining tool ProM. Using this plug-in, the discovered models are represented using DECLARE, a declarative process modeling language based on LTL for finite traces. However, the high execution times of the Declare Miner when processing large sets of data hampers the applicability of the tool to real-life settings. Therefore, in this paper, we propose a new approach for the discovery of DECLARE models based on the combination of an Apriori algorithm and a group of algorithms for Sequence Analysis to enhance the time performance of the plug-in. The approach has been developed in a way that it is easy to be parallelized using two different partitioning methods: the search space partitioning, in which different groups of candidate constraints are processed in parallel, and the database partitioning, in which different chunks of the log are processed at the same time. The approach has been implemented in ProM in its sequential version and in two multi-threading implementations leveraging these two partitioning methods. All the new variants of the plug-in have been evaluated using a large set of synthetic and real-life event logs.

Parallel algorithms for the automated discovery of declarative process models / Maggi, F. M.; Di Ciccio, C.; Di Francescomarino, C.; Kala, T.. - In: INFORMATION SYSTEMS. - ISSN 0306-4379. - 74:(2018), pp. 136-152. [10.1016/j.is.2017.12.002]

Parallel algorithms for the automated discovery of declarative process models

Maggi F. M.;Di Ciccio C.;
2018

Abstract

The aim of process discovery is to build a process model from an event log without prior information about the process. The discovery of declarative process models is useful when a process works in an unpredictable and unstable environment since several allowed paths can be represented as a compact set of rules. One of the tools available in the literature for discovering declarative models from logs is the Declare Miner, a plug-in of the process mining tool ProM. Using this plug-in, the discovered models are represented using DECLARE, a declarative process modeling language based on LTL for finite traces. However, the high execution times of the Declare Miner when processing large sets of data hampers the applicability of the tool to real-life settings. Therefore, in this paper, we propose a new approach for the discovery of DECLARE models based on the combination of an Apriori algorithm and a group of algorithms for Sequence Analysis to enhance the time performance of the plug-in. The approach has been developed in a way that it is easy to be parallelized using two different partitioning methods: the search space partitioning, in which different groups of candidate constraints are processed in parallel, and the database partitioning, in which different chunks of the log are processed at the same time. The approach has been implemented in ProM in its sequential version and in two multi-threading implementations leveraging these two partitioning methods. All the new variants of the plug-in have been evaluated using a large set of synthetic and real-life event logs.
2018
Apriori algorithm; Declarative process models; Process discovery; Process mining; Sequence analysis
01 Pubblicazione su rivista::01a Articolo in rivista
Parallel algorithms for the automated discovery of declarative process models / Maggi, F. M.; Di Ciccio, C.; Di Francescomarino, C.; Kala, T.. - In: INFORMATION SYSTEMS. - ISSN 0306-4379. - 74:(2018), pp. 136-152. [10.1016/j.is.2017.12.002]
File allegati a questo prodotto
File Dimensione Formato  
Maggi_Parallel-algorithms_2018.pdf

solo gestori archivio

Tipologia: Versione editoriale (versione pubblicata con il layout dell'editore)
Licenza: Tutti i diritti riservati (All rights reserved)
Dimensione 3.09 MB
Formato Adobe PDF
3.09 MB Adobe PDF   Contatta l'autore
Maggi_Preprint_Parallel-algorithms_2018.pdf

accesso aperto

Note: https://doi.org/10.1016/j.is.2017.12.002
Tipologia: Documento in Pre-print (manoscritto inviato all'editore, precedente alla peer review)
Licenza: Tutti i diritti riservati (All rights reserved)
Dimensione 772.72 kB
Formato Adobe PDF
772.72 kB Adobe PDF

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11573/1352693
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 36
  • ???jsp.display-item.citation.isi??? 25
social impact