Since its emergence over two decades ago, process mining has flourished as a discipline, with numerous contributions to its theory, widespread practical applications, and mature support by commercial tooling environments. However, its potential for significant organisational impact is hampered by poor quality event data. Process mining starts with the acquisition and preparation of event data coming from different data sources. These are then transformed into event logs, consisting of process execution traces including multiple events. In real-life scenarios, event logs suffer from significant data quality problems, which must be recognised and effectively resolved for obtaining meaningful insights from process mining analysis. Despite its importance, the topic of data quality in process mining has received limited attention. In this paper, we discuss the emerging challenges related to process-data quality from both a research and practical point of view. Additionally, we present a corresponding research agenda with key research directions.

Process-Data Quality: The True Frontier of Process Mining / Ter Hofstede, Arthur H. M.; Koschmider, Agnes; Marrella, Andrea; Andrews, Robert; Fischer, Dominik A.; Sadeghianasl, Sareh; Thandar Wynn, Moe; Comuzzi, Marco; De Weerdt, Jochen; Goel, Kanika; Martin, Niels; Soffer, Pnina. - In: ACM JOURNAL OF DATA AND INFORMATION QUALITY. - ISSN 1936-1955. - 15:3(2023), pp. 1-21. [10.1145/3613247]

Process-Data Quality: The True Frontier of Process Mining

ANDREA MARRELLA
;
2023

Abstract

Since its emergence over two decades ago, process mining has flourished as a discipline, with numerous contributions to its theory, widespread practical applications, and mature support by commercial tooling environments. However, its potential for significant organisational impact is hampered by poor quality event data. Process mining starts with the acquisition and preparation of event data coming from different data sources. These are then transformed into event logs, consisting of process execution traces including multiple events. In real-life scenarios, event logs suffer from significant data quality problems, which must be recognised and effectively resolved for obtaining meaningful insights from process mining analysis. Despite its importance, the topic of data quality in process mining has received limited attention. In this paper, we discuss the emerging challenges related to process-data quality from both a research and practical point of view. Additionally, we present a corresponding research agenda with key research directions.
2023
event data quality; process mining; event log
01 Pubblicazione su rivista::01a Articolo in rivista
Process-Data Quality: The True Frontier of Process Mining / Ter Hofstede, Arthur H. M.; Koschmider, Agnes; Marrella, Andrea; Andrews, Robert; Fischer, Dominik A.; Sadeghianasl, Sareh; Thandar Wynn, Moe; Comuzzi, Marco; De Weerdt, Jochen; Goel, Kanika; Martin, Niels; Soffer, Pnina. - In: ACM JOURNAL OF DATA AND INFORMATION QUALITY. - ISSN 1936-1955. - 15:3(2023), pp. 1-21. [10.1145/3613247]
File allegati a questo prodotto
File Dimensione Formato  
TerHofstede_Process-data_2023.pdf

accesso aperto

Note: https://doi.org/10.1145/3613247
Tipologia: Versione editoriale (versione pubblicata con il layout dell'editore)
Licenza: Creative commons
Dimensione 1.13 MB
Formato Adobe PDF
1.13 MB Adobe PDF

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11573/1691545
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 5
  • ???jsp.display-item.citation.isi??? 5
social impact