Event logs are data sets recording the executions (called cases) of a business process. Several process discovery algorithms have been defined to mine event logs and discover models of how activities of logged processes are being executed (activity traces). In the process discovery problems, the Pareto principle plays an important role. In fact, it is quite common that a large portion of log traces is held by a small fraction of top-frequent variants. Hence, accounting for the expected Pareto distribution of event logs, traditional process discovery algorithms are commonly used to discover process models by analyzing the prevalent trace behaviors. However, the Pareto principle is not always verified, especially in complex processes, where the majority of traces in the event log is often spanned on a high number of top-frequent trace-variants. In addition, traditional process discovery algorithms perform an offline analysis of event logs, assuming that logged processes remain in a steady state over time. But, the steady state is rarely the real-world case due to conceptual drifts. In this study, we use the traditional process discovery algorithms under the dynamic conditions of real-world processes. To this aim, we define two approaches, namely FAIRYP and FAIRYNP , which detect drifts in the conformance of traces to process models and discover new process models on drifted traces. In FAIRYP , a new process model is discovered on an extraction-based representation of a drift. In FAIRYNP , a new process model is discovered on an abstraction-based representation of a drift. The experimental results analyze the performance of the proposed approaches, also compared to a few related methods, showing the effectiveness of FAIRYP in Pareto cases and FAIRYNP in non-Pareto cases, respectively.

Handling Concept Drifts with Traditional Process Discovery Algorithms

Vincenzo Pasquadibisceglie
2025-01-01

Abstract

Event logs are data sets recording the executions (called cases) of a business process. Several process discovery algorithms have been defined to mine event logs and discover models of how activities of logged processes are being executed (activity traces). In the process discovery problems, the Pareto principle plays an important role. In fact, it is quite common that a large portion of log traces is held by a small fraction of top-frequent variants. Hence, accounting for the expected Pareto distribution of event logs, traditional process discovery algorithms are commonly used to discover process models by analyzing the prevalent trace behaviors. However, the Pareto principle is not always verified, especially in complex processes, where the majority of traces in the event log is often spanned on a high number of top-frequent trace-variants. In addition, traditional process discovery algorithms perform an offline analysis of event logs, assuming that logged processes remain in a steady state over time. But, the steady state is rarely the real-world case due to conceptual drifts. In this study, we use the traditional process discovery algorithms under the dynamic conditions of real-world processes. To this aim, we define two approaches, namely FAIRYP and FAIRYNP , which detect drifts in the conformance of traces to process models and discover new process models on drifted traces. In FAIRYP , a new process model is discovered on an extraction-based representation of a drift. In FAIRYNP , a new process model is discovered on an abstraction-based representation of a drift. The experimental results analyze the performance of the proposed approaches, also compared to a few related methods, showing the effectiveness of FAIRYP in Pareto cases and FAIRYNP in non-Pareto cases, respectively.
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11586/541100
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact