541 research outputs found
Unsupervised event abstraction using pattern abstraction and local process models
Process mining analyzes business processes based on events stored in event logs. However, some recorded events may correspond to activities on a very low level of abstraction. When events are recorded on a too low level of abstraction, process discovery methods tend to generate overgeneralizing process models. Grouping low-level events to higher level activities, i.e., event abstraction, can be used to discover better process models. Existing event abstraction methods are mainly based on common sub-sequences and clustering techniques. In this paper, we propose to first discover local process models and, then, use those models to lift the event log to a higher level of abstraction. Our conjecture is that process models discovered on the obtained high-level event log return process models of higher quality: their fitness and precision scores are more balanced. We show this with preliminary results on several real-life event logs
Unsupervised event abstraction using pattern abstraction and local process models
Process mining analyzes business processes based on events stored in event logs. However, some recorded events may correspond to activities on a very low level of abstraction. When events are recorded on a too low level of abstraction, process discovery methods tend to generate overgeneralizing process models. Grouping low-level events to higher level activities, i.e., event abstraction, can be used to discover better process models. Existing event abstraction methods are mainly based on common sub-sequences and clustering techniques. In this paper, we propose to first discover local process models and, then, use those models to lift the event log to a higher level of abstraction. Our conjecture is that process models discovered on the obtained high-level event log return process models of higher quality: their fitness and precision scores are more balanced. We show this with preliminary results on several real-life event logs
On the use of hierarchical subtrace mining for efficient local process model mining
Mining local patterns of process behavior is a vital tool for the analysis of event data that originates from flexible processes, for which it is generally not possible to describe the behavior of the process in a single process model without overgeneralizing the behavior allowed by the process. Several techniques for mining such local patterns have been developed throughout the years, including Local Process Model (LPM) mining and the hierarchical mining of frequent subtraces (i.e., subprocesses). These two techniques can be considered to be orthogonal, i.e., they provide different types of insights on the behavior observed in an event log. As a consequence, it is often useful to apply both techniques to the data. However, both techniques can be computationally intensive, hindering data analysis. In this work, we explore how the output of a subtrace mining approach can be used to mine LPMs more efficiently. We show on a collection of real-life event logs that exploiting the ordering constraints extracted from subtraces lowers the computation time needed for LPM mining compared to state-of-the-art techniques, while at the same time mining higher quality LPMs. Additionally, by mining LPMs from subtraces, we can obtain a more structured and meaningful representation of subprocesses allowing for classic process-flow constructs such as parallel ordering, choices, and loops, besides the precedence relations shown by subtraces.</p
On generation of time-based label refinements
Process mining is a research field focused on the analysis of event data with
the aim of extracting insights in processes. Applying process mining techniques
on data from smart home environments has the potential to provide valuable
insights in (un)healthy habits and to contribute to ambient assisted living
solutions. Finding the right event labels to enable application of process
mining techniques is however far from trivial, as simply using the triggering
sensor as the label for sensor events results in uninformative models that
allow for too much behavior (overgeneralizing). Refinements of sensor level
event labels suggested by domain experts have shown to enable discovery of more
precise and insightful process models. However, there exist no automated
approach to generate refinements of event labels in the context of process
mining. In this paper we propose a framework for automated generation of label
refinements based on the time attribute of events. We show on a case study with
real life smart home event data that behaviorally more specific, and therefore
more insightful, process models can be found by using automatically generated
refined labels in process discovery.Comment: Accepted at CS&P workshop 2016 Overlap in preliminaries with
arXiv:1606.0725
Heuristics for high-utility local process model mining
Local Process Models (LPMs) describe structured fragments of process behavior occurring in the context of less structured business processes. In contrast to traditional support-based LPM discovery, which aims to generate a collection of process models that describe highly frequent behavior, High-Utility Local Process Model (HU-LPM) discovery aims to generate a collection of process models that provide useful business insights by specifying a utility function. Mining LPMs is a computationally expensive task, because of the large search space of LPMs. In supportbased LPM mining, the search space is constrained by making use of the property that support is anti-monotonic. We show that in general, we cannot assume a provided utility function to be anti-monotonic, therefore, the search space of HU-LPMs cannot be reduced without loss. We propose four heuristic methods to speed up the mining of HU-LPMs while still being able to discover useful HU-LPMs. We demonstrate their applicability on three real-life data sets
Classifying Process Instances Using Recurrent Neural Networks
Process Mining consists of techniques where logs created by operative systems are transformed into process models. In process mining tools it is often desired to be able to classify ongoing process instances, e.g., to predict how long the process will still require to complete, or to classify process instances to different classes based only on the activities that have occurred in the process instance thus far. Recurrent neural networks and its subclasses, such as Gated Recurrent Unit (GRU) and Long Short-Term Memory (LSTM), have been demonstrated to be able to learn relevant temporal features for subsequent classification tasks. In this paper we apply recurrent neural networks to classifying process instances. The proposed model is trained in a supervised fashion using labeled process instances extracted from event log traces. This is the first time we know of GRU having been used in classifying business process instances. Our main experimental results shows that GRU outperforms LSTM remarkably in training time while giving almost identical accuracies to LSTM models. Additional contributions of our paper are improving the classification model training time by filtering infrequent activities, which is a technique commonly used, e.g., in Natural Language Processing (NLP).Peer reviewe
DeepAlign: Alignment-based Process Anomaly Correction using Recurrent Neural Networks
In this paper, we propose DeepAlign, a novel approach to multi-perspective
process anomaly correction, based on recurrent neural networks and
bidirectional beam search. At the core of the DeepAlign algorithm are two
recurrent neural networks trained to predict the next event. One is reading
sequences of process executions from left to right, while the other is reading
the sequences from right to left. By combining the predictive capabilities of
both neural networks, we show that it is possible to calculate sequence
alignments, which are used to detect and correct anomalies. DeepAlign utilizes
the case-level and event-level attributes to closely model the decisions within
a process. We evaluate the performance of our approach on an elaborate data
corpus of 252 realistic synthetic event logs and compare it to three
state-of-the-art conformance checking methods. DeepAlign produces better
corrections than the rest of the field reaching an overall score of
across all datasets, whereas the best comparable state-of-the-art
method reaches
Encoding conformance checking artefacts in SAT
Conformance checking strongly relies on the computation of artefacts, which enable reasoning on the relation between observed and modeled behavior. This paper shows how important conformance artefacts like alignments, anti-alignments or even multi-alignments, defined over the edit distance, can be computed by encoding the problem as a SAT instance. From a general perspective, the work advocates for a unified family of techniques that can compute conformance artefacts in the same way. The prototype implementation of the techniques presented in this paper show capabilities for dealing with some of the current benchmarks, and potential for the near future when optimizations similar to the ones in the literature are incorporated.Peer ReviewedPostprint (author's final draft
- …