647 research outputs found
Learning Hybrid Process Models From Events: Process Discovery Without Faking Confidence
Process discovery techniques return process models that are either formal
(precisely describing the possible behaviors) or informal (merely a "picture"
not allowing for any form of formal reasoning). Formal models are able to
classify traces (i.e., sequences of events) as fitting or non-fitting. Most
process mining approaches described in the literature produce such models. This
is in stark contrast with the over 25 available commercial process mining tools
that only discover informal process models that remain deliberately vague on
the precise set of possible traces. There are two main reasons why vendors
resort to such models: scalability and simplicity. In this paper, we propose to
combine the best of both worlds: discovering hybrid process models that have
formal and informal elements. As a proof of concept we present a discovery
technique based on hybrid Petri nets. These models allow for formal reasoning,
but also reveal information that cannot be captured in mainstream formal
models. A novel discovery algorithm returning hybrid Petri nets has been
implemented in ProM and has been applied to several real-life event logs. The
results clearly demonstrate the advantages of remaining "vague" when there is
not enough "evidence" in the data or standard modeling constructs do not "fit".
Moreover, the approach is scalable enough to be incorporated in
industrial-strength process mining tools.Comment: 25 pages, 12 figure
The HyperBagGraph DataEdron: An Enriched Browsing Experience of Multimedia Datasets
Traditional verbatim browsers give back information in a linear way according
to a ranking performed by a search engine that may not be optimal for the
surfer. The latter may need to assess the pertinence of the information
retrieved, particularly when she wants to explore other facets of a
multi-facetted information space. For instance, in a multimedia dataset
different facets such as keywords, authors, publication category, organisations
and figures can be of interest. The facet simultaneous visualisation can help
to gain insights on the information retrieved and call for further searches.
Facets are co-occurence networks, modeled by HyperBag-Graphs -- families of
multisets -- and are in fact linked not only to the publication itself, but to
any chosen reference. These references allow to navigate inside the dataset and
perform visual queries. We explore here the case of scientific publications
based on Arxiv searches.Comment: Extension of the hypergraph framework shortly presented in
arXiv:1809.00164 (possible small overlaps); use the theoretical framework of
hb-graphs presented in arXiv:1809.0019
A recursive paradigm for aligning observed behavior of large structured process models
The alignment of observed and modeled behavior is a crucial problem in process mining, since it opens the door for conformance checking and enhancement of process models. The state of the art techniques for the computation of alignments rely on a full exploration of the combination of the model state space and the observed behavior (an event log), which hampers their applicability for large instances. This paper presents a fresh view to the alignment problem: the computation of alignments is casted as the resolution of Integer Linear Programming models, where the user can decide the granularity of the alignment steps. Moreover, a novel recursive strategy is used to split
the problem into small pieces, exponentially reducing the complexity of the ILP models to be solved. The contributions of this paper represent a promising alternative to fight the inherent complexity of computing alignments for large instances.Peer ReviewedPostprint (author's final draft
- …