Search CORE

12,348 research outputs found

Heuristics Miners for Streaming Event Data

Author: Burattin Andrea
Sperduti Alessandro
van der Aalst Wil M. P.
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 27/12/2012
Field of study

More and more business activities are performed using information systems. These systems produce such huge amounts of event data that existing systems are unable to store and process them. Moreover, few processes are in steady-state and due to changing circumstances processes evolve and systems need to adapt continuously. Since conventional process discovery algorithms have been defined for batch processing, it is difficult to apply them in such evolving environments. Existing algorithms cannot cope with streaming event data and tend to generate unreliable and obsolete results. In this paper, we discuss the peculiarities of dealing with streaming event data in the context of process mining. Subsequently, we present a general framework for defining process mining algorithms in settings where it is impossible to store all events over an extended period or where processes evolve while being analyzed. We show how the Heuristics Miner, one of the most effective process discovery algorithms for practical applications, can be modified using this framework. Different stream-aware versions of the Heuristics Miner are defined and implemented in ProM. Moreover, experimental results on artificial and real logs are reported

arXiv.org e-Print Archive

CiteSeerX

Intelligent Management and Efficient Operation of Big Data

Author: Batista Fernando
Cardoso Elsa
Moura Jose
Nunes Luis
Publication venue
Publication date: 01/01/2015
Field of study

This chapter details how Big Data can be used and implemented in networking and computing infrastructures. Specifically, it addresses three main aspects: the timely extraction of relevant knowledge from heterogeneous, and very often unstructured large data sources, the enhancement on the performance of processing and networking (cloud) infrastructures that are the most important foundational pillars of Big Data applications or services, and novel ways to efficiently manage network infrastructures with high-level composed policies for supporting the transmission of large amounts of data with distinct requisites (video vs. non-video). A case study involving an intelligent management solution to route data traffic with diverse requirements in a wide area Internet Exchange Point is presented, discussed in the context of Big Data, and evaluated.Comment: In book Handbook of Research on Trends and Future Directions in Big Data and Web Intelligence, IGI Global, 201

arXiv.org e-Print Archive

Crossref

Repositório Institucional do ISCTE-IUL

Using Visualization to Support Data Mining of Large Existing Databases

Author: Keim Daniel A.
Kriegel Hans-Peter
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/1994
Field of study

In this paper. we present ideas how visualization technology can be used to improve the difficult process of querying very large databases. With our VisDB system, we try to provide visual support not only for the query specification process. but also for evaluating query results and. thereafter, refining the query accordingly. The main idea of our system is to represent as many data items as possible by the pixels of the display device. By arranging and coloring the pixels according to the relevance for the query, the user gets a visual impression of the resulting data set and of its relevance for the query. Using an interactive query interface, the user may change the query dynamically and receives immediate feedback by the visual representation of the resulting data set. By using multiple windows for different parts of the query, the user gets visual feedback for each part of the query and, therefore, may easier understand the overall result. To support complex queries, we introduce the notion of approximate joins which allow the user to find data items that only approximately fulfill join conditions. We also present ideas how our technique may be extended to support the interoperation of heterogeneous databases. Finally, we discuss the performance problems that are caused by interfacing to existing database systems and present ideas to solve these problems by using data structures supporting a multidimensional search of the database

KOPS - The Institutional Repository of the University of Konstanz

Open Access LMU

Requirements analysis for decision-support system design: evidence from the automotive industry

Author: Madenas Nikolaos
Peachey Sophie
Tiwari Ashutosh
Turner Christopher
Publication venue: Cranfield University Press
Publication date: 19/09/2013
Field of study

The purpose of this paper is to outline the requirements analysis that was carried out to support the development of a system that allows engineers to view real-time data integrated from multiple silos such as Product Lifecycle Management (PLM) and Warranty systems, in a single and visual environment. The outcome of this study provides a clear understanding of how engineers working in different phases of the product-lifecycle could utilise such information to improve the decision making process and as a result design better products. This study uses data collected via in-depth semi-structured interviews and workshops that includes people working in various roles within the automotive sector. In order to demonstrate the applicability this approach, SysML diagrams are also provided

Cranfield CERES

Intergenerational equity and conservation

Author: Otoole R. P.
Walton A. L.
Publication venue
Publication date
Field of study

The issue of integenerational equity in the use of natural resources is discussed in the context of coal mining conversion. An attempt to determine if there is a clear-cut benefit to future generations in setting minimum coal extraction efficiency standards in mining is made. It is demonstrated that preserving fossil fuels beyond the economically efficient level is not necessarily beneficial to future generations even in terms of their own preferences. Setting fossil fuel conservation targets for intermediate products (i.e. energy) may increase the quantities of fossil fuels available to future generations and hence lower the costs, but there may be serious disadvantages to future generations as well. The use of relatively inexpensive fossil fuels in this generation may result in more infrastructure development and more knowledge production available to future generations. The value of fossil fuels versus these other endowments in the future depends on many factors which cannot possibly be evaluated at present. Since there is no idea of whether future generations are being helped or harmed, it is recommended that integenerational equity not be used as a factor in setting coal mine extraction efficiency standards, or in establishing requirements

NASA Technical Reports Server