40,533 research outputs found
On mining complex sequential data by means of FCA and pattern structures
Nowadays data sets are available in very complex and heterogeneous ways.
Mining of such data collections is essential to support many real-world
applications ranging from healthcare to marketing. In this work, we focus on
the analysis of "complex" sequential data by means of interesting sequential
patterns. We approach the problem using the elegant mathematical framework of
Formal Concept Analysis (FCA) and its extension based on "pattern structures".
Pattern structures are used for mining complex data (such as sequences or
graphs) and are based on a subsumption operation, which in our case is defined
with respect to the partial order on sequences. We show how pattern structures
along with projections (i.e., a data reduction of sequential structures), are
able to enumerate more meaningful patterns and increase the computing
efficiency of the approach. Finally, we show the applicability of the presented
method for discovering and analyzing interesting patient patterns from a French
healthcare data set on cancer. The quantitative and qualitative results (with
annotations and analysis from a physician) are reported in this use case which
is the main motivation for this work.
Keywords: data mining; formal concept analysis; pattern structures;
projections; sequences; sequential data.Comment: An accepted publication in International Journal of General Systems.
The paper is created in the wake of the conference on Concept Lattice and
their Applications (CLA'2013). 27 pages, 9 figures, 3 table
A conceptual approach to gene expression analysis enhanced by visual analytics
The analysis of gene expression data is a complex task for biologists wishing to understand the role of genes in the formation of diseases such as cancer. Biologists need greater support when trying to discover, and comprehend, new relationships within their data. In this paper, we describe an approach to the analysis of gene expression data where overlapping groupings are generated by Formal Concept Analysis and interactively analyzed in a tool called CUBIST. The CUBIST workflow involves querying a semantic database and converting the result into a formal context, which can be simplified to make it manageable, before it is visualized as a concept lattice and associated charts
How Quantum Computers Fail: Quantum Codes, Correlations in Physical Systems, and Noise Accumulation
The feasibility of computationally superior quantum computers is one of the
most exciting and clear-cut scientific questions of our time. The question
touches on fundamental issues regarding probability, physics, and
computability, as well as on exciting problems in experimental physics,
engineering, computer science, and mathematics. We propose three related
directions towards a negative answer. The first is a conjecture about physical
realizations of quantum codes, the second has to do with correlations in
stochastic physical systems, and the third proposes a model for quantum
evolutions when noise accumulates. The paper is dedicated to the memory of
Itamar Pitowsky.Comment: 16 page
Complexity and Information: Measuring Emergence, Self-organization, and Homeostasis at Multiple Scales
Concepts used in the scientific study of complex systems have become so
widespread that their use and abuse has led to ambiguity and confusion in their
meaning. In this paper we use information theory to provide abstract and
concise measures of complexity, emergence, self-organization, and homeostasis.
The purpose is to clarify the meaning of these concepts with the aid of the
proposed formal measures. In a simplified version of the measures (focusing on
the information produced by a system), emergence becomes the opposite of
self-organization, while complexity represents their balance. Homeostasis can
be seen as a measure of the stability of the system. We use computational
experiments on random Boolean networks and elementary cellular automata to
illustrate our measures at multiple scales.Comment: 42 pages, 11 figures, 2 table
- …