52,040 research outputs found
A geometry of information, I: Nerves, posets and differential forms
The main theme of this workshop (Dagstuhl seminar 04351) is `Spatial
Representation: Continuous vs. Discrete'. Spatial representation has two
contrasting but interacting aspects (i) representation of spaces' and (ii)
representation by spaces. In this paper, we will examine two aspects that are
common to both interpretations of the theme, namely nerve constructions and
refinement. Representations change, data changes, spaces change. We will
examine the possibility of a `differential geometry' of spatial representations
of both types, and in the sequel give an algebra of differential forms that has
the potential to handle the dynamical aspect of such a geometry. We will
discuss briefly a conjectured class of spaces, generalising the Cantor set
which would seem ideal as a test-bed for the set of tools we are developing.Comment: 28 pages. A version of this paper appears also on the Dagstuhl
seminar portal http://drops.dagstuhl.de/portals/04351
Antimatroids and Balanced Pairs
We generalize the 1/3-2/3 conjecture from partially ordered sets to
antimatroids: we conjecture that any antimatroid has a pair of elements x,y
such that x has probability between 1/3 and 2/3 of appearing earlier than y in
a uniformly random basic word of the antimatroid. We prove the conjecture for
antimatroids of convex dimension two (the antimatroid-theoretic analogue of
partial orders of width two), for antimatroids of height two, for antimatroids
with an independent element, and for the perfect elimination antimatroids and
node search antimatroids of several classes of graphs. A computer search shows
that the conjecture is true for all antimatroids with at most six elements.Comment: 16 pages, 5 figure
Variable sets over an algebra of lifetimes: a contribution of lattice theory to the study of computational topology
A topos theoretic generalisation of the category of sets allows for modelling
spaces which vary according to time intervals. Persistent homology, or more
generally, persistence is a central tool in topological data analysis, which
examines the structure of data through topology. The basic techniques have been
extended in several different directions, permuting the encoding of topological
features by so called barcodes or equivalently persistence diagrams. The set of
points of all such diagrams determines a complete Heyting algebra that can
explain aspects of the relations between persistent bars through the algebraic
properties of its underlying lattice structure. In this paper, we investigate
the topos of sheaves over such algebra, as well as discuss its construction and
potential for a generalised simplicial homology over it. In particular we are
interested in establishing a topos theoretic unifying theory for the various
flavours of persistent homology that have emerged so far, providing a global
perspective over the algebraic foundations of applied and computational
topology.Comment: 20 pages, 12 figures, AAA88 Conference proceedings at Demonstratio
Mathematica. The new version has restructured arguments, clearer intuition is
provided, and several typos correcte
On mining complex sequential data by means of FCA and pattern structures
Nowadays data sets are available in very complex and heterogeneous ways.
Mining of such data collections is essential to support many real-world
applications ranging from healthcare to marketing. In this work, we focus on
the analysis of "complex" sequential data by means of interesting sequential
patterns. We approach the problem using the elegant mathematical framework of
Formal Concept Analysis (FCA) and its extension based on "pattern structures".
Pattern structures are used for mining complex data (such as sequences or
graphs) and are based on a subsumption operation, which in our case is defined
with respect to the partial order on sequences. We show how pattern structures
along with projections (i.e., a data reduction of sequential structures), are
able to enumerate more meaningful patterns and increase the computing
efficiency of the approach. Finally, we show the applicability of the presented
method for discovering and analyzing interesting patient patterns from a French
healthcare data set on cancer. The quantitative and qualitative results (with
annotations and analysis from a physician) are reported in this use case which
is the main motivation for this work.
Keywords: data mining; formal concept analysis; pattern structures;
projections; sequences; sequential data.Comment: An accepted publication in International Journal of General Systems.
The paper is created in the wake of the conference on Concept Lattice and
their Applications (CLA'2013). 27 pages, 9 figures, 3 table
- …