12 research outputs found
Tropical sufficient statistics for persistent homology
We show that an embedding in Euclidean space based on tropical geometry generates stable sufficient statistics for barcodes. In topological data analysis, barcodes are multiscale summaries of algebraic topological characteristics that capture the “shape” of data; however, in practice, they have complex structures that make them difficult to use in statistical settings. The sufficiency result presented in this work allows for classical probability distributions to be assumed on the tropical geometric representation of barcodes. This makes a variety of parametric statistical inference methods accessible to barcodes, all while maintaining their initial interpretations. More specifically, we show that exponential family distributions may be assumed and that likelihood functions for persistent homology may be constructed. We conceptually demonstrate sufficiency and illustrate its utility in persistent homology dimensions 0 and 1 with concrete parametric applications to human immunodeficiency virus and avian influenza data
The persistence landscape and some of its properties
Persistence landscapes map persistence diagrams into a function space, which
may often be taken to be a Banach space or even a Hilbert space. In the latter
case, it is a feature map and there is an associated kernel. The main advantage
of this summary is that it allows one to apply tools from statistics and
machine learning. Furthermore, the mapping from persistence diagrams to
persistence landscapes is stable and invertible. We introduce a weighted
version of the persistence landscape and define a one-parameter family of
Poisson-weighted persistence landscape kernels that may be useful for learning.
We also demonstrate some additional properties of the persistence landscape.
First, the persistence landscape may be viewed as a tropical rational function.
Second, in many cases it is possible to exactly reconstruct all of the
component persistence diagrams from an average persistence landscape. It
follows that the persistence landscape kernel is characteristic for certain
generic empirical measures. Finally, the persistence landscape distance may be
arbitrarily small compared to the interleaving distance.Comment: 18 pages, to appear in the Proceedings of the 2018 Abel Symposiu
Recommended from our members
Parametrized homology via zigzag persistence
This paper introduces parametrized homology, a continuous-parameter generalization of levelset zigzag persistent homology that captures the behavior of the homology of the fibers of a real-valued function on a topological space. This information is encoded as a “barcode” of real intervals, each corresponding to a homological feature supported over that interval; or, equivalently, as a persistence diagram. Points in the persistence diagram are classified algebraically into four classes; geometrically, the classes identify the distinct ways in which homological features perish at the boundaries of their interval of persistence. We study the conditions under which spaces fibered over the real line have a well-defined parametrized homology; we establish the stability of these invariants and we show how the four classes of persistence diagram correspond to the four diagrams that appear in the theory of extended persistence