126 research outputs found
Searching for Realizations of Finite Metric Spaces in Tight Spans
An important problem that commonly arises in areas such as internet
traffic-flow analysis, phylogenetics and electrical circuit design, is to find
a representation of any given metric on a finite set by an edge-weighted
graph, such that the total edge length of the graph is minimum over all such
graphs. Such a graph is called an optimal realization and finding such
realizations is known to be NP-hard. Recently Varone presented a heuristic
greedy algorithm for computing optimal realizations. Here we present an
alternative heuristic that exploits the relationship between realizations of
the metric and its so-called tight span . The tight span is a
canonical polytopal complex that can be associated to , and our approach
explores parts of for realizations in a way that is similar to the
classical simplex algorithm. We also provide computational results illustrating
the performance of our approach for different types of metrics, including
-distances and two-decomposable metrics for which it is provably possible
to find optimal realizations in their tight spans.Comment: 20 pages, 3 figure
Characterizing Block Graphs in Terms of their Vertex-Induced Partitions
Given a finite connected simple graph with vertex set and edge
set , we will show that
the (necessarily unique) smallest block graph with vertex set whose
edge set contains is uniquely determined by the -indexed family of the various partitions
of the set into the set of connected components of the
graph ,
the edge set of this block graph coincides with set of all -subsets
of for which and are, for all , contained
in the same connected component of ,
and an arbitrary -indexed family of
partitions of the set is of the form for some
connected simple graph with vertex set as above if and only if,
for any two distinct elements , the union of the set in
that contains and the set in that contains coincides with
the set , and holds for all .
As well as being of inherent interest to the theory of block graphs, these
facts are also useful in the analysis of compatible decompositions and block
realizations of finite metric spaces
Ramified rectilinear polygons: coordinatization by dendrons
Simple rectilinear polygons (i.e. rectilinear polygons without holes or
cutpoints) can be regarded as finite rectangular cell complexes coordinatized
by two finite dendrons. The intrinsic -metric is thus inherited from the
product of the two finite dendrons via an isometric embedding. The rectangular
cell complexes that share this same embedding property are called ramified
rectilinear polygons. The links of vertices in these cell complexes may be
arbitrary bipartite graphs, in contrast to simple rectilinear polygons where
the links of points are either 4-cycles or paths of length at most 3. Ramified
rectilinear polygons are particular instances of rectangular complexes obtained
from cube-free median graphs, or equivalently simply connected rectangular
complexes with triangle-free links. The underlying graphs of finite ramified
rectilinear polygons can be recognized among graphs in linear time by a
Lexicographic Breadth-First-Search. Whereas the symmetry of a simple
rectilinear polygon is very restricted (with automorphism group being a
subgroup of the dihedral group ), ramified rectilinear polygons are
universal: every finite group is the automorphism group of some ramified
rectilinear polygon.Comment: 27 pages, 6 figure
mixtools: An R Package for Analyzing Mixture Models
The mixtools package for R provides a set of functions for analyzing a variety of finite mixture models. These functions include both traditional methods, such as EM algorithms for univariate and multivariate normal mixtures, and newer methods that reflect some recent research in finite mixture models. In the latter category, mixtools provides algorithms for estimating parameters in a wide range of different mixture-of-regression contexts, in multinomial mixtures such as those arising from discretizing continuous multivariate data, in nonparametric situations where the multivariate component densities are completely unspecified, and in semiparametric situations such as a univariate location mixture of symmetric but otherwise unspecified densities. Many of the algorithms of the mixtools package are EM algorithms or are based on EM-like ideas, so this article includes an overview of EM algorithms for finite mixture models.
Recognizing and realizing cactus metrics
The problem of realizing finite metric spaces in terms of weighted graphs has many applications. For example, the mathematical and computational properties of metrics that can be realized by trees have been well-studied and such research has laid the foundation of the reconstruction of phylogenetic trees from evolutionary distances. However, as trees may be too restrictive to accurately represent real-world data or phenomena, it is important to understand the relationship between more general graphs and distances. In this paper, we introduce a new type of metric called a cactus metric, that is, a metric that can be realized by a cactus graph. We show that, just as with tree metrics, a cactus metric has a unique optimal realization. In addition, we describe an algorithm that can recognize whether or not a metric is a cactus metric and, if so, compute its optimal realization in time, where is the number of points in the space
On partitioning multivariate self-affine time series
Given a multivariate time series, possibly of high dimension, with unknown and time-varying joint distribution, it is of interest to be able to completely partition the time series into disjoint, contiguous subseries, each of which has different distributional or pattern attributes from the preceding and succeeding subseries. An additional feature of many time series is that they display self-affinity, so that subseries at one time scale are similar to subseries at another after application of an affine transformation. Such qualities are observed in time series from many disciplines, including biology, medicine, economics, finance, and computer science. This paper defines the relevant multiobjective combinatorial optimization problem with limited assumptions as a biobjective one, and a specialized evolutionary algorithm is presented which finds optimal self-affine time series partitionings with a minimum of choice parameters. The algorithm not only finds partitionings for all possible numbers of partitions given data constraints, but also for self-affinities between these partitionings and some fine-grained partitioning. The resulting set of Pareto-efficient solution sets provides a rich representation of the self-affine properties of a multivariate time series at different locations and time scales
On partitioning multivariate self-affine time series
Given a multivariate time series, possibly of high dimension, with unknown and time-varying joint distribution, it is of interest to be able to completely partition the time series into disjoint, contiguous subseries, each of which has different distributional or pattern attributes from the preceding and succeeding subseries. An additional feature of many time series is that they display self-affinity, so that subseries at one time scale are similar to subseries at another after application of an affine transformation. Such qualities are observed in time series from many disciplines, including biology, medicine, economics, finance, and computer science. This paper defines the relevant multiobjective combinatorial optimization problem with limited assumptions as a biobjective one, and a specialized evolutionary algorithm is presented which finds optimal self-affine time series partitionings with a minimum of choice parameters. The algorithm not only finds partitionings for all possible numbers of partitions given data constraints, but also for self-affinities between these partitionings and some fine-grained partitioning. The resulting set of Pareto-efficient solution sets provides a rich representation of the self-affine properties of a multivariate time series at different locations and time scales
Trees, Tight-Spans and Point Configuration
Tight-spans of metrics were first introduced by Isbell in 1964 and
rediscovered and studied by others, most notably by Dress, who gave them this
name. Subsequently, it was found that tight-spans could be defined for more
general maps, such as directed metrics and distances, and more recently for
diversities. In this paper, we show that all of these tight-spans as well as
some related constructions can be defined in terms of point configurations.
This provides a useful way in which to study these objects in a unified and
systematic way. We also show that by using point configurations we can recover
results concerning one-dimensional tight-spans for all of the maps we consider,
as well as extend these and other results to more general maps such as
symmetric and unsymmetric maps.Comment: 21 pages, 2 figure
- …