Approximating Minimum Cost Connectivity Orientation and Augmentation
We investigate problems that combine connectivity augmentation and
orientation settings. We give a polynomial-time 6-approximation algorithm for
finding a minimum cost subgraph of an undirected graph that admits an
orientation covering a nonnegative crossing -supermodular demand function,
as defined by Frank. An important example is -edge-connectivity, a
common generalization of global and rooted edge-connectivity.
Our algorithm is based on a non-standard application of the iterative
rounding method. We observe that the standard linear program with cut
constraints is not amenable and use an alternative linear program with
partition and co-partition constraints instead. The proof requires a new type
of uncrossing technique on partitions and co-partitions.
We also consider the problem setting when the cost of an edge can be
different for the two possible orientations. The problem becomes substantially
more difficult already for the simpler requirement of -edge-connectivity.
Khanna, Naor, and Shepherd showed that the integrality gap of the natural
linear program is at most when and conjectured that it is constant
for all fixed . We disprove this conjecture by showing an
integrality gap even when
Split digraphs
We generalize the class of split graphs to the directed case and show that
these split digraphs can be identified from their degree sequences. The first
degree sequence characterization is an extension of the concept of splittance
to directed graphs, while the second characterization says a digraph is split
if and only if its degree sequence satisfies one of the Fulkerson inequalities
(which determine when an integer-pair sequence is digraphic) with equality.
Comment: 14 pages, 2 figures; Accepted author manuscript (AAM) versio
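As background for the degree-sequence characterizations mentioned above, the sketch below checks the classical undirected case only: the Hammer-Simeone condition for split graphs. It is not the digraph test from the paper. The function name `is_split_degree_sequence` is hypothetical, and the input is assumed to be a graphic degree sequence.

```python
def is_split_degree_sequence(degrees):
    """Hammer-Simeone test for UNDIRECTED split graphs.

    Assumes `degrees` is already a graphic sequence; this sketch does not
    verify that, and it does not cover the directed case.
    """
    d = sorted(degrees, reverse=True)
    if not d:
        return True  # the empty graph is trivially split
    # m is the largest 1-based index i with d_i >= i - 1.
    m = max(i for i in range(1, len(d) + 1) if d[i - 1] >= i - 1)
    # Split iff the m largest degrees account for a clique on m vertices
    # plus exactly the edges incident to the remaining vertices.
    return sum(d[:m]) == m * (m - 1) + sum(d[m:])


path_p4 = [1, 2, 2, 1]    # path on 4 vertices: split
cycle_c4 = [2, 2, 2, 2]   # 4-cycle: not split
```

The equality in the last line plays the same role as the "inequality satisfied with equality" phrasing in the abstract: the condition holds with equality exactly when the vertices separate into a clique and an independent set.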
Clustering and Community Detection with Imbalanced Clusters
Spectral clustering methods, which are frequently used in clustering and
community detection applications, are sensitive to the specific graph
construction, particularly when imbalanced clusters are present. We show that
ratio cut (RCut) or normalized cut (NCut) objectives are not tailored to
imbalanced cluster sizes since they tend to emphasize cut sizes over cut
values. We propose a graph partitioning problem that seeks minimum cut
partitions under minimum size constraints on partitions to deal with imbalanced
cluster sizes. Our approach parameterizes a family of graphs by adaptively
modulating node degrees on a fixed node set, yielding a set of parameter
dependent cuts reflecting varying levels of imbalance. The solution to our
problem is then obtained by optimizing over these parameters. We present
rigorous limit cut analysis results to justify our approach and demonstrate the
superiority of our method through experiments on synthetic and real datasets
for data clustering, semi-supervised learning and community detection.
Comment: Extended version of arXiv:1309.2303 with new applications. Accepted to IEEE TSIP
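To make the RCut and NCut objectives concrete, the sketch below evaluates both, using their standard textbook definitions, on a small weighted path graph. The function name `cut_objectives` and the example weights are assumptions made for illustration. This is not the method proposed in the abstract, and each cluster is assumed to have positive volume.

```python
def cut_objectives(weights, labels):
    """Return (rcut, ncut) for a partition of a weighted undirected graph.

    weights: symmetric adjacency matrix as a list of lists.
    labels:  cluster id for each node.
    RCut sums cut(A, complement) / |A| over clusters A;
    NCut sums cut(A, complement) / vol(A), with vol(A) the total degree in A.
    """
    n = len(weights)
    rcut = ncut = 0.0
    for c in sorted(set(labels)):
        inside = [i for i in range(n) if labels[i] == c]
        outside = [i for i in range(n) if labels[i] != c]
        cut = sum(weights[i][j] for i in inside for j in outside)
        vol = sum(weights[i][j] for i in inside for j in range(n))
        rcut += cut / len(inside)
        ncut += cut / vol
    return rcut, ncut


# Hypothetical path graph 0-1-2-3 with edge weights 1.0, 1.2, 1.0.
W = [
    [0.0, 1.0, 0.0, 0.0],
    [1.0, 0.0, 1.2, 0.0],
    [0.0, 1.2, 0.0, 1.0],
    [0.0, 0.0, 1.0, 0.0],
]
imbalanced = cut_objectives(W, [0, 1, 1, 1])  # cut value 1.0
balanced = cut_objectives(W, [0, 0, 1, 1])    # cut value 1.2
```

In this example the balanced partition has the larger cut value (1.2 against 1.0) yet scores lower on both objectives, because the size and volume denominators penalize the single-node cluster.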
Spectral Clustering with Imbalanced Data
Spectral clustering is sensitive to how graphs are constructed from data,
particularly when proximal and imbalanced clusters are present. We show that
Ratio-Cut (RCut) or normalized cut (NCut) objectives are not tailored to
imbalanced data since they tend to emphasize cut sizes over cut values. We
propose a graph partitioning problem that seeks minimum cut partitions under
minimum size constraints on partitions to deal with imbalanced data. Our
approach parameterizes a family of graphs, by adaptively modulating node
degrees on a fixed node set, to yield a set of parameter dependent cuts
reflecting varying levels of imbalance. The solution to our problem is then
obtained by optimizing over these parameters. We present rigorous limit cut
analysis results to justify our approach. We demonstrate the superiority of our
method through unsupervised and semi-supervised experiments on synthetic and
real data sets.
Comment: 24 pages, 7 figures. arXiv admin note: substantial text overlap with arXiv:1302.513
On morphological hierarchical representations for image processing and spatial data clustering
Hierarchical data representations in the context of classification and data
clustering were put forward during the fifties. Recently, hierarchical image
representations have gained renewed interest for segmentation purposes. In this
paper, we briefly survey fundamental results on hierarchical clustering and
then detail recent paradigms developed for the hierarchical representation of
images in the framework of mathematical morphology: constrained connectivity
and ultrametric watersheds. Constrained connectivity can be viewed as a way to
constrain an initial hierarchy in such a way that a set of desired constraints
are satisfied. The framework of ultrametric watersheds provides a generic scheme
for computing any hierarchical connected clustering, in particular when such a
hierarchy is constrained. The suitability of this framework for solving
practical problems is illustrated with applications in remote sensing
- …
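The link between hierarchical clusterings and ultrametrics can be illustrated with single linkage on an edge-weighted graph. This is a minimal sketch of that classical construction, not the constrained connectivity or ultrametric watershed algorithms of the paper, and the function name `single_linkage_ultrametric` is hypothetical. Two nodes are assigned the weight at which their components first merge when edges are processed in increasing order.

```python
def single_linkage_ultrametric(n, edges):
    """Single-linkage merge levels on a graph with nodes 0..n-1.

    edges: list of (weight, u, v) tuples.
    Returns {(a, b): level} for a < b, where level is the weight at which
    a and b first fall into the same connected component.
    """
    parent = list(range(n))
    members = {i: [i] for i in range(n)}

    def find(x):
        # Union-find lookup with path halving.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    dist = {}
    for w, u, v in sorted(edges):
        ru, rv = find(u), find(v)
        if ru == rv:
            continue  # edge lies inside an existing component
        # Every cross pair is merged for the first time at level w.
        for a in members[ru]:
            for b in members[rv]:
                dist[(min(a, b), max(a, b))] = w
        parent[rv] = ru
        members[ru].extend(members.pop(rv))
    return dist


# Hypothetical path 0-1-2-3 with edge weights 1, 3, 2.
levels = single_linkage_ultrametric(4, [(1, 0, 1), (3, 1, 2), (2, 2, 3)])
```

Thresholding `levels` at any value gives the connected components at that scale, and the resulting distances satisfy the ultrametric inequality d(a, c) <= max(d(a, b), d(b, c)).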