5,519 research outputs found
Approximating Hereditary Discrepancy via Small Width Ellipsoids
The Discrepancy of a hypergraph is the minimum attainable value, over
two-colorings of its vertices, of the maximum absolute imbalance of any
hyperedge. The Hereditary Discrepancy of a hypergraph, defined as the maximum
discrepancy of a restriction of the hypergraph to a subset of its vertices, is
a measure of its complexity. Lovasz, Spencer and Vesztergombi (1986) related
the natural extension of this quantity to matrices to rounding algorithms for
linear programs, and gave a determinant based lower bound on the hereditary
discrepancy. Matousek (2011) showed that this bound is tight up to a
polylogarithmic factor, leaving open the question of actually computing this
bound. Recent work by Nikolov, Talwar and Zhang (2013) showed a polynomial time
-approximation to hereditary discrepancy, as a by-product
of their work in differential privacy. In this paper, we give a direct simple
-approximation algorithm for this problem. We show that up to
this approximation factor, the hereditary discrepancy of a matrix is
characterized by the optimal value of simple geometric convex program that
seeks to minimize the largest norm of any point in a ellipsoid
containing the columns of . This characterization promises to be a useful
tool in discrepancy theory
An Algorithm for Koml\'os Conjecture Matching Banaszczyk's bound
We consider the problem of finding a low discrepancy coloring for sparse set
systems where each element lies in at most t sets. We give an efficient
algorithm that finds a coloring with discrepancy O((t log n)^{1/2}), matching
the best known non-constructive bound for the problem due to Banaszczyk. The
previous algorithms only achieved an O(t^{1/2} log n) bound. The result also
extends to the more general Koml\'{o}s setting and gives an algorithmic
O(log^{1/2} n) bound
Generalised Pattern Matching Revisited
In the problem of
[STOC'94, Muthukrishnan and Palem], we are given a text of length over
an alphabet , a pattern of length over an alphabet
, and a matching relationship ,
and must return all substrings of that match (reporting) or the number
of mismatches between each substring of of length and (counting).
In this work, we improve over all previously known algorithms for this problem
for various parameters describing the input instance:
* being the maximum number of characters that match a fixed
character,
* being the number of pairs of matching characters,
* being the total number of disjoint intervals of characters
that match the characters of the pattern .
At the heart of our new deterministic upper bounds for and
lies a faster construction of superimposed codes, which solves
an open problem posed in [FOCS'97, Indyk] and can be of independent interest.
To conclude, we demonstrate first lower bounds for . We start by
showing that any deterministic or Monte Carlo algorithm for must
use time, and then proceed to show higher lower bounds
for combinatorial algorithms. These bounds show that our algorithms are almost
optimal, unless a radically new approach is developed
Towards a Constructive Version of Banaszczyk's Vector Balancing Theorem
An important theorem of Banaszczyk (Random Structures & Algorithms `98)
states that for any sequence of vectors of norm at most and any
convex body of Gaussian measure in , there exists a
signed combination of these vectors which lands inside . A major open
problem is to devise a constructive version of Banaszczyk's vector balancing
theorem, i.e. to find an efficient algorithm which constructs the signed
combination.
We make progress towards this goal along several fronts. As our first
contribution, we show an equivalence between Banaszczyk's theorem and the
existence of -subgaussian distributions over signed combinations. For the
case of symmetric convex bodies, our equivalence implies the existence of a
universal signing algorithm (i.e. independent of the body), which simply
samples from the subgaussian sign distribution and checks to see if the
associated combination lands inside the body. For asymmetric convex bodies, we
provide a novel recentering procedure, which allows us to reduce to the case
where the body is symmetric.
As our second main contribution, we show that the above framework can be
efficiently implemented when the vectors have length ,
recovering Banaszczyk's results under this stronger assumption. More precisely,
we use random walk techniques to produce the required -subgaussian
signing distributions when the vectors have length , and
use a stochastic gradient ascent method to implement the recentering procedure
for asymmetric bodies
Approximate Hypergraph Coloring under Low-discrepancy and Related Promises
A hypergraph is said to be -colorable if its vertices can be colored
with colors so that no hyperedge is monochromatic. -colorability is a
fundamental property (called Property B) of hypergraphs and is extensively
studied in combinatorics. Algorithmically, however, given a -colorable
-uniform hypergraph, it is NP-hard to find a -coloring miscoloring fewer
than a fraction of hyperedges (which is achieved by a random
-coloring), and the best algorithms to color the hypergraph properly require
colors, approaching the trivial bound of as
increases.
In this work, we study the complexity of approximate hypergraph coloring, for
both the maximization (finding a -coloring with fewest miscolored edges) and
minimization (finding a proper coloring using fewest number of colors)
versions, when the input hypergraph is promised to have the following stronger
properties than -colorability:
(A) Low-discrepancy: If the hypergraph has discrepancy ,
we give an algorithm to color the it with colors.
However, for the maximization version, we prove NP-hardness of finding a
-coloring miscoloring a smaller than (resp. )
fraction of the hyperedges when (resp. ). Assuming
the UGC, we improve the latter hardness factor to for almost
discrepancy- hypergraphs.
(B) Rainbow colorability: If the hypergraph has a -coloring such
that each hyperedge is polychromatic with all these colors, we give a
-coloring algorithm that miscolors at most of the
hyperedges when , and complement this with a matching UG
hardness result showing that when , it is hard to even beat the
bound achieved by a random coloring.Comment: Approx 201
On a generalization of iterated and randomized rounding
We give a general method for rounding linear programs that combines the
commonly used iterated rounding and randomized rounding techniques. In
particular, we show that whenever iterated rounding can be applied to a problem
with some slack, there is a randomized procedure that returns an integral
solution that satisfies the guarantees of iterated rounding and also has
concentration properties. We use this to give new results for several classic
problems where iterated rounding has been useful
On Integer Programming, Discrepancy, and Convolution
Integer programs with a constant number of constraints are solvable in
pseudo-polynomial time. We give a new algorithm with a better pseudo-polynomial
running time than previous results. Moreover, we establish a strong connection
to the problem (min, +)-convolution. (min, +)-convolution has a trivial
quadratic time algorithm and it has been conjectured that this cannot be
improved significantly. We show that further improvements to our
pseudo-polynomial algorithm for any fixed number of constraints are equivalent
to improvements for (min, +)-convolution. This is a strong evidence that our
algorithm's running time is the best possible. We also present a faster
specialized algorithm for testing feasibility of an integer program with few
constraints and for this we also give a tight lower bound, which is based on
the SETH.Comment: A preliminary version appeared in the proceedings of ITCS 201
- …