Search CORE

7,561 research outputs found

Constant Factor Approximation for Balanced Cut in the PIE model

Author: Bilu Yonatan
Bilu Yonatan
Condon Anne
Dimitriou Tassos
Newman Mark
Publication venue
Publication date: 21/06/2014
Field of study

We propose and study a new semi-random semi-adversarial model for Balanced Cut, a planted model with permutation-invariant random edges (PIE). Our model is much more general than planted models considered previously. Consider a set of vertices V partitioned into two clusters

L

and

R

of equal size. Let

G

be an arbitrary graph on

V

with no edges between

L

and

R

. Let

E_{random}

be a set of edges sampled from an arbitrary permutation-invariant distribution (a distribution that is invariant under permutation of vertices in

L

and in

R

). Then we say that

G + E_{random}

is a graph with permutation-invariant random edges. We present an approximation algorithm for the Balanced Cut problem that finds a balanced cut of cost

O(|E_{random}|) + n \text{polylog}(n)

in this model. In the regime when

|E_{random}| = \Omega(n \text{polylog}(n))

, this is a constant factor approximation with respect to the cost of the planted cut.Comment: Full version of the paper at the 46th ACM Symposium on the Theory of Computing (STOC 2014). 32 page

arXiv.org e-Print Archive

Crossref

Correction. Brownian models of open processing networks: canonical representation of workload

Author: Harrison J. Michael
Publication venue: 'Institute of Mathematical Statistics'
Publication date: 11/10/2006
Field of study

Due to a printing error the above mentioned article [Annals of Applied Probability 10 (2000) 75--103, doi:10.1214/aoap/1019737665] had numerous equations appearing incorrectly in the print version of this paper. The entire article follows as it should have appeared. IMS apologizes to the author and the readers for this error. A recent paper by Harrison and Van Mieghem explained in general mathematical terms how one forms an ``equivalent workload formulation'' of a Brownian network model. Denoting by

Z(t)

the state vector of the original Brownian network, one has a lower dimensional state descriptor

W(t)=MZ(t)

in the equivalent workload formulation, where

M

can be chosen as any basis matrix for a particular linear space. This paper considers Brownian models for a very general class of open processing networks, and in that context develops a more extensive interpretation of the equivalent workload formulation, thus extending earlier work by Laws on alternate routing problems. A linear program called the static planning problem is introduced to articulate the notion of ``heavy traffic'' for a general open network, and the dual of that linear program is used to define a canonical choice of the basis matrix

M

. To be specific, rows of the canonical

M

are alternative basic optimal solutions of the dual linear program. If the network data satisfy a natural monotonicity condition, the canonical matrix

M

is shown to be nonnegative, and another natural condition is identified which ensures that

M

admits a factorization related to the notion of resource pooling.Comment: Published at http://dx.doi.org/10.1214/105051606000000583 in the Annals of Applied Probability (http://www.imstat.org/aap/) by the Institute of Mathematical Statistics (http://www.imstat.org

arXiv.org e-Print Archive

Crossref

Early stopping for statistical inverse problems via truncated SVD estimation

Author: Blanchard Gilles
Hoffmann Marc
Reiß Markus
Publication venue
Publication date: 01/01/2018
Field of study

We consider truncated SVD (or spectral cut-off, projection) estimators for a prototypical statistical inverse problem in dimension

D

. Since calculating the singular value decomposition (SVD) only for the largest singular values is much less costly than the full SVD, our aim is to select a data-driven truncation level

\widehat m\in\{1,\ldots,D\}

only based on the knowledge of the first

\widehat m

singular values and vectors. We analyse in detail whether sequential {\it early stopping} rules of this type can preserve statistical optimality. Information-constrained lower bounds and matching upper bounds for a residual based stopping rule are provided, which give a clear picture in which situation optimal sequential adaptation is feasible. Finally, a hybrid two-step approach is proposed which allows for classical oracle inequalities while considerably reducing numerical complexity.Comment: slightly modified version. arXiv admin note: text overlap with arXiv:1606.0770

arXiv.org e-Print Archive

HAL Descartes

Three Puzzles on Mathematics, Computation, and Games

Author: Kalai Gil
Publication venue
Publication date: 08/01/2018
Field of study

In this lecture I will talk about three mathematical puzzles involving mathematics and computation that have preoccupied me over the years. The first puzzle is to understand the amazing success of the simplex algorithm for linear programming. The second puzzle is about errors made when votes are counted during elections. The third puzzle is: are quantum computers possible?Comment: ICM 2018 plenary lecture, Rio de Janeiro, 36 pages, 7 Figure

arXiv.org e-Print Archive

Crossref

Consistency Thresholds for the Planted Bisection Model

Author: Mossel Elchanan
Neeman Joe
Sly Allan
Publication venue: 'Institute of Mathematical Statistics'
Publication date: 25/11/2019
Field of study

The planted bisection model is a random graph model in which the nodes are divided into two equal-sized communities and then edges are added randomly in a way that depends on the community membership. We establish necessary and sufficient conditions for the asymptotic recoverability of the planted bisection in this model. When the bisection is asymptotically recoverable, we give an efficient algorithm that successfully recovers it. We also show that the planted bisection is recoverable asymptotically if and only if with high probability every node belongs to the same community as the majority of its neighbors. Our algorithm for finding the planted bisection runs in time almost linear in the number of edges. It has three stages: spectral clustering to compute an initial guess, a "replica" stage to get almost every vertex correct, and then some simple local moves to finish the job. An independent work by Abbe, Bandeira, and Hall establishes similar (slightly weaker) results but only in the case of logarithmic average degree.Comment: latest version contains an erratum, addressing an error pointed out by Jan van Waai

arXiv.org e-Print Archive

The Australian National University