Search CORE

23 research outputs found

Towards a better approximation for sparsest cut?

Author: Arora Sanjeev
Ge Rong
Sinop Ali Kemal
Publication venue
Publication date: 01/01/2013
Field of study

We give a new

(1+\epsilon)

-approximation for sparsest cut problem on graphs where small sets expand significantly more than the sparsest cut (sets of size

n/r

expand by a factor

\sqrt{\log n\log r}

bigger, for some small

r

; this condition holds for many natural graph families). We give two different algorithms. One involves Guruswami-Sinop rounding on the level-

r

Lasserre relaxation. The other is combinatorial and involves a new notion called {\em Small Set Expander Flows} (inspired by the {\em expander flows} of ARV) which we show exists in the input graph. Both algorithms run in time

2^{O(r)} \mathrm{poly}(n)

. We also show similar approximation algorithms in graphs with genus

g

with an analogous local expansion condition. This is the first algorithm we know of that achieves

(1+\epsilon)

-approximation on such general family of graphs

arXiv.org e-Print Archive

CiteSeerX

Princeton University Open Access Repository

On non-linear network embedding methods

Author: Le Huong Yen
Publication venue: Digital Commons @ NJIT
Publication date: 31/08/2021
Field of study

As a linear method, spectral clustering is the only network embedding algorithm that offers both a provably fast computation and an advanced theoretical understanding. The accuracy of spectral clustering depends on the Cheeger ratio defined as the ratio between the graph conductance and the 2nd smallest eigenvalue of its normalizedLaplacian. In several graph families whose Cheeger ratio reaches its upper bound of Theta(n), the approximation power of spectral clustering is proven to perform poorly. Moreover, recent non-linear network embedding methods have surpassed spectral clustering by state-of-the-art performance with little to no theoretical understanding to back them. The dissertation includes work that: (1) extends the theory of spectral clustering in order to address its weakness and provide ground for a theoretical understanding of existing non-linear network embedding methods.; (2) provides non-linear extensions of spectral clustering with theoretical guarantees, e.g., via different spectral modification algorithms; (3) demonstrates the potentials of this approach on different types and sizes of graphs from industrial applications; and (4)makes a theory-informed use of artificial networks

Digital Commons @ New Jersey Institute of Technology (NJIT)

Pythagorean powers of hypercubes

Author: Naor Assaf
Schechtman Gideon
Publication venue: 'Cellule MathDoc/CEDRAM'
Publication date: 27/01/2015
Field of study

For

n\in \mathbb{N}

consider the

n

-dimensional hypercube as equal to the vector space

\mathbb{F}_2^n

, where

\mathbb{F}_2

is the field of size two. Endow

\mathbb{F}_2^n

with the Hamming metric, i.e., with the metric induced by the

\ell_1^n

norm when one identifies

\mathbb{F}_2^n

with

\{0,1\}^n\subseteq \mathbb{R}^n

. Denote by

\ell_2^n(\mathbb{F}_2^n)

the

n

-fold Pythagorean product of

\mathbb{F}_2^n

, i.e., the space of all

x=(x_1,\ldots,x_n)\in \prod_{j=1}^n \mathbb{F}_2^n

, equipped with the metric

\forall\, x,y\in \prod_{j=1}^n \mathbb{F}_2^n,\qquad d_{\ell_2^n(\mathbb{F}_2^n)}(x,y)= \sqrt{ \|x_1-y_1\|_1^2+\ldots+\|x_n-y_n\|_1^2}.

It is shown here that the bi-Lipschitz distortion of any embedding of

\ell_2^n(\mathbb{F}_2^n)

into

L_1

is at least a constant multiple of

\sqrt{n}

. This is achieved through the following new bi-Lipschitz invariant, which is a metric version of (a slight variant of) a linear inequality of Kwapie{\'n} and Sch\"utt (1989). Letting

\{e_{jk}\}_{j,k\in \{1,\ldots,n\}}

denote the standard basis of the space of all

n

n

matrices

M_n(\mathbb{F}_2)

, say that a metric space

(X,d_X)

is a KS space if there exists

C=C(X)>0

such that for every

n\in 2\mathbb{N}

, every mapping

f:M_n(\mathbb{F}_2)\to X

satisfies \begin{equation*}\label{eq:metric KS abstract} \frac{1}{n}\sum_{j=1}^n\mathbb{E}\left[d_X\Big(f\Big(x+\sum_{k=1}^ne_{jk}\Big),f(x)\Big)\right]\le C \mathbb{E}\left[d_X\Big(f\Big(x+\sum_{j=1}^ne_{jk_j}\Big),f(x)\Big)\right], \end{equation*} where the expectations above are with respect to

x\in M_n(\mathbb{F}_2)

and

k=(k_1,\ldots,k_n)\in \{1,\ldots,n\}^n

chosen uniformly at random. It is shown here that

L_1

is a KS space (with

C= 2e^2/(e^2-1)

, which is best possible), implying the above nonembeddability statement. Links to the Ribe program are discussed, as well as related open problems.Comment: added section

arXiv.org e-Print Archive

Numérisation de Documents Anciens Mathématiques

Annales de l’institut Fourier (AIF)

Algorithms for partitioning well-clustered graphs

Author: Zanetti Luca
Publication venue
Publication date: 25/09/2018
Field of study

Explore Bristol Research

Recommended from our members

Essays in transportation inequalities, entropic gradient flows and mean field approximations

Author: Yeung Lane Chun Lanston
Publication venue
Publication date: 01/01/2023
Field of study

This thesis consists of four chapters. In Chapter 1, we focus on a class of transportation inequalities known as the transportation-information inequalities. These inequalities bound optimal transportation costs in terms of relative Fisher information, and are known to characterize certain concentration properties of Markov processes around their invariant measures. We provide a characterization of the quadratic transportation-information inequality in terms of a dimension-free concentration property for i.i.d. copies of the underlying Markov process, identifying the precise high-dimensional concentration property encoded by this inequality. We also illustrate how this result is an instance of a general convex-analytic tensorization principle. In Chapter 2, we study the entropic gradient flow property of McKean--Vlasov diffusions via a stochastic analysis approach. We formulate a trajectorial version of the relative entropy dissipation identity for these interacting diffusions, which describes the rate of relative entropy dissipation along every path of the diffusive motion. As a first application, we obtain a new interpretation of the gradient flow structure for the granular media equation. Secondly, we show how the trajectorial approach leads to a new derivation of the HWBI inequality. In Chapter 3, we further extend the trajectorial approach to a class of degenerate diffusion equations that includes the porous medium equation. These equations are posed on a bounded domain and are subject to no-flux boundary conditions, so that their corresponding probabilistic representations are stochastic differential equations with normal reflection on the boundary. Our stochastic analysis approach again leads to a new derivation of the Wasserstein gradient flow property for these nonlinear diffusions, as well as to a simple proof of the HWI inequality in the present context. Finally, in Chapter 4, we turn our attention to mean field approximation -- a method widely used to study the behavior of large stochastic systems of interacting particles. We propose a new approach to deriving quantitative mean field approximations for any strongly log-concave probability measure. Our framework is inspired by the recent theory of nonlinear large deviations, for which we offer an efficient non-asymptotic perspective in log-concave settings based on functional inequalities. We discuss three implications, in the contexts of continuous Gibbs measures on large graphs, high-dimensional Bayesian linear regression, and the construction of decentralized near-optimizers in high-dimensional stochastic control problems

Columbia University Academic Commons