Search CORE

73 research outputs found

The Arboricity Captures the Complexity of Sampling Edges

Author: Eden T.
Ron D.
Rosenbaum W.
Publication venue
Publication date: 01/01/2019
Field of study

In this paper, we revisit the problem of sampling edges in an unknown graph

G = (V, E)

from a distribution that is (pointwise) almost uniform over

E

. We consider the case where there is some a priori upper bound on the arboriciy of

G

. Given query access to a graph

G

over

n

vertices and of average degree

d

and arboricity at most

\alpha

, we design an algorithm that performs

O\!\left(\frac{\alpha}{d} \cdot \frac{\log^3 n}{\varepsilon}\right)

queries in expectation and returns an edge in the graph such that every edge

e \in E

is sampled with probability

(1 \pm \varepsilon)/m

. The algorithm performs two types of queries: degree queries and neighbor queries. We show that the upper bound is tight (up to poly-logarithmic factors and the dependence in

\varepsilon

), as

\Omega\!\left(\frac{\alpha}{d} \right)

queries are necessary for the easier task of sampling edges from any distribution over

E

that is close to uniform in total variational distance. We also prove that even if

G

is a tree (i.e.,

\alpha = 1

so that

\frac{\alpha}{d}=\Theta(1)

\Omega\left(\frac{\log n}{\log\log n}\right)

queries are necessary to sample an edge from any distribution that is pointwise close to uniform, thus establishing that a

\mathrm{poly}(\log n)

factor is necessary for constant

\alpha

. Finally we show how our algorithm can be applied to obtain a new result on approximately counting subgraphs, based on the recent work of Assadi, Kapralov, and Khanna (ITCS, 2019)

MPG.PuRe

The Arboricity Captures the Complexity of Sampling Edges

Author: Eden Talya
Ron Dana
Rosenbaum Will
Publication venue: LIPIcs - Leibniz International Proceedings in Informatics. 46th International Colloquium on Automata, Languages, and Programming (ICALP 2019)
Publication date: 01/01/2019
Field of study

In this paper, we revisit the problem of sampling edges in an unknown graph G = (V, E) from a distribution that is (pointwise) almost uniform over E. We consider the case where there is some a priori upper bound on the arboriciy of G. Given query access to a graph G over n vertices and of average degree {d} and arboricity at most alpha, we design an algorithm that performs O(alpha/d * {log^3 n}/epsilon) queries in expectation and returns an edge in the graph such that every edge e in E is sampled with probability (1 +/- epsilon)/m. The algorithm performs two types of queries: degree queries and neighbor queries. We show that the upper bound is tight (up to poly-logarithmic factors and the dependence in epsilon), as Omega(alpha/d) queries are necessary for the easier task of sampling edges from any distribution over E that is close to uniform in total variational distance. We also prove that even if G is a tree (i.e., alpha = 1 so that alpha/d = Theta(1)), Omega({log n}/{loglog n}) queries are necessary to sample an edge from any distribution that is pointwise close to uniform, thus establishing that a poly(log n) factor is necessary for constant alpha. Finally we show how our algorithm can be applied to obtain a new result on approximately counting subgraphs, based on the recent work of Assadi, Kapralov, and Khanna (ITCS, 2019)

arXiv.org e-Print Archive

Dagstuhl Research Online Publication Server

MPG.PuRe

Testing Triangle Freeness in the General Model in Graphs with Arboricity O(?n)

Author: Levi Reut
Publication venue: LIPIcs - Leibniz International Proceedings in Informatics. 48th International Colloquium on Automata, Languages, and Programming (ICALP 2021)
Publication date: 01/01/2021
Field of study

We study the problem of testing triangle freeness in the general graph model. This problem was first studied in the general graph model by Alon et al. (SIAM J. Discret. Math. 2008) who provided both lower bounds and upper bounds that depend on the number of vertices and the average degree of the graph. Their bounds are tight only when d_max = O(d) and ?{d} ? ?n or when ?{d} = ?(1), where d_max denotes the maximum degree and ?{d} denotes the average degree of the graph. In this paper we provide bounds that depend on the arboricity of the graph and the average degree. As in Alon et al., the parameters of our tester is the number of vertices, n, the number of edges, m, and the proximity parameter ? (the arboricity of the graph is not a parameter of the algorithm). The query complexity of our tester is O?(?/ ?{d} + ?)? poly(1/?) on expectation, where ? denotes the arboricity of the input graph (we use O?(?) to suppress O(log log n) factors). We show that for graphs with arboricity O(?n) this upper bound is tight in the following sense. For any ? ? [s] where s = ?(?n) there exists a family of graphs with arboricity ? and average degree ?{d} such that ?(?/ ?{d} + ?) queries are required for testing triangle freeness on this family of graphs. Moreover, this lower bound holds for any such ? and for a large range of feasible average degrees

Dagstuhl Research Online Publication Server

Sampling Multiple Edges Efficiently

Author: Eden Talya
Mossel Saleet
Rubinfeld Ronitt
Publication venue: LIPIcs - Leibniz International Proceedings in Informatics. Approximation, Randomization, and Combinatorial Optimization. Algorithms and Techniques (APPROX/RANDOM 2021)
Publication date: 01/01/2021
Field of study

We present a sublinear time algorithm that allows one to sample multiple edges from a distribution that is pointwise ?-close to the uniform distribution, in an amortized-efficient fashion. We consider the adjacency list query model, where access to a graph G is given via degree and neighbor queries. The problem of sampling a single edge in this model has been raised by Eden and Rosenbaum (SOSA 18). Let n and m denote the number of vertices and edges of G, respectively. Eden and Rosenbaum provided upper and lower bounds of ?^*(n/? m) for sampling a single edge in general graphs (where O^*(?) suppresses poly(1/?) and poly(log n) dependencies). We ask whether the query complexity lower bound for sampling a single edge can be circumvented when multiple samples are required. That is, can we get an improved amortized per-sample cost if we allow a preprocessing phase? We answer in the affirmative. We present an algorithm that, if one knows the number of required samples q in advance, has an overall cost that is sublinear in q, namely, O^*(? q ?(n/? m)), which is strictly preferable to O^*(q? (n/? m)) cost resulting from q invocations of the algorithm by Eden and Rosenbaum. Subsequent to a preliminary version of this work, T?tek and Thorup (arXiv, preprint) proved that this bound is essentially optimal

arXiv.org e-Print Archive

Dagstuhl Research Online Publication Server

Towards a Decomposition-Optimal Algorithm for Counting and Sampling Arbitrary Motifs in Sublinear Time

Author: Biswas Amartya Shankha
Eden Talya
Rubinfeld Ronitt
Publication venue: LIPIcs - Leibniz International Proceedings in Informatics. Approximation, Randomization, and Combinatorial Optimization. Algorithms and Techniques (APPROX/RANDOM 2021)
Publication date: 01/01/2021
Field of study

We consider the problem of sampling and approximately counting an arbitrary given motif H in a graph G, where access to G is given via queries: degree, neighbor, and pair, as well as uniform edge sample queries. Previous algorithms for these tasks were based on a decomposition of H into a collection of odd cycles and stars, denoted D^*(H) = {O_{k?},...,O_{k_q}, S_{p?},...,S_{p_?}}. These algorithms were shown to be optimal for the case where H is a clique or an odd-length cycle, but no other lower bounds were known. We present a new algorithm for sampling arbitrary motifs which, up to poly(log n) factors, is always at least as good, and for most graphs G is strictly better. The main ingredient leading to this improvement is an improved uniform algorithm for sampling stars, which might be of independent interest, as it allows to sample vertices according to the p-th moment of the degree distribution. Finally, we prove that this algorithm is decomposition-optimal for decompositions that contain at least one odd cycle. These are the first lower bounds for motifs H with a nontrivial decomposition, i.e., motifs that have more than a single component in their decomposition

arXiv.org e-Print Archive

Dagstuhl Research Online Publication Server

Algorithms to Approximate Column-Sparse Packing Problems

Author: Brubach Brian
Sankararaman Karthik Abinav
Srinivasan Aravind
Xu Pan
Publication venue
Publication date: 05/08/2019
Field of study

Column-sparse packing problems arise in several contexts in both deterministic and stochastic discrete optimization. We present two unifying ideas, (non-uniform) attenuation and multiple-chance algorithms, to obtain improved approximation algorithms for some well-known families of such problems. As three main examples, we attain the integrality gap, up to lower-order terms, for known LP relaxations for k-column sparse packing integer programs (Bansal et al., Theory of Computing, 2012) and stochastic k-set packing (Bansal et al., Algorithmica, 2012), and go "half the remaining distance" to optimal for a major integrality-gap conjecture of Furedi, Kahn and Seymour on hypergraph matching (Combinatorica, 1993).Comment: Extended abstract appeared in SODA 2018. Full version in ACM Transactions of Algorithm

arXiv.org e-Print Archive

Crossref