73 research outputs found

    The Arboricity Captures the Complexity of Sampling Edges

    Get PDF
    In this paper, we revisit the problem of sampling edges in an unknown graph G=(V,E)G = (V, E) from a distribution that is (pointwise) almost uniform over EE. We consider the case where there is some a priori upper bound on the arboriciy of GG. Given query access to a graph GG over nn vertices and of average degree dd and arboricity at most α\alpha, we design an algorithm that performs O ⁣(αdlog3nε)O\!\left(\frac{\alpha}{d} \cdot \frac{\log^3 n}{\varepsilon}\right) queries in expectation and returns an edge in the graph such that every edge eEe \in E is sampled with probability (1±ε)/m(1 \pm \varepsilon)/m. The algorithm performs two types of queries: degree queries and neighbor queries. We show that the upper bound is tight (up to poly-logarithmic factors and the dependence in ε\varepsilon), as Ω ⁣(αd)\Omega\!\left(\frac{\alpha}{d} \right) queries are necessary for the easier task of sampling edges from any distribution over EE that is close to uniform in total variational distance. We also prove that even if GG is a tree (i.e., α=1\alpha = 1 so that αd=Θ(1)\frac{\alpha}{d}=\Theta(1)), Ω(lognloglogn)\Omega\left(\frac{\log n}{\log\log n}\right) queries are necessary to sample an edge from any distribution that is pointwise close to uniform, thus establishing that a poly(logn)\mathrm{poly}(\log n) factor is necessary for constant α\alpha. Finally we show how our algorithm can be applied to obtain a new result on approximately counting subgraphs, based on the recent work of Assadi, Kapralov, and Khanna (ITCS, 2019)

    The Arboricity Captures the Complexity of Sampling Edges

    Get PDF
    In this paper, we revisit the problem of sampling edges in an unknown graph G = (V, E) from a distribution that is (pointwise) almost uniform over E. We consider the case where there is some a priori upper bound on the arboriciy of G. Given query access to a graph G over n vertices and of average degree {d} and arboricity at most alpha, we design an algorithm that performs O(alpha/d * {log^3 n}/epsilon) queries in expectation and returns an edge in the graph such that every edge e in E is sampled with probability (1 +/- epsilon)/m. The algorithm performs two types of queries: degree queries and neighbor queries. We show that the upper bound is tight (up to poly-logarithmic factors and the dependence in epsilon), as Omega(alpha/d) queries are necessary for the easier task of sampling edges from any distribution over E that is close to uniform in total variational distance. We also prove that even if G is a tree (i.e., alpha = 1 so that alpha/d = Theta(1)), Omega({log n}/{loglog n}) queries are necessary to sample an edge from any distribution that is pointwise close to uniform, thus establishing that a poly(log n) factor is necessary for constant alpha. Finally we show how our algorithm can be applied to obtain a new result on approximately counting subgraphs, based on the recent work of Assadi, Kapralov, and Khanna (ITCS, 2019)

    Testing Triangle Freeness in the General Model in Graphs with Arboricity O(?n)

    Get PDF
    We study the problem of testing triangle freeness in the general graph model. This problem was first studied in the general graph model by Alon et al. (SIAM J. Discret. Math. 2008) who provided both lower bounds and upper bounds that depend on the number of vertices and the average degree of the graph. Their bounds are tight only when d_max = O(d) and ?{d} ? ?n or when ?{d} = ?(1), where d_max denotes the maximum degree and ?{d} denotes the average degree of the graph. In this paper we provide bounds that depend on the arboricity of the graph and the average degree. As in Alon et al., the parameters of our tester is the number of vertices, n, the number of edges, m, and the proximity parameter ? (the arboricity of the graph is not a parameter of the algorithm). The query complexity of our tester is O?(?/ ?{d} + ?)? poly(1/?) on expectation, where ? denotes the arboricity of the input graph (we use O?(?) to suppress O(log log n) factors). We show that for graphs with arboricity O(?n) this upper bound is tight in the following sense. For any ? ? [s] where s = ?(?n) there exists a family of graphs with arboricity ? and average degree ?{d} such that ?(?/ ?{d} + ?) queries are required for testing triangle freeness on this family of graphs. Moreover, this lower bound holds for any such ? and for a large range of feasible average degrees

    Sampling Multiple Edges Efficiently

    Get PDF
    We present a sublinear time algorithm that allows one to sample multiple edges from a distribution that is pointwise ?-close to the uniform distribution, in an amortized-efficient fashion. We consider the adjacency list query model, where access to a graph G is given via degree and neighbor queries. The problem of sampling a single edge in this model has been raised by Eden and Rosenbaum (SOSA 18). Let n and m denote the number of vertices and edges of G, respectively. Eden and Rosenbaum provided upper and lower bounds of ?^*(n/? m) for sampling a single edge in general graphs (where O^*(?) suppresses poly(1/?) and poly(log n) dependencies). We ask whether the query complexity lower bound for sampling a single edge can be circumvented when multiple samples are required. That is, can we get an improved amortized per-sample cost if we allow a preprocessing phase? We answer in the affirmative. We present an algorithm that, if one knows the number of required samples q in advance, has an overall cost that is sublinear in q, namely, O^*(? q ?(n/? m)), which is strictly preferable to O^*(q? (n/? m)) cost resulting from q invocations of the algorithm by Eden and Rosenbaum. Subsequent to a preliminary version of this work, T?tek and Thorup (arXiv, preprint) proved that this bound is essentially optimal

    Towards a Decomposition-Optimal Algorithm for Counting and Sampling Arbitrary Motifs in Sublinear Time

    Get PDF
    We consider the problem of sampling and approximately counting an arbitrary given motif H in a graph G, where access to G is given via queries: degree, neighbor, and pair, as well as uniform edge sample queries. Previous algorithms for these tasks were based on a decomposition of H into a collection of odd cycles and stars, denoted D^*(H) = {O_{k?},...,O_{k_q}, S_{p?},...,S_{p_?}}. These algorithms were shown to be optimal for the case where H is a clique or an odd-length cycle, but no other lower bounds were known. We present a new algorithm for sampling arbitrary motifs which, up to poly(log n) factors, is always at least as good, and for most graphs G is strictly better. The main ingredient leading to this improvement is an improved uniform algorithm for sampling stars, which might be of independent interest, as it allows to sample vertices according to the p-th moment of the degree distribution. Finally, we prove that this algorithm is decomposition-optimal for decompositions that contain at least one odd cycle. These are the first lower bounds for motifs H with a nontrivial decomposition, i.e., motifs that have more than a single component in their decomposition

    Algorithms to Approximate Column-Sparse Packing Problems

    Full text link
    Column-sparse packing problems arise in several contexts in both deterministic and stochastic discrete optimization. We present two unifying ideas, (non-uniform) attenuation and multiple-chance algorithms, to obtain improved approximation algorithms for some well-known families of such problems. As three main examples, we attain the integrality gap, up to lower-order terms, for known LP relaxations for k-column sparse packing integer programs (Bansal et al., Theory of Computing, 2012) and stochastic k-set packing (Bansal et al., Algorithmica, 2012), and go "half the remaining distance" to optimal for a major integrality-gap conjecture of Furedi, Kahn and Seymour on hypergraph matching (Combinatorica, 1993).Comment: Extended abstract appeared in SODA 2018. Full version in ACM Transactions of Algorithm
    corecore