    Testing Small Set Expansion in General Graphs

    We consider the problem of testing small set expansion for general graphs. A graph $G$ is a $(k,\phi)$-expander if every subset of volume at most $k$ has conductance at least $\phi$. Small set expansion has recently received significant attention due to its close connections to the unique games conjecture, local graph partitioning algorithms, and locally testable codes. We give testers with two-sided error and one-sided error in the adjacency list model, which allows degree and neighbor queries to the oracle of the input graph. The testers take as input an $n$-vertex graph $G$, a volume bound $k$, an expansion bound $\phi$, and a distance parameter $\varepsilon>0$. With probability at least $2/3$, the two-sided error tester accepts the graph if it is a $(k,\phi)$-expander and rejects the graph if it is $\varepsilon$-far from any $(k^*,\phi^*)$-expander, where $k^*=\Theta(k\varepsilon)$ and $\phi^*=\Theta(\frac{\phi^4}{\min\{\log(4m/k),\log n\}\cdot(\ln k)})$. The query complexity and running time of this tester are $\widetilde{O}(\sqrt{m}\phi^{-4}\varepsilon^{-2})$, where $m$ is the number of edges of the graph. The one-sided error tester accepts every $(k,\phi)$-expander and, with probability at least $2/3$, rejects every graph that is $\varepsilon$-far from any $(k^*,\phi^*)$-expander, where $k^*=O(k^{1-\xi})$ and $\phi^*=O(\xi\phi^2)$ for any $0<\xi<1$. The query complexity and running time of this tester are $\widetilde{O}(\sqrt{\frac{n}{\varepsilon^3}}+\frac{k}{\varepsilon \phi^4})$. We also give a two-sided error tester with a smaller gap between $\phi^*$ and $\phi$ in the rotation map model, which allows (neighbor, index) queries and degree queries. Comment: 23 pages; STACS 201
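
    To make the $(k,\phi)$-expander definition concrete, the following is a minimal brute-force sketch, not the sublinear tester described in the abstract. It assumes a simple undirected graph stored as adjacency lists, takes the volume of a set to be the sum of its vertex degrees, and takes the conductance of $S$ to be the number of edges leaving $S$ divided by the volume of $S$. That conductance normalization is an assumption, since the abstract does not spell it out, and the function names are hypothetical.

```python
from itertools import combinations


def volume(adj, subset):
    """Sum of degrees of the vertices in `subset`."""
    return sum(len(adj[v]) for v in subset)


def conductance(adj, subset):
    """Edges leaving `subset` divided by its volume (assumed normalization)."""
    inside = set(subset)
    cut = sum(1 for u in inside for v in adj[u] if v not in inside)
    return cut / volume(adj, inside)


def is_small_set_expander(adj, k, phi):
    """Exhaustively check that every nonempty set of volume <= k has conductance >= phi.

    Exponential in the number of vertices; intended only for tiny examples.
    """
    vertices = list(adj)
    for size in range(1, len(vertices) + 1):
        for subset in combinations(vertices, size):
            vol = volume(adj, subset)
            if 0 < vol <= k and conductance(adj, subset) < phi:
                return False
    return True


# Two triangles {0,1,2} and {3,4,5} joined by the single edge (2, 3).
two_triangles = {
    0: [1, 2],
    1: [0, 2],
    2: [0, 1, 3],
    3: [2, 4, 5],
    4: [3, 5],
    5: [3, 4],
}
```

    On this example graph, only single vertices have volume at most 3, so the graph passes with `k=3`, while with `k=7` the triangle `{0, 1, 2}` (volume 7, one outgoing edge) is a low-conductance witness.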

    Bidirectional PageRank Estimation: From Average-Case to Worst-Case

    We present a new algorithm for estimating the Personalized PageRank (PPR) between a source and target node on undirected graphs, with sublinear running-time guarantees over the worst-case choice of source and target nodes. Our work builds on a recent line of work on bidirectional estimators for PPR, which obtained sublinear running-time guarantees but in an average-case sense, for a uniformly random choice of target node. Crucially, we show how the reversibility of random walks on undirected networks can be exploited to convert average-case to worst-case guarantees. While past bidirectional methods combine forward random walks with reverse local pushes, our algorithm combines forward local pushes with reverse random walks. We also discuss how to modify our methods to estimate random-walk probabilities for any length distribution, thereby obtaining fast algorithms for estimating general graph diffusions, including the heat kernel, on undirected networks. Comment: Workshop on Algorithms and Models for the Web-Graph (WAW) 201
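
    The reversibility property mentioned above can be illustrated numerically. The sketch below is not the bidirectional estimator from the abstract. It computes PPR by plain power iteration, assuming the common definition in which the walk teleports back to the source with probability `alpha` at each step, and checks the standard identity $d_s \pi_s(t) = d_t \pi_t(s)$ on a small undirected graph. The function name, the value of `alpha`, and the example graph are illustrative assumptions.

```python
def personalized_pagerank(adj, source, alpha=0.2, iters=200):
    """Approximate the PPR vector of `source` by power iteration.

    Assumes an undirected graph as adjacency lists with no isolated vertices,
    and a teleport probability `alpha` (an illustrative choice).
    """
    p = {v: 0.0 for v in adj}
    p[source] = 1.0
    for _ in range(iters):
        nxt = {v: 0.0 for v in adj}
        nxt[source] += alpha  # teleport mass returns to the source
        for u, mass in p.items():
            share = (1 - alpha) * mass / len(adj[u])
            for v in adj[u]:  # remaining mass takes one random-walk step
                nxt[v] += share
        p = nxt
    return p


# Small undirected graph: a triangle {0, 1, 2} with a pendant vertex 3 attached to 0.
graph = {0: [1, 2, 3], 1: [0, 2], 2: [0, 1], 3: [0]}

s, t = 0, 3
pi_s = personalized_pagerank(graph, s)
pi_t = personalized_pagerank(graph, t)

# Reversibility: degree-weighted PPR values agree in both directions.
forward = len(graph[s]) * pi_s[t]
backward = len(graph[t]) * pi_t[s]
```

    Here `forward` and `backward` agree up to floating-point error, which is what lets a quantity defined from the source side be estimated by walks started at the target.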

    AMS Without 4-Wise Independence on Product Domains

    In their seminal work, Alon, Matias, and Szegedy introduced several sketching techniques, including showing that 4-wise independence is sufficient to obtain good approximations of the second frequency moment. In this work, we show that their sketching technique can be extended to product domains $[n]^k$ by using the product of 4-wise independent functions on $[n]$. Our work extends that of Indyk and McGregor, who showed the result for $k = 2$. Their primary motivation was the problem of identifying correlations in data streams. In their model, a stream of pairs $(i,j) \in [n]^2$ arrives, giving a joint distribution $(X,Y)$, and they find approximation algorithms for how close the joint distribution is to the product of the marginal distributions under various metrics, which naturally corresponds to how close $X$ and $Y$ are to being independent. By using our technique, we obtain a new result for the problem of approximating the $\ell_2$ distance between the joint distribution and the product of the marginal distributions for $k$-ary vectors, instead of just pairs, in a single pass. Our analysis gives a randomized algorithm that is a $(1 \pm \epsilon)$ approximation (with probability $1-\delta$) that requires space logarithmic in $n$ and $m$ and proportional to $3^k$.
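
    The product-of-hashes idea can be sketched as follows, under several assumptions that are not from the abstract. Each sign function is a random degree-3 polynomial over a prime field reduced to $\pm 1$ by its low bit, which is only approximately 4-wise independent and unbiased. The sketch estimates the second frequency moment of a stream of $k$-tuples with a median of means, and the `rows` and `cols` parameters are arbitrary rather than chosen from the $3^k$ bound. All class and function names are hypothetical.

```python
import random
import statistics

PRIME = 2_147_483_647  # Mersenne prime 2^31 - 1, used as the field size


def make_sign_hash(rng):
    """Return a +/-1 valued hash from a random degree-3 polynomial mod PRIME.

    Taking the low bit of a value in an odd-sized field is slightly biased,
    so this is only an approximation of a 4-wise independent sign function.
    """
    coeffs = [rng.randrange(PRIME) for _ in range(4)]

    def sign(x):
        acc = 0
        for c in coeffs:  # Horner evaluation of the polynomial
            acc = (acc * x + c) % PRIME
        return 1 if acc & 1 else -1

    return sign


class ProductAMSSketch:
    """AMS-style F2 sketch over [n]^k using a product of per-coordinate sign hashes."""

    def __init__(self, k, rows=5, cols=32, seed=0):
        rng = random.Random(seed)
        # One independent sign hash per coordinate, per counter.
        self.hashes = [
            [[make_sign_hash(rng) for _ in range(k)] for _ in range(cols)]
            for _ in range(rows)
        ]
        self.counters = [[0] * cols for _ in range(rows)]

    def update(self, item, count=1):
        """Add `count` occurrences of the k-tuple `item` to every counter."""
        for r, row in enumerate(self.hashes):
            for c, coord_hashes in enumerate(row):
                sign = 1
                for h, coord in zip(coord_hashes, item):
                    sign *= h(coord)  # product of the k coordinate signs
                self.counters[r][c] += sign * count

    def estimate_f2(self):
        """Median over rows of the mean squared counter in each row."""
        means = [sum(z * z for z in row) / len(row) for row in self.counters]
        return statistics.median(means)


# A stream containing one distinct triple seven times has F2 = 7^2.
sketch = ProductAMSSketch(k=3)
for _ in range(7):
    sketch.update((1, 2, 3))
single_item_estimate = sketch.estimate_f2()
```

    With a single distinct item every counter equals $\pm 7$, so the estimate is exact here. For streams with many distinct tuples the estimate is randomized, and its accuracy depends on the number of counters.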