
    Densest Subgraph in Dynamic Graph Streams

    In this paper, we consider the problem of approximating the densest subgraph in the dynamic graph stream model. In this model of computation, the input graph is defined by an arbitrary sequence of edge insertions and deletions, and the goal is to analyze properties of the resulting graph given memory that is sub-linear in the size of the stream. We present a single-pass algorithm that returns a $(1+\epsilon)$ approximation of the maximum density with high probability; the algorithm uses $O(\epsilon^{-2}\, n\, \mathrm{polylog}(n))$ space, processes each stream update in $\mathrm{polylog}(n)$ time, and uses $\mathrm{poly}(n)$ post-processing time, where $n$ is the number of nodes. The space used by our algorithm matches the lower bound of Bahmani et al. (PVLDB 2012) up to a poly-logarithmic factor for constant $\epsilon$. The best existing results for this problem were established recently by Bhattacharya et al. (STOC 2015). They presented a $(2+\epsilon)$ approximation algorithm using similar space and another algorithm that both processed each update and maintained a $(4+\epsilon)$ approximation of the current maximum density in $\mathrm{polylog}(n)$ time per update. Comment: To appear in MFCS 2015.
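
    The quantity being approximated above is the maximum subgraph density $\rho(G) = \max_{S \subseteq V} |E(S)|/|S|$. As a point of reference only, not the paper's $(1+\epsilon)$ streaming algorithm, the following minimal Python sketch evaluates that objective offline via the classic greedy peeling procedure (Charikar 2000), which gives a 2-approximation; the function name and toy input are illustrative.

```python
from collections import defaultdict

def greedy_densest_subgraph(edges):
    """Greedy peeling (Charikar 2000): repeatedly remove a minimum-degree
    vertex and remember the densest intermediate subgraph. This gives a
    2-approximation of max_{S} |E(S)| / |S|, the quantity the streaming
    algorithm above approximates to within a (1 + eps) factor."""
    adj = defaultdict(set)
    for u, v in edges:
        adj[u].add(v)
        adj[v].add(u)
    alive = set(adj)
    m = sum(len(nbrs) for nbrs in adj.values()) // 2   # number of edges
    best_density, best_size = 0.0, 0
    while alive:
        density = m / len(alive)
        if density > best_density:
            best_density, best_size = density, len(alive)
        u = min(alive, key=lambda x: len(adj[x]))      # minimum-degree vertex
        for w in adj[u]:
            adj[w].discard(u)
        m -= len(adj[u])
        alive.discard(u)
    return best_density, best_size

# toy usage: a triangle with a pendant edge attached has maximum density 1.0
print(greedy_densest_subgraph([(1, 2), (2, 3), (1, 3), (3, 4)]))
```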

    Online Row Sampling

    Finding a small spectral approximation for a tall $n \times d$ matrix $A$ is a fundamental numerical primitive. For a number of reasons, one often seeks an approximation whose rows are sampled from those of $A$. Row sampling improves interpretability, saves space when $A$ is sparse, and preserves row structure, which is especially important, for example, when $A$ represents a graph. However, correctly sampling rows from $A$ can be costly when the matrix is large and cannot be stored and processed in memory. Hence, a number of recent publications focus on row sampling in the streaming setting, using little more space than what is required to store the outputted approximation [KL13, KLM+14]. Inspired by a growing body of work on online algorithms for machine learning and data analysis, we extend this work to a more restrictive online setting: we read rows of $A$ one by one and immediately decide whether each row should be kept in the spectral approximation or discarded, without ever retracting these decisions. We present an extremely simple algorithm that approximates $A$ up to multiplicative error $\epsilon$ and additive error $\delta$ using $O(d \log d \log(\epsilon\|A\|_2/\delta)/\epsilon^2)$ online samples, with memory overhead proportional to the cost of storing the spectral approximation. We also present an algorithm that uses $O(d^2)$ memory but only requires $O(d \log(\epsilon\|A\|_2/\delta)/\epsilon^2)$ samples, which we show is optimal. Our methods are clean and intuitive, allow for lower memory usage than prior work, and expose new theoretical properties of leverage score based matrix approximation.
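
    For intuition, the $O(d^2)$-memory regime can be illustrated by the standard online ridge leverage-score sampling recipe: maintain a $d \times d$ Gram matrix of the rows kept so far, score each arriving row against it, and keep the row with probability proportional to that score. The Python sketch below is a hedged illustration of that recipe; the oversampling constant `c`, the exact score definition, and the sampling probabilities are simplifying assumptions rather than the paper's precise parameters.

```python
import numpy as np

def online_row_sample(rows, eps=0.5, delta=1e-3, c=8.0, seed=0):
    """Online ridge leverage-score row sampling (illustrative sketch).

    Reads rows one by one; for each row a computes an online ridge leverage
    score  tau = a^T (B^T B + delta * I)^{-1} a  against the sample B kept so
    far, keeps the row with probability p = min(1, c * tau / eps^2), and
    rescales kept rows by 1/sqrt(p) so that B^T B tracks A^T A in
    expectation. Uses O(d^2) memory for the Gram matrix."""
    rng = np.random.default_rng(seed)
    d = len(rows[0])
    gram = delta * np.eye(d)          # regularized Gram matrix of the sample
    kept = []
    for a in rows:
        a = np.asarray(a, dtype=float)
        tau = float(a @ np.linalg.solve(gram, a))   # online ridge leverage score
        p = min(1.0, c * tau / eps**2)
        if rng.random() < p:
            a_scaled = a / np.sqrt(p)               # rescale to stay unbiased
            kept.append(a_scaled)
            gram += np.outer(a_scaled, a_scaled)
    return np.array(kept)

# usage: relative spectral error of the sample on a random tall matrix
A = np.random.default_rng(1).normal(size=(5000, 20))
B = online_row_sample(A, eps=0.5)
err = np.linalg.norm(A.T @ A - B.T @ B, 2) / np.linalg.norm(A.T @ A, 2)
print(B.shape, round(err, 3))
```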

    Space-Efficient Interior Point Method, with Applications to Linear Programming and Maximum Weight Bipartite Matching


    Expander Decomposition in Dynamic Streams

    In this paper we initiate the study of expander decompositions of a graph $G = (V, E)$ in the streaming model of computation. The goal is to find a partitioning $\mathcal{C}$ of the vertices $V$ such that the subgraphs of $G$ induced by the clusters $C \in \mathcal{C}$ are good expanders, while the number of intercluster edges is small. Expander decompositions are classically constructed by recursively applying balanced sparse cuts to the input graph. In this paper we give the first implementation of such a recursive sparsest cut process using small space in the dynamic streaming model. Our main algorithmic tool is a new type of cut sparsifier that we refer to as a power cut sparsifier - it preserves cuts in any given vertex-induced subgraph (or, any cluster in a fixed partition of $V$) to within a $(\delta, \epsilon)$-multiplicative/additive error with high probability. The power cut sparsifier uses $\tilde{O}(n/\epsilon\delta)$ space and edges, which we show is asymptotically tight up to polylogarithmic factors in $n$ for constant $\epsilon$.
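
    The classical recursion referenced above (split along a sparse cut while one exists, otherwise declare the cluster an expander) can be sketched offline with a spectral sweep cut. The Python sketch below illustrates only that recursive construction; the conductance threshold `phi` and the Fiedler-vector sweep are standard heuristics chosen for illustration and are not the paper's power cut sparsifier machinery.

```python
import numpy as np

def conductance(adj, S):
    """phi(S) = |cut(S, rest)| / min(vol(S), vol(rest)) inside the graph `adj`."""
    S = set(S)
    cut = sum(1 for u in S for v in adj[u] if v not in S)
    vol_S = sum(len(adj[u]) for u in S)
    vol_R = sum(len(adj[u]) for u in adj if u not in S)
    denom = min(vol_S, vol_R)
    if denom == 0:
        return 0.0 if cut == 0 else 1.0
    return cut / denom

def sweep_cut(adj):
    """Fiedler-vector sweep: sort vertices by the second eigenvector of the
    normalized Laplacian and return the prefix of lowest conductance."""
    nodes = list(adj)
    idx = {u: i for i, u in enumerate(nodes)}
    n = len(nodes)
    A = np.zeros((n, n))
    for u in adj:
        for v in adj[u]:
            A[idx[u], idx[v]] = 1.0
    deg = np.maximum(A.sum(axis=1), 1e-12)
    D_inv_sqrt = np.diag(1.0 / np.sqrt(deg))
    L = np.eye(n) - D_inv_sqrt @ A @ D_inv_sqrt       # normalized Laplacian
    fiedler = np.linalg.eigh(L)[1][:, 1]              # 2nd-smallest eigenvector
    order = sorted(nodes, key=lambda u: fiedler[idx[u]])
    best_S, best_phi = order[:1], 2.0
    for k in range(1, n):
        phi_k = conductance(adj, order[:k])
        if phi_k < best_phi:
            best_S, best_phi = order[:k], phi_k
    return best_S, best_phi

def expander_decompose(adj, phi=0.2):
    """Recursive sparsest-cut construction (offline sketch): if the cheapest
    sweep cut of a cluster is sparser than phi, split along it and recurse;
    otherwise report the cluster as a phi-expander."""
    if len(adj) <= 2:
        return [list(adj)]
    S, phi_S = sweep_cut(adj)
    if phi_S >= phi:
        return [list(adj)]                            # already a phi-expander
    S = set(S)
    side = {u: [v for v in adj[u] if v in S] for u in adj if u in S}
    rest = {u: [v for v in adj[u] if v not in S] for u in adj if u not in S}
    return expander_decompose(side, phi) + expander_decompose(rest, phi)

# usage: two triangles joined by one edge split into two expander clusters
adj = {0: [1, 2], 1: [0, 2], 2: [0, 1, 3], 3: [2, 4, 5], 4: [3, 5], 5: [3, 4]}
print(expander_decompose(adj, phi=0.3))
```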

    Sublinear Estimation of Weighted Matchings in Dynamic Data Streams

    This paper presents an algorithm for estimating the weight of a maximum weighted matching by augmenting any estimation routine for the size of an unweighted matching. The algorithm is implementable in any streaming model, including dynamic graph streams. We also give the first constant-factor estimation for the maximum matching size in a dynamic graph stream for planar graphs (or any graph with bounded arboricity) using $\tilde{O}(n^{4/5})$ space, which also extends to weighted matching. Using previous results by Kapralov, Khanna, and Sudan (2014), we obtain a $\mathrm{polylog}(n)$ approximation for general graphs using $\mathrm{polylog}(n)$ space in random order streams. In addition, we give a space lower bound of $\Omega(n^{1-\varepsilon})$ for any randomized algorithm estimating the size of a maximum matching up to a $1+O(\varepsilon)$ factor for adversarial streams.
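
    The reduction described above, lifting an unweighted matching-size estimator to a weighted estimate, is commonly carried out by grouping edges into geometric weight classes and charging each weight increment to the matching size among all heavier edges. The Python sketch below illustrates that general layering idea; the greedy stand-in estimator, the class base $(1+\epsilon)$, and the combination constants are illustrative assumptions, not the exact routine analyzed in the paper.

```python
import math

def greedy_matching_size(edges):
    """Crude stand-in for an unweighted matching-size estimator: greedy
    maximal matching, whose size is within a factor 2 of the maximum."""
    matched, size = set(), 0
    for u, v in edges:
        if u not in matched and v not in matched:
            matched.update((u, v))
            size += 1
    return size

def weighted_matching_estimate(weighted_edges, eps=0.5,
                               size_estimator=greedy_matching_size):
    """Layered weight-class reduction (illustrative sketch).

    Round weights down to powers of (1 + eps), write each rounded weight as a
    telescoping sum of increments eps * (1 + eps)^(i-1), and charge each
    increment to an estimate of the maximum matching size among all edges of
    weight at least (1 + eps)^i. Any unweighted matching-size estimator,
    e.g. one running in a dynamic graph stream, can be plugged in as
    `size_estimator`; the constants here are assumptions for illustration."""
    if not weighted_edges:
        return 0.0
    base = 1.0 + eps
    lo = math.floor(math.log(min(w for _, _, w in weighted_edges), base))
    hi = math.floor(math.log(max(w for _, _, w in weighted_edges), base))
    # level lo covers every edge, so it is charged the full (1 + eps)^lo
    estimate = base ** lo * size_estimator([(u, v) for u, v, _ in weighted_edges])
    for i in range(lo + 1, hi + 1):
        level = [(u, v) for u, v, w in weighted_edges if w >= base ** i]
        if level:
            estimate += eps * base ** (i - 1) * size_estimator(level)
    return estimate

# toy usage: one heavy edge (weight 10) vs. two light edges that it blocks;
# prints 7.59, against a true maximum weighted matching of 10.0
edges = [(1, 2, 10.0), (1, 3, 1.0), (2, 4, 1.0)]
print(round(weighted_matching_estimate(edges, eps=0.5), 2))
```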