Streaming Euclidean Max-Cut: Dimension vs Data Reduction
Max-Cut is a fundamental problem that has been studied extensively in various
settings. We design an algorithm for Euclidean Max-Cut, where the input is a
set of points in $\mathbb{R}^d$, in the model of dynamic geometric streams,
where the input is presented as a sequence of point insertions and deletions.
Previously, Frahling and Sohler [STOC 2005] designed a
$(1+\epsilon)$-approximation algorithm for the low-dimensional regime, i.e.,
it uses space $\exp(d)$.
To tackle this problem in the high-dimensional regime, which is of growing
interest, one must improve the dependence on the dimension $d$, ideally to
space complexity $\mathrm{poly}(\epsilon^{-1} d \log n)$. Lammersen,
Sidiropoulos, and Sohler [WADS 2009] proved that Euclidean Max-Cut admits
dimension reduction with target dimension $\mathrm{poly}(\epsilon^{-1})$.
Combining this with the aforementioned algorithm that uses space $\exp(d)$,
they obtain an algorithm whose overall space complexity is indeed polynomial
in $d$, but unfortunately exponential in $\epsilon^{-1}$.
We devise an alternative approach of \emph{data reduction}, based on
importance sampling, and achieve a space bound of
$\mathrm{poly}(\epsilon^{-1} d \log n)$, which is exponentially better (in
$\epsilon^{-1}$) than the dimension-reduction approach. To implement this
scheme in the streaming model, we employ a randomly-shifted quadtree to
construct a tree embedding. While this is a well-known method, a key feature
of our algorithm is that the embedding's distortion affects only the space
complexity, and the approximation ratio remains $1+\epsilon$.
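The randomly-shifted quadtree construction mentioned above can be illustrated with a toy sketch (all names and parameter choices below are ours, for illustration only, and this is not the paper's algorithm): every point is translated by one shared random shift, a hierarchy of grids with halving cell side is laid over the shifted points, and the tree-embedding distance between two points charges the diameters of all cells finer than the deepest level at which the points still share a cell.

```python
import random

def quadtree_cell(point, shift, level, max_coord):
    """Grid cell containing `point` at the given quadtree level, after
    applying the common random shift. Cell side halves with each level."""
    side = max_coord / (2 ** level)
    return tuple(int((x + s) // side) for x, s in zip(point, shift))

def tree_distance(p, q, shift, max_coord, levels):
    """Distance between p and q in the quadtree's tree embedding: twice the
    sum of cell diameters below the deepest level where p and q share a cell.
    It dominates the Euclidean distance whenever the finest level separates
    the two points."""
    d = len(p)
    def diam_sum(top):
        # Sum of cell diameters (side * sqrt(d)) over levels top..levels.
        return sum((max_coord / 2 ** l) * d ** 0.5 for l in range(top, levels + 1))
    for level in range(levels, -1, -1):  # deepest (finest) level first
        if quadtree_cell(p, shift, level, max_coord) == quadtree_cell(q, shift, level, max_coord):
            return 2 * diam_sum(level + 1)
    return 2 * diam_sum(0)  # separated even at the root level

random.seed(0)
max_coord = 1024.0  # assume input coordinates lie in [0, max_coord)^d
shift = [random.uniform(0, max_coord) for _ in range(2)]  # one offset per axis
print(tree_distance((3.0, 4.0), (10.0, 9.0), shift, max_coord, levels=10))
```

The random shift is what makes the expected tree distance comparable to the Euclidean distance; a fixed grid could separate two arbitrarily close points at every level.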
Tight Bounds for Adversarially Robust Streams and Sliding Windows via Difference Estimators
In the adversarially robust streaming model, a stream of elements is
presented to an algorithm and is allowed to depend on the output of the
algorithm at earlier times during the stream. In the classic insertion-only
model of data streams, Ben-Eliezer et al. (PODS 2020, best paper award) show
how to convert a non-robust algorithm into a robust one with a roughly
$1/\epsilon$ factor overhead. This was subsequently improved to a roughly
$1/\sqrt{\epsilon}$ factor overhead by Hassidim et al. (NeurIPS 2020, oral
presentation), suppressing logarithmic factors. For general functions the
latter is known to be best-possible, by a result of Kaplan et al. (CRYPTO
2021). We show how to bypass this impossibility result by developing data
stream algorithms for a large class of streaming problems, with no overhead in
the approximation factor. Our class of streaming problems includes the most
well-studied problems, such as the $L_p$-heavy hitters problem and
$F_p$-moment estimation, as well as empirical entropy estimation. We
substantially improve upon all prior work on these problems, giving the first
optimal dependence on the approximation factor.
As in previous work, we obtain a general transformation that applies to any
non-robust streaming algorithm and depends on the so-called flip number.
However, the key technical innovation is that we apply the transformation to
what we call a difference estimator for the streaming problem, rather than an
estimator for the streaming problem itself. We then develop the first
difference estimators for a wide range of problems. Our difference estimator
methodology is not only applicable to the adversarially robust model, but to
other streaming models where temporal properties of the data play a central
role. (Abstract shortened to meet arXiv limit.)
Comment: FOCS 2021
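The difference-estimator idea can be illustrated in its simplest instance, $F_2$ (the second frequency moment); this is a toy sketch of ours, not the paper's construction. Writing the full stream as $v = u + w$, where $u$ is a prefix and $w$ the suffix, the identity $F_2(v) - F_2(u) = F_2(w) + 2\langle u, w\rangle$ lets us estimate the difference from linear sketches that share randomness, with error scaling in the (small) suffix rather than in the entire stream:

```python
import random

random.seed(1)
n, rows = 1000, 500  # universe size and number of sketch rows (toy values)

# Shared +/-1 signs; sharing them across sketches is what lets us estimate
# the cross term <u, w> below. (AMS only needs 4-wise independence; fully
# independent signs keep this toy version simple.)
signs = [[random.choice((-1, 1)) for _ in range(n)] for _ in range(rows)]

def sketch(freq):
    """AMS-style linear sketch of a frequency vector (dict: item -> count)."""
    return [sum(s[i] * c for i, c in freq.items()) for s in signs]

def dot_estimate(sk_a, sk_b):
    """Unbiased estimate of the inner product <a, b> from the two sketches."""
    return sum(x * y for x, y in zip(sk_a, sk_b)) / rows

# Prefix u of a stream and the suffix w arriving after it; full stream v = u + w.
u = {i: random.randint(1, 10) for i in range(200)}
w = {i: random.randint(0, 2) for i in range(150, 250)}
sk_u, sk_w = sketch(u), sketch(w)

# Difference estimator: F2(v) - F2(u) = F2(w) + 2<u, w>, so the estimation
# error is governed by the small suffix w, not by all of v.
diff_est = dot_estimate(sk_w, sk_w) + 2 * dot_estimate(sk_u, sk_w)

# Exact value, for comparison.
v = dict(u)
for i, c in w.items():
    v[i] = v.get(i, 0) + c
true_diff = sum(c * c for c in v.values()) - sum(c * c for c in u.values())
print(diff_est, true_diff)
```

Because the estimator's error is proportional to the change being measured, many such estimators over short stream intervals can be maintained cheaply, which is what drives the improved overhead in the robust model.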