Search CORE

10 research outputs found

Element Distinctness, Frequency Moments, and Sliding Windows

Author: Beame Paul
Clifford Raphael
Machmouchi Widad
Publication venue
Publication date: 14/09/2013
Field of study

We derive new time-space tradeoff lower bounds and algorithms for exactly computing statistics of input data, including frequency moments, element distinctness, and order statistics, that are simple to calculate for sorted data. We develop a randomized algorithm for the element distinctness problem whose time T and space S satisfy T in O (n^{3/2}/S^{1/2}), smaller than previous lower bounds for comparison-based algorithms, showing that element distinctness is strictly easier than sorting for randomized branching programs. This algorithm is based on a new time and space efficient algorithm for finding all collisions of a function f from a finite set to itself that are reachable by iterating f from a given set of starting points. We further show that our element distinctness algorithm can be extended at only a polylogarithmic factor cost to solve the element distinctness problem over sliding windows, where the task is to take an input of length 2n-1 and produce an output for each window of length n, giving n outputs in total. In contrast, we show a time-space tradeoff lower bound of T in Omega(n^2/S) for randomized branching programs to compute the number of distinct elements over sliding windows. The same lower bound holds for computing the low-order bit of F_0 and computing any frequency moment F_k, k neq 1. This shows that those frequency moments and the decision problem F_0 mod 2 are strictly harder than element distinctness. We complement this lower bound with a T in O(n^2/S) comparison-based deterministic RAM algorithm for exactly computing F_k over sliding windows, nearly matching both our lower bound for the sliding-window version and the comparison-based lower bounds for the single-window version. We further exhibit a quantum algorithm for F_0 over sliding windows with T in O(n^{3/2}/S^{1/2}). Finally, we consider the computations of order statistics over sliding windows.Comment: arXiv admin note: substantial text overlap with arXiv:1212.437

arXiv.org e-Print Archive

Crossref

Explore Bristol Research

Sublinear Space Algorithms for the Longest Common Substring Problem

Author: Kociumaka Tomasz
Starikovskaya Tatiana
Vildhøj Hjalte Wedel
Publication venue
Publication date: 01/01/2014
Field of study

Given

m

documents of total length

n

, we consider the problem of finding a longest string common to at least

d \geq 2

of the documents. This problem is known as the \emph{longest common substring (LCS) problem} and has a classic

O(n)

space and

O(n)

time solution (Weiner [FOCS'73], Hui [CPM'92]). However, the use of linear space is impractical in many applications. In this paper we show that for any trade-off parameter

1 \leq \tau \leq n

, the LCS problem can be solved in

O(\tau)

space and

O(n^2/\tau)

time, thus providing the first smooth deterministic time-space trade-off from constant to linear space. The result uses a new and very simple algorithm, which computes a

\tau

-additive approximation to the LCS in

O(n^2/\tau)

time and

O(1)

space. We also show a time-space trade-off lower bound for deterministic branching programs, which implies that any deterministic RAM algorithm solving the LCS problem on documents from a sufficiently large alphabet in

O(\tau)

space must use

\Omega(n\sqrt{\log(n/(\tau\log n))/\log\log(n/(\tau\log n)})

time.Comment: Accepted to 22nd European Symposium on Algorithm

arXiv.org e-Print Archive

Crossref

Online Research Database In Technology

Finding the Median (Obliviously) with Bounded Space

Author: A Borodin
A Borodin
AC-C Yao
E Okol’nishnikova
GN Frederickson
JI Munro
JI Munro
M Ajtai
M Blum
N Alon
P Beame
P Beame
P Beame
S Jukna
T Holenstein
TM Chan
Publication venue
Publication date: 01/05/2015
Field of study

We prove that any oblivious algorithm using space

S

to find the median of a list of

n

integers from

\{1,...,2n\}

requires time

\Omega(n \log\log_S n)

. This bound also applies to the problem of determining whether the median is odd or even. It is nearly optimal since Chan, following Munro and Raman, has shown that there is a (randomized) selection algorithm using only

s

registers, each of which can store an input value or

O(\log n)

-bit counter, that makes only

O(\log\log_s n)

passes over the input. The bound also implies a size lower bound for read-once branching programs computing the low order bit of the median and implies the analog of

P \ne NP \cap coNP

for length

o(n \log\log n)

oblivious branching programs

arXiv.org e-Print Archive

Crossref

Deterministic Time-Space Tradeoffs for k-SUM

Author: Lincoln Andrea
Wang Joshua R.
Williams R. Ryan
Williams Virginia Vassilevska
Publication venue
Publication date: 01/01/2016
Field of study

Given a set of numbers, the

k

-SUM problem asks for a subset of

k

numbers that sums to zero. When the numbers are integers, the time and space complexity of

k

-SUM is generally studied in the word-RAM model; when the numbers are reals, the complexity is studied in the real-RAM model, and space is measured by the number of reals held in memory at any point. We present a time and space efficient deterministic self-reduction for the

k

-SUM problem which holds for both models, and has many interesting consequences. To illustrate: *

3

-SUM is in deterministic time

O(n^2 \lg\lg(n)/\lg(n))

and space

O\left(\sqrt{\frac{n \lg(n)}{\lg\lg(n)}}\right)

. In general, any polylogarithmic-time improvement over quadratic time for

3

-SUM can be converted into an algorithm with an identical time improvement but low space complexity as well. *

3

-SUM is in deterministic time

O(n^2)

and space

O(\sqrt n)

, derandomizing an algorithm of Wang. * A popular conjecture states that 3-SUM requires

n^{2-o(1)}

time on the word-RAM. We show that the 3-SUM Conjecture is in fact equivalent to the (seemingly weaker) conjecture that every

O(n^{.51})

-space algorithm for

3

-SUM requires at least

n^{2-o(1)}

time on the word-RAM. * For

k \ge 4

k

-SUM is in deterministic

O(n^{k - 2 + 2/k})

time and

O(\sqrt{n})

space

arXiv.org e-Print Archive

Dagstuhl Research Online Publication Server

The quantum complexity of approximating the frequency moments

Author: Montanaro Ashley
Publication venue
Publication date: 01/01/2015
Field of study

The

k

'th frequency moment of a sequence of integers is defined as

F_k = \sum_j n_j^k

, where

n_j

is the number of times that

j

occurs in the sequence. Here we study the quantum complexity of approximately computing the frequency moments in two settings. In the query complexity setting, we wish to minimise the number of queries to the input used to approximate

F_k

up to relative error

\epsilon

. We give quantum algorithms which outperform the best possible classical algorithms up to quadratically. In the multiple-pass streaming setting, we see the elements of the input one at a time, and seek to minimise the amount of storage space, or passes over the data, used to approximate

F_k

. We describe quantum algorithms for

F_0

F_2

and

F_\infty

in this model which substantially outperform the best possible classical algorithms in certain parameter regimes.Comment: 22 pages; v3: essentially published versio

arXiv.org e-Print Archive

CiteSeerX

Explore Bristol Research

Randomized vs. Deterministic Separation in Time-Space Tradeoffs of Multi-Output Functions

Author: Yu Huacheng
Zhan Wei
Publication venue
Publication date: 27/06/2023
Field of study

We prove the first polynomial separation between randomized and deterministic time-space tradeoffs of multi-output functions. In particular, we present a total function that on the input of

n

elements in

[n]

, outputs

O(n)

elements, such that: (1) There exists a randomized oblivious algorithm with space

O(\log n)

, time

O(n\log n)

and one-way access to randomness, that computes the function with probability

1-O(1/n)

; (2) Any deterministic oblivious branching program with space

S

and time

T

that computes the function must satisfy

T^2S\geq\Omega(n^{2.5}/\log n)

. This implies that logspace randomized algorithms for multi-output functions cannot be black-box derandomized without an

\widetilde{\Omega}(n^{1/4})

overhead in time. Since previously all the polynomial time-space tradeoffs of multi-output functions are proved via the Borodin-Cook method, which is a probabilistic method that inherently gives the same lower bound for randomized and deterministic branching programs, our lower bound proof is intrinsically different from previous works. We also examine other natural candidates for proving such separations, and show that any polynomial separation for these problems would resolve the long-standing open problem of proving

n^{1+\Omega(1)}

time lower bound for decision problems with

\mathrm{polylog}(n)

space.Comment: 15 page

arXiv.org e-Print Archive

Faster space-efficient algorithms for Subset Sum, k -Sum, and related problems

Author: Bansal N. (Nikhil)
Garg S. (Shashwat)
Nederlof J. (Jesper)
Vyas N. (Nikhil)
Publication venue: 'Society for Industrial & Applied Mathematics (SIAM)'
Publication date: 31/05/2014
Field of study

We present randomized algorithms that solve subset sum and knapsack instances with n items in O∗ (20.86n) time, where the O∗ (∙ ) notation suppresses factors polynomial in the input size, and polynomial space, assuming random read-only access to exponentially many random bits. These results can be extended to solve binary integer programming on n variables with few constraints in a similar running time. We also show that for any constant k ≥ 2, random instances of k-Sum can be solved using O(nk -0.5polylog(n)) time and O(log n) space, without the assumption of random access to random bits.Underlying these results is an algorithm that determines whether two given lists of length n with integers bounded by a polynomial in n share a common value. Assuming random read-only access to random bits, we show that this problem can be solved using O(log n) space significantly faster than the trivial O(n2) time algorithm if no value occurs too often in the same list.</p

CWI's Institutional Repository

Substring Complexity in Sublinear Space

Author: Bernardini Giulia
Fici Gabriele
Gawrychowski Paweł
Pissis Solon P.
Publication venue
Publication date: 16/07/2020
Field of study

Shannon's entropy is a definitive lower bound for statistical compression. Unfortunately, no such clear measure exists for the compressibility of repetitive strings. Thus, ad-hoc measures are employed to estimate the repetitiveness of strings, e.g., the size

z

of the Lempel-Ziv parse or the number

r

of equal-letter runs of the Burrows-Wheeler transform. A more recent one is the size

\gamma

of a smallest string attractor. Unfortunately, Kempa and Prezza [STOC 2018] showed that computing

\gamma

is NP-hard. Kociumaka et al. [LATIN 2020] considered a new measure that is based on the function

S_T

counting the cardinalities of the sets of substrings of each length of

T

, also known as the substring complexity. This new measure is defined as

\delta= \sup\{S_T(k)/k, k\geq 1\}

and lower bounds all the measures previously considered. In particular,

\delta\leq \gamma

always holds and

\delta

can be computed in

\mathcal{O}(n)

time using

\Omega(n)

working space. Kociumaka et al. showed that if

\delta

is given, one can construct an

\mathcal{O}(\delta \log \frac{n}{\delta})

-sized representation of

T

supporting efficient direct access and efficient pattern matching queries on

T

. Given that for highly compressible strings,

\delta

is significantly smaller than

n

, it is natural to pose the following question: Can we compute

\delta

efficiently using sublinear working space? It is straightforward to show that any algorithm computing

\delta

using

\mathcal{O}(b)

space requires

\Omega(n^{2-o(1)}/b)

time through a reduction from the element distinctness problem [Yao, SIAM J. Comput. 1994]. We present the following results: an

\mathcal{O}(n^3/b^2)

-time and

\mathcal{O}(b)

-space algorithm to compute

\delta

, for any

b\in[1,n]

; and an

\tilde{\mathcal{O}}(n^2/b)

-time and

\mathcal{O}(b)

-space algorithm to compute

\delta

, for any

b\in[n^{2/3},n]

arXiv.org e-Print Archive

Archivio istituzionale della ricerca - Università di Trieste

CWI's Institutional Repository

27th Annual European Symposium on Algorithms: ESA 2019, September 9-11, 2019, Munich/Garching, Germany

Author: ESA <27. 2019, München>
Publication venue: Schloss Dagstuhl - Leibniz-Zentrum für Informatik GmbH, Dagstuhl Publishing
Publication date: 01/09/2019
Field of study

Digitale Bibliothek Thüringen