Search CORE

47 research outputs found

Recognizing well-parenthesized expressions in the streaming model

Author: Magniez F.
Mathieu C.
Nayak A.
Publication venue
Publication date: 17/11/2009
Field of study

Motivated by a concrete problem and with the goal of understanding the sense in which the complexity of streaming algorithms is related to the complexity of formal languages, we investigate the problem Dyck(s) of checking matching parentheses, with

s

different types of parenthesis. We present a one-pass randomized streaming algorithm for Dyck(2) with space \Order(\sqrt{n}\log n), time per letter \polylog (n), and one-sided error. We prove that this one-pass algorithm is optimal, up to a \polylog n factor, even when two-sided error is allowed. For the lower bound, we prove a direct sum result on hard instances by following the "information cost" approach, but with a few twists. Indeed, we play a subtle game between public and private coins. This mixture between public and private coins results from a balancing act between the direct sum result and a combinatorial lower bound for the base case. Surprisingly, the space requirement shrinks drastically if we have access to the input stream in reverse. We present a two-pass randomized streaming algorithm for Dyck(2) with space \Order((\log n)^2), time \polylog (n) and one-sided error, where the second pass is in the reverse direction. Both algorithms can be extended to Dyck(s) since this problem is reducible to Dyck(2) for a suitable notion of reduction in the streaming model.Comment: 20 pages, 5 figure

arXiv.org e-Print Archive

CiteSeerX

Streaming Complexity of Checking Priority Queues

Author: Francois Nathanael
Publication venue: LIPIcs - Leibniz International Proceedings in Informatics. 30th International Symposium on Theoretical Aspects of Computer Science (STACS 2013)
Publication date: 01/01/2013
Field of study

This work is in the line of designing efficient checkers for testing the reliability of some massive data structures. Given a sequential access to the insert/extract operations on such a structure, one would like to decide, a posteriori only, if it corresponds to the evolution of a reliable structure. In a context of massive data, one would like to minimize both the amount of reliable memory of the checker and the number of passes on the sequence of operations. Chu, Kannan and McGregor (M. Chu, S. Kannan, and A. McGregor, 2007) initiated the study of checking priority queues in this setting. They showed that the use of timestamps allows to check a priority queue with a single pass and memory space tilde{Order}(sqrt{N}). Later, Chakrabarti, Cormode, Kondapally and McGregor (A. Chakrabarti, G. Cormode, R. Kondapally, and A. McGregor, 2010) removed the use of timestamps, and proved that more passes do not help. We show that, even in the presence of timestamps, more passes do not help, solving an open problem of (M. Chu, S. Kannan, and A. McGregor, 2007; A. Chakrabarti, G. Cormode, R. Kondapally, and A. McGregor). On the other hand, we show that a second pass, but in reverse direction shrinks the memory space to tilde{Order}((log N)^2), extending a phenomenon the first time observed by Magniez, Mathieu and Nayak (F. Magniez, C. Mathieu, and A. Nayak, 2010) for checking well-parenthesized expressions

Dagstuhl Research Online Publication Server

Streaming Property Testing of Visibly Pushdown Languages

Author: de Rougemont Michel
Serre Olivier
Publication venue: LIPIcs - Leibniz International Proceedings in Informatics. 24th Annual European Symposium on Algorithms (ESA 2016)
Publication date: 01/01/2016
Field of study

In the context of formal language recognition, we demonstrate the superiority of streaming property testers against streaming algorithms and property testers, when they are not combined. Initiated by Feigenbaum et al., a streaming property tester is a streaming algorithm recognizing a language under the property testing approximation: it must distinguish inputs of the language from those that are eps-far from it, while using the smallest possible memory (rather than limiting its number of input queries). Our main result is a streaming eps-property tester for visibly pushdown languages (V_{PL}) with memory space poly(log n /epsilon). Our construction is done in three steps. First, we simulate a visibly pushdown automaton in one pass using a stack of small height but whose items can be of linear size. In a second step, those items are replaced by small sketches. Those sketches rely on a notion of suffix-sampling we introduce. This sampling is the key idea for taking benefit of both streaming algorithms and property testers in the third step. Indeed, the last step relies on a (non-streaming) property tester for weighted regular languages based on a previous tester by Alon et al. This tester can directly be used for streaming testing special cases of instances of V_{PL} that are already hard for both streaming algorithms and property testers. We then use it to decide the correctness of completed items, given their sketches, before removing them from the stack

Dagstuhl Research Online Publication Server

Incidence Geometries and the Pass Complexity of Semi-Streaming Set Cover

Author: Chakrabarti Amit
Wirth Anthony
Publication venue
Publication date: 16/07/2015
Field of study

Set cover, over a universe of size

n

, may be modelled as a data-streaming problem, where the

m

sets that comprise the instance are to be read one by one. A semi-streaming algorithm is allowed only

O(n\, \mathrm{poly}\{\log n, \log m\})

space to process this stream. For each

p \ge 1

, we give a very simple deterministic algorithm that makes

p

passes over the input stream and returns an appropriately certified

(p+1)n^{1/(p+1)}

-approximation to the optimum set cover. More importantly, we proceed to show that this approximation factor is essentially tight, by showing that a factor better than

0.99\,n^{1/(p+1)}/(p+1)^2

is unachievable for a

p

-pass semi-streaming algorithm, even allowing randomisation. In particular, this implies that achieving a

\Theta(\log n)

-approximation requires

\Omega(\log n/\log\log n)

passes, which is tight up to the

\log\log n

factor. These results extend to a relaxation of the set cover problem where we are allowed to leave an

\varepsilon

fraction of the universe uncovered: the tight bounds on the best approximation factor achievable in

p

passes turn out to be

\Theta_p(\min\{n^{1/(p+1)}, \varepsilon^{-1/p}\})

. Our lower bounds are based on a construction of a family of high-rank incidence geometries, which may be thought of as vast generalisations of affine planes. This construction, based on algebraic techniques, appears flexible enough to find other applications and is therefore interesting in its own right.Comment: 20 page

arXiv.org e-Print Archive

Quantum Chebyshev's Inequality and Applications

Author: Hamoudi Yassine
Magniez Frédéric
Publication venue
Publication date: 07/01/2019
Field of study

In this paper we provide new quantum algorithms with polynomial speed-up for a range of problems for which no such results were known, or we improve previous algorithms. First, we consider the approximation of the frequency moments

F_k

of order

k \geq 3

in the multi-pass streaming model with updates (turnstile model). We design a

P

-pass quantum streaming algorithm with memory

M

satisfying a tradeoff of

P^2 M = \tilde{O}(n^{1-2/k})

, whereas the best classical algorithm requires

P M = \Theta(n^{1-2/k})

. Then, we study the problem of estimating the number

m

of edges and the number

t

of triangles given query access to an

n

-vertex graph. We describe optimal quantum algorithms that perform

\tilde{O}(\sqrt{n}/m^{1/4})

and

\tilde{O}(\sqrt{n}/t^{1/6} + m^{3/4}/\sqrt{t})

queries respectively. This is a quadratic speed-up compared to the classical complexity of these problems. For this purpose we develop a new quantum paradigm that we call Quantum Chebyshev's inequality. Namely we demonstrate that, in a certain model of quantum sampling, one can approximate with relative error the mean of any random variable with a number of quantum samples that is linear in the ratio of the square root of the variance to the mean. Classically the dependency is quadratic. Our algorithm subsumes a previous result of Montanaro [Mon15]. This new paradigm is based on a refinement of the Amplitude Estimation algorithm of Brassard et al. [BHMT02] and of previous quantum algorithms for the mean estimation problem. We show that this speed-up is optimal, and we identify another common model of quantum sampling where it cannot be obtained. For our applications, we also adapt the variable-time amplitude amplification technique of Ambainis [Amb10] into a variable-time amplitude estimation algorithm.Comment: 27 pages; v3: better presentation, lower bound in Theorem 4.3 is ne

arXiv.org e-Print Archive

Hal-Diderot