Search CORE

89 research outputs found

Recognizing well-parenthesized expressions in the streaming model

Author: Magniez F.
Mathieu C.
Nayak A.
Publication venue
Publication date: 17/11/2009
Field of study

Motivated by a concrete problem and with the goal of understanding the sense in which the complexity of streaming algorithms is related to the complexity of formal languages, we investigate the problem Dyck(s) of checking matching parentheses, with

s

different types of parenthesis. We present a one-pass randomized streaming algorithm for Dyck(2) with space \Order(\sqrt{n}\log n), time per letter \polylog (n), and one-sided error. We prove that this one-pass algorithm is optimal, up to a \polylog n factor, even when two-sided error is allowed. For the lower bound, we prove a direct sum result on hard instances by following the "information cost" approach, but with a few twists. Indeed, we play a subtle game between public and private coins. This mixture between public and private coins results from a balancing act between the direct sum result and a combinatorial lower bound for the base case. Surprisingly, the space requirement shrinks drastically if we have access to the input stream in reverse. We present a two-pass randomized streaming algorithm for Dyck(2) with space \Order((\log n)^2), time \polylog (n) and one-sided error, where the second pass is in the reverse direction. Both algorithms can be extended to Dyck(s) since this problem is reducible to Dyck(2) for a suitable notion of reduction in the streaming model.Comment: 20 pages, 5 figure

arXiv.org e-Print Archive

CiteSeerX

Crossref

Streaming Complexity of Checking Priority Queues

Author: Francois Nathanael
Publication venue: LIPIcs - Leibniz International Proceedings in Informatics. 30th International Symposium on Theoretical Aspects of Computer Science (STACS 2013)
Publication date: 01/01/2013
Field of study

This work is in the line of designing efficient checkers for testing the reliability of some massive data structures. Given a sequential access to the insert/extract operations on such a structure, one would like to decide, a posteriori only, if it corresponds to the evolution of a reliable structure. In a context of massive data, one would like to minimize both the amount of reliable memory of the checker and the number of passes on the sequence of operations. Chu, Kannan and McGregor (M. Chu, S. Kannan, and A. McGregor, 2007) initiated the study of checking priority queues in this setting. They showed that the use of timestamps allows to check a priority queue with a single pass and memory space tilde{Order}(sqrt{N}). Later, Chakrabarti, Cormode, Kondapally and McGregor (A. Chakrabarti, G. Cormode, R. Kondapally, and A. McGregor, 2010) removed the use of timestamps, and proved that more passes do not help. We show that, even in the presence of timestamps, more passes do not help, solving an open problem of (M. Chu, S. Kannan, and A. McGregor, 2007; A. Chakrabarti, G. Cormode, R. Kondapally, and A. McGregor). On the other hand, we show that a second pass, but in reverse direction shrinks the memory space to tilde{Order}((log N)^2), extending a phenomenon the first time observed by Magniez, Mathieu and Nayak (F. Magniez, C. Mathieu, and A. Nayak, 2010) for checking well-parenthesized expressions

Dagstuhl Research Online Publication Server

Streaming Property Testing of Visibly Pushdown Languages

Author: de Rougemont Michel
François Nathanaël
Magniez Frédéric
Serre Olivier
Publication venue
Publication date: 03/11/2015
Field of study

In the context of language recognition, we demonstrate the superiority of streaming property testers against streaming algorithms and property testers, when they are not combined. Initiated by Feigenbaum et al., a streaming property tester is a streaming algorithm recognizing a language under the property testing approximation: it must distinguish inputs of the language from those that are

\varepsilon

-far from it, while using the smallest possible memory (rather than limiting its number of input queries). Our main result is a streaming

\varepsilon

-property tester for visibly pushdown languages (VPL) with one-sided error using memory space

\mathrm{poly}((\log n) / \varepsilon)

. This constructions relies on a (non-streaming) property tester for weighted regular languages based on a previous tester by Alon et al. We provide a simple application of this tester for streaming testing special cases of instances of VPL that are already hard for both streaming algorithms and property testers. Our main algorithm is a combination of an original simulation of visibly pushdown automata using a stack with small height but possible items of linear size. In a second step, those items are replaced by small sketches. Those sketches relies on a notion of suffix-sampling we introduce. This sampling is the key idea connecting our streaming tester algorithm to property testers.Comment: 23 pages. Major modifications in the presentatio

arXiv.org e-Print Archive

HAL Descartes

Hal-Diderot

Streaming algorithms for language recognition problems

Author: Babu Ajesh
Limaye Nutan
Radhakrishnan Jaikumar
Varma Girish
Publication venue
Publication date: 01/01/2011
Field of study

We study the complexity of the following problems in the streaming model. Membership testing for \DLIN We show that every language in \DLIN\ can be recognised by a randomized one-pass

O(\log n)

space algorithm with inverse polynomial one-sided error, and by a deterministic p-pass

O(n/p)

space algorithm. We show that these algorithms are optimal. Membership testing for \LL

(k)

For languages generated by \LL

(k)

grammars with a bound of

r

on the number of nonterminals at any stage in the left-most derivation, we show that membership can be tested by a randomized one-pass

O(r\log n)

space algorithm with inverse polynomial (in

n

) one-sided error. Membership testing for \DCFL We show that randomized algorithms as efficient as the ones described above for \DLIN\ and \LL(k) (which are subclasses of \DCFL) cannot exist for all of \DCFL: there is a language in \VPL\ (a subclass of \DCFL) for which any randomized p-pass algorithm with error bounded by

\epsilon < 1/2

must use

\Omega(n/p)

space. Degree sequence problem We study the problem of determining, given a sequence

d_1, d_2,..., d_n

and a graph

G

, whether the degree sequence of

G

is precisely

d_1, d_2,..., d_n

. We give a randomized one-pass

O(\log n)

space algorithm with inverse polynomial one-sided error probability. We show that our algorithms are optimal. Our randomized algorithms are based on the recent work of Magniez et al. \cite{MMN09}; our lower bounds are obtained by considering related communication complexity problems

arXiv.org e-Print Archive

Improved bounds for testing Dyck languages

Author: Fischer Eldar
Magniez Frédéric
Starikovskaya Tatiana
Publication venue
Publication date: 20/07/2017
Field of study

In this paper we consider the problem of deciding membership in Dyck languages, a fundamental family of context-free languages, comprised of well-balanced strings of parentheses. In this problem we are given a string of length

n

in the alphabet of parentheses of

m

types and must decide if it is well-balanced. We consider this problem in the property testing setting, where one would like to make the decision while querying as few characters of the input as possible. Property testing of strings for Dyck language membership for

m=1

, with a number of queries independent of the input size

n

, was provided in [Alon, Krivelevich, Newman and Szegedy, SICOMP 2001]. Property testing of strings for Dyck language membership for

m \ge 2

was first investigated in [Parnas, Ron and Rubinfeld, RSA 2003]. They showed an upper bound and a lower bound for distinguishing strings belonging to the language from strings that are far (in terms of the Hamming distance) from the language, which are respectively (up to polylogarithmic factors) the

2/3

power and the

1/11

power of the input size

n

. Here we improve the power of

n

in both bounds. For the upper bound, we introduce a recursion technique, that together with a refinement of the methods in the original work provides a test for any power of

n

larger than

2/5

. For the lower bound, we introduce a new problem called Truestring Equivalence, which is easily reducible to the

2

-type Dyck language property testing problem. For this new problem, we show a lower bound of

n

to the power of

1/5

arXiv.org e-Print Archive

Hal-Diderot

Information Cost Tradeoffs for Augmented Index and Streaming Language Recognition

Author: Chakrabarti Amit
Cormode Graham
Kondapally Ranganath
McGregor Andrew
Publication venue
Publication date: 01/01/2010
Field of study

This paper makes three main contributions to the theory of communication complexity and stream computation. First, we present new bounds on the information complexity of AUGMENTED-INDEX. In contrast to analogous results for INDEX by Jain, Radhakrishnan and Sen [J. ACM, 2009], we have to overcome the significant technical challenge that protocols for AUGMENTED-INDEX may violate the "rectangle property" due to the inherent input sharing. Second, we use these bounds to resolve an open problem of Magniez, Mathieu and Nayak [STOC, 2010] that asked about the multi-pass complexity of recognizing Dyck languages. This results in a natural separation between the standard multi-pass model and the multi-pass model that permits reverse passes. Third, we present the first passive memory checkers that verify the interaction transcripts of priority queues, stacks, and double-ended queues. We obtain tight upper and lower bounds for these problems, thereby addressing an important sub-class of the memory checking framework of Blum et al. [Algorithmica, 1994]

arXiv.org e-Print Archive

CiteSeerX

ScholarWorks@UMass Amherst

Dartmouth Digital Commons (Dartmouth College)