Search CORE

1,105 research outputs found

Higher-Order Operator Precedence Languages

Author: Pradella Matteo
Reghizzi Stefano Crespi
Publication venue: 'Open Publishing Association'
Publication date: 01/01/2017
Field of study

Floyd's Operator Precedence (OP) languages are a deterministic context-free family having many desirable properties. They are locally and parallely parsable, and languages having a compatible structure are closed under Boolean operations, concatenation and star; they properly include the family of Visibly Pushdown (or Input Driven) languages. OP languages are based on three relations between any two consecutive terminal symbols, which assign syntax structure to words. We extend such relations to k-tuples of consecutive terminal symbols, by using the model of strictly locally testable regular languages of order k at least 3. The new corresponding class of Higher-order Operator Precedence languages (HOP) properly includes the OP languages, and it is still included in the deterministic (also in reverse) context free family. We prove Boolean closure for each subfamily of structurally compatible HOP languages. In each subfamily, the top language is called max-language. We show that such languages are defined by a simple cancellation rule and we prove several properties, in particular that max-languages make an infinite hierarchy ordered by parameter k. HOP languages are a candidate for replacing OP languages in the various applications where they have have been successful though sometimes too restrictive.Comment: In Proceedings AFL 2017, arXiv:1708.0622

arXiv.org e-Print Archive

Archivio istituzionale della ricerca - Politecnico di Milano

Crossref

On the Hierarchy of Block Deterministic Languages

Author: A Brüggemann-Klein
D Giammarresi
EF Moore
JE Hopcroft
MO Rabin
P Caron
P Caron
S Kleene
VM Glushkov
YS Han
Publication venue
Publication date: 01/01/2015
Field of study

A regular language is

k

-lookahead deterministic (resp.

k

-block deterministic) if it is specified by a

k

-lookahead deterministic (resp.

k

-block deterministic) regular expression. These two subclasses of regular languages have been respectively introduced by Han and Wood (

k

-lookahead determinism) and by Giammarresi et al. (

k

-block determinism) as a possible extension of one-unambiguous languages defined and characterized by Br\"uggemann-Klein and Wood. In this paper, we study the hierarchy and the inclusion links of these families. We first show that each

k

-block deterministic language is the alphabetic image of some one-unambiguous language. Moreover, we show that the conversion from a minimal DFA of a

k

-block deterministic regular language to a

k

-block deterministic automaton not only requires state elimination, and that the proof given by Han and Wood of a proper hierarchy in

k

-block deterministic languages based on this result is erroneous. Despite these results, we show by giving a parameterized family that there is a proper hierarchy in

k

-block deterministic regular languages. We also prove that there is a proper hierarchy in

k

-lookahead deterministic regular languages by studying particular properties of unary regular expressions. Finally, using our valid results, we confirm that the family of

k

-block deterministic regular languages is strictly included into the one of

k

-lookahead deterministic regular languages by showing that any

k

-block deterministic unary language is one-unambiguous

arXiv.org e-Print Archive

HAL - Normandie Université

Crossref

Continuity of Functional Transducers: A Profinite Study of Rational Functions

Author: Cadilhac Michaël
Carton Olivier
Paperman Charles
Publication venue
Publication date: 01/02/2020
Field of study

A word-to-word function is continuous for a class of languages~

\mathcal{V}

if its inverse maps

\mathcal{V}

_languages to~

\mathcal{V}

. This notion provides a basis for an algebraic study of transducers, and was integral to the characterization of the sequential transducers computable in some circuit complexity classes. Here, we report on the decidability of continuity for functional transducers and some standard classes of regular languages. To this end, we develop a robust theory rooted in the standard profinite analysis of regular languages. Since previous algebraic studies of transducers have focused on the sole structure of the underlying input automaton, we also compare the two algebraic approaches. We focus on two questions: When are the automaton structure and the continuity properties related, and when does continuity propagate to superclasses

arXiv.org e-Print Archive

Episciences.org

INRIA a CCSD electronic archive server

Directory of Open Access Journals

HAL Descartes

Hal-Diderot

Learning of Structurally Unambiguous Probabilistic Grammars

Author: Fisman Dana
Nitay Dolav
Ziv-Ukelson Michal
Publication venue
Publication date: 09/03/2021
Field of study

The problem of identifying a probabilistic context free grammar has two aspects: the first is determining the grammar's topology (the rules of the grammar) and the second is estimating probabilistic weights for each rule. Given the hardness results for learning context-free grammars in general, and probabilistic grammars in particular, most of the literature has concentrated on the second problem. In this work we address the first problem. We restrict attention to structurally unambiguous weighted context-free grammars (SUWCFG) and provide a query learning algorithm for structurally unambiguous probabilistic context-free grammars (SUPCFG). We show that SUWCFG can be represented using co-linear multiplicity tree automata (CMTA), and provide a polynomial learning algorithm that learns CMTAs. We show that the learned CMTA can be converted into a probabilistic grammar, thus providing a complete algorithm for learning a structurally unambiguous probabilistic context free grammar (both the grammar topology and the probabilistic weights) using structured membership queries and structured equivalence queries. We demonstrate the usefulness of our algorithm in learning PCFGs over genomic data

arXiv.org e-Print Archive

Association for the Advancement of Artificial Intelligence: AAAI Publications

Toward a theory of input-driven locally parsable languages

Author: CRESPI REGHIZZI Stefano
Lonati Violetta
Mandrioli Dino
Pradella Matteo
Publication venue: 'Elsevier BV'
Publication date: 01/01/2017
Field of study

If a context-free language enjoys the local parsability property then, no matter how the source string is segmented, each segment can be parsed independently, and an efficient parallel parsing algorithm becomes possible. The new class of locally chain parsable languages (LCPLs), included in the deterministic context-free language family, is here defined by means of the chain-driven automaton and characterized by decidable properties of grammar derivations. Such automaton decides whether to reduce or not a substring in a way purely driven by the terminal characters, thus extending the well-known concept of input-driven (ID) alias visibly pushdown machines. The LCPL family extends and improves the practically relevant Floyd's operator-precedence (OP) languages which are known to strictly include the ID languages, and for which a parallel-parser generator exists

Archivio istituzionale della ricerca - Politecnico di Milano

Crossref

Locally Chain-Parsable Languages

Author: A Barenghi
K Mehlhorn
L Boasson
M Beaudry
MA Harrison
R Alur
R McNaughton
RW Floyd
S Crespi Reghizzi
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2015
Field of study

If a context-free language enjoys the local parsability property then, no matter how the source string is segmented, each segment can be parsed in- dependently, and an efficient parallel parsing algorithm becomes possible. The new class of locally chain-parsable languages (LCPL), included in deterministic context-free languages, is here defined by means of the chain-driven automa- ton and characterized by decidable properties of grammar derivations. Such au- tomaton decides to reduce or not a factor in a way purely driven by the terminal characters, thus extending the well-known concept of Input-Driven (ID) (visibly) pushdown machines. LCPL extend and improve the practically relevant operator- precedence languages (Floyd), which are known to strictly include the ID lan- guages, and for which a parallel-parser generator exists. Consistently with the classical results for ID, chain-compatible LCPL are closed under reversal and Boolean operations, and language inclusion is decidable

Archivio istituzionale della ricerca - Politecnico di Milano

Crossref

AIR Universita degli studi di Milano