Search CORE

7,171 research outputs found

Terminal context in context-sensitive grammars

Author: Book R. V.
Publication venue
Publication date
Field of study

Context-free language generation using nontrivial constraints on context-sensitive grammar rule

NASA Technical Reports Server

Interval Parsing Grammars for File Format Parsing

Author: Morrisett Greg
Tan Gang
Zhang Jialun
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 10/04/2023
Field of study

File formats specify how data is encoded for persistent storage. They cannot be formalized as context-free grammars since their specifications include context-sensitive patterns such as the random access pattern and the type-length-value pattern. We propose a new grammar mechanism called Interval Parsing Grammars IPGs) for file format specifications. An IPG attaches to every nonterminal/terminal an interval, which specifies the range of input the nonterminal/terminal consumes. By connecting intervals and attributes, the context-sensitive patterns in file formats can be well handled. In this paper, we formalize IPGs' syntax as well as its semantics, and its semantics naturally leads to a parser generator that generates a recursive-descent parser from an IPG. In general, IPGs are declarative, modular, and enable termination checking. We have used IPGs to specify a number of file formats including ZIP, ELF, GIF, PE, and part of PDF; we have also evaluated the performance of the generated parsers.Comment: To appear on PLDI'2

arXiv.org e-Print Archive

On external presentations of infinite graphs

Author: Morvan Christophe
Publication venue: 'Open Publishing Association'
Publication date: 01/01/2009
Field of study

The vertices of a finite state system are usually a subset of the natural numbers. Most algorithms relative to these systems only use this fact to select vertices. For infinite state systems, however, the situation is different: in particular, for such systems having a finite description, each state of the system is a configuration of some machine. Then most algorithmic approaches rely on the structure of these configurations. Such characterisations are said internal. In order to apply algorithms detecting a structural property (like identifying connected components) one may have first to transform the system in order to fit the description needed for the algorithm. The problem of internal characterisation is that it hides structural properties, and each solution becomes ad hoc relatively to the form of the configurations. On the contrary, external characterisations avoid explicit naming of the vertices. Such characterisation are mostly defined via graph transformations. In this paper we present two kind of external characterisations: deterministic graph rewriting, which in turn characterise regular graphs, deterministic context-free languages, and rational graphs. Inverse substitution from a generator (like the complete binary tree) provides characterisation for prefix-recognizable graphs, the Caucal Hierarchy and rational graphs. We illustrate how these characterisation provide an efficient tool for the representation of infinite state systems

arXiv.org e-Print Archive

CiteSeerX

INRIA a CCSD electronic archive server

Directory of Open Access Journals

Calibrating Generative Models: The Probabilistic Chomsky-Schützenberger Hierarchy

Author: Icard Thomas
Publication venue
Publication date: 01/01/2020
Field of study

A probabilistic Chomsky–Schützenberger hierarchy of grammars is introduced and studied, with the aim of understanding the expressive power of generative models. We offer characterizations of the distributions definable at each level of the hierarchy, including probabilistic regular, context-free, (linear) indexed, context-sensitive, and unrestricted grammars, each corresponding to familiar probabilistic machine classes. Special attention is given to distributions on (unary notations for) positive integers. Unlike in the classical case where the "semi-linear" languages all collapse into the regular languages, using analytic tools adapted from the classical setting we show there is no collapse in the probabilistic hierarchy: more distributions become definable at each level. We also address related issues such as closure under probabilistic conditioning

PhilPapers

Graph Grammars, Insertion Lie Algebras, and Quantum Field Theory

Author: Marcolli Matilde
Port Alexander
Publication venue
Publication date: 26/02/2015
Field of study

Graph grammars extend the theory of formal languages in order to model distributed parallelism in theoretical computer science. We show here that to certain classes of context-free and context-sensitive graph grammars one can associate a Lie algebra, whose structure is reminiscent of the insertion Lie algebras of quantum field theory. We also show that the Feynman graphs of quantum field theories are graph languages generated by a theory dependent graph grammar.Comment: 19 pages, LaTeX, 3 jpeg figure

arXiv.org e-Print Archive

CiteSeerX

Caltech Authors

Implicit learning of recursive context-free grammars

Author: Johan J. Bolhuis
Martin Rohrmeier
Qiufang Fu
Zoltan Dienes
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/01/2012
Field of study

Context-free grammars are fundamental for the description of linguistic syntax. However, most artificial grammar learning experiments have explored learning of simpler finite-state grammars, while studies exploring context-free grammars have not assessed awareness and implicitness. This paper explores the implicit learning of context-free grammars employing features of hierarchical organization, recursive embedding and long-distance dependencies. The grammars also featured the distinction between left- and right-branching structures, as well as between centre- and tail-embedding, both distinctions found in natural languages. People acquired unconscious knowledge of relations between grammatical classes even for dependencies over long distances, in ways that went beyond learning simpler relations (e.g. n-grams) between individual words. The structural distinctions drawn from linguistics also proved important as performance was greater for tail-embedding than centre-embedding structures. The results suggest the plausibility of implicit learning of complex context-free structures, which model some features of natural languages. They support the relevance of artificial grammar learning for probing mechanisms of language learning and challenge existing theories and computational models of implicit learning

CiteSeerX

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

Institute of Psychology,Chinese Academy Of Sciences

PubMed Central

Sussex Research Online

FigShare

Tightening the Complexity of Equivalence Problems for Commutative Grammars

Author: Haase Christoph
Hofman Piotr
Publication venue
Publication date: 25/06/2015
Field of study

We show that the language equivalence problem for regular and context-free commutative grammars is coNEXP-complete. In addition, our lower bound immediately yields further coNEXP-completeness results for equivalence problems for communication-free Petri nets and reversal-bounded counter automata. Moreover, we improve both lower and upper bounds for language equivalence for exponent-sensitive commutative grammars.Comment: 21 page

arXiv.org e-Print Archive

CiteSeerX

Dagstuhl Research Online Publication Server