Search CORE

80,870 research outputs found

Multiple Context-Free Tree Grammars: Lexicalization and Characterization

Author: Engelfriet Joost
Maletti Andreas
Maneth Sebastian
Publication venue
Publication date: 11/07/2017
Field of study

Multiple (simple) context-free tree grammars are investigated, where "simple" means "linear and nondeleting". Every multiple context-free tree grammar that is finitely ambiguous can be lexicalized; i.e., it can be transformed into an equivalent one (generating the same tree language) in which each rule of the grammar contains a lexical symbol. Due to this transformation, the rank of the nonterminals increases at most by 1, and the multiplicity (or fan-out) of the grammar increases at most by the maximal rank of the lexical symbols; in particular, the multiplicity does not increase when all lexical symbols have rank 0. Multiple context-free tree grammars have the same tree generating power as multi-component tree adjoining grammars (provided the latter can use a root-marker). Moreover, every multi-component tree adjoining grammar that is finitely ambiguous can be lexicalized. Multiple context-free tree grammars have the same string generating power as multiple context-free (string) grammars and polynomial time parsing algorithms. A tree language can be generated by a multiple context-free tree grammar if and only if it is the image of a regular tree language under a deterministic finite-copying macro tree transducer. Multiple context-free tree grammars can be used as a synchronous translation device.Comment: 78 pages, 13 figure

arXiv.org e-Print Archive

Leiden University Scholary Publications

A Generalised Quantifier Theory of Natural Language in Categorical Compositional Distributional Semantics with Bialgebras

Author: Hedges Jules
Sadrzadeh Mehrnoosh
Publication venue
Publication date: 01/06/2016
Field of study

Categorical compositional distributional semantics is a model of natural language; it combines the statistical vector space models of words with the compositional models of grammar. We formalise in this model the generalised quantifier theory of natural language, due to Barwise and Cooper. The underlying setting is a compact closed category with bialgebras. We start from a generative grammar formalisation and develop an abstract categorical compositional semantics for it, then instantiate the abstract setting to sets and relations and to finite dimensional vector spaces and linear maps. We prove the equivalence of the relational instantiation to the truth theoretic semantics of generalised quantifiers. The vector space instantiation formalises the statistical usages of words and enables us to, for the first time, reason about quantified phrases and sentences compositionally in distributional semantics

arXiv.org e-Print Archive

Queen Mary Research Online

Graph Interpolation Grammars as Context-Free Automata

Author: Larcheveque John
Publication venue
Publication date: 01/01/1998
Field of study

A derivation step in a Graph Interpolation Grammar has the effect of scanning an input token. This feature, which aims at emulating the incrementality of the natural parser, restricts the formal power of GIGs. This contrasts with the fact that the derivation mechanism involves a context-sensitive device similar to tree adjunction in TAGs. The combined effect of input-driven derivation and restricted context-sensitiveness would be conceivably unfortunate if it turned out that Graph Interpolation Languages did not subsume Context Free Languages while being partially context-sensitive. This report sets about examining relations between CFGs and GIGs, and shows that GILs are a proper superclass of CFLs. It also brings out a strong equivalence between CFGs and GIGs for the class of CFLs. Thus, it lays the basis for meaningfully investigating the amount of context-sensitiveness supported by GIGs, but leaves this investigation for further research

arXiv.org e-Print Archive

CiteSeerX

INRIA a CCSD electronic archive server

Controlled Rewriting Using Productions and Reductions

Author: Hogendorp Jan Anne
Publication venue: University of Twente, Department of Computer Science
Publication date: 01/01/1988
Field of study

We investigate context-free grammars the rules of which can be used in a productive and in a reductive fashion, while the application of these rules is controlled by a regular language. We distinguish several modes of derivation for this kind of grammar. The resulting language families (properly) extend the family of context-free languages. We establish some closure properties of these language families and some grammatical transformations which yield a few normal forms for this type of grammar. Finally, we consider some special cases (viz. the context-free grammar is linear or left-linear), and generalizations, in particular, the use of arbitrary rather than regular control languages

University of Twente Research Information

Finite Automata for the Sub- and Superword Closure of CFLs: Descriptional and Computational Complexity

Author: A Okhotin
B Courcelle
C Brabrand
H Gruber
J Esparza
J Leeuwen van
M Mohri
MF Atig
N Rampersad
N Vasudevan
P Ganty
P Habermehl
R Axelsson
S Schmitz
Y Bar-Hillel
Z Long
Publication venue
Publication date: 23/10/2014
Field of study

We answer two open questions by (Gruber, Holzer, Kutrib, 2009) on the state-complexity of representing sub- or superword closures of context-free grammars (CFGs): (1) We prove a (tight) upper bound of

2^{\mathcal{O}(n)}

on the size of nondeterministic finite automata (NFAs) representing the subword closure of a CFG of size

n

. (2) We present a family of CFGs for which the minimal deterministic finite automata representing their subword closure matches the upper-bound of

2^{2^{\mathcal{O}(n)}}

following from (1). Furthermore, we prove that the inequivalence problem for NFAs representing sub- or superword-closed languages is only NP-complete as opposed to PSPACE-complete for general NFAs. Finally, we extend our results into an approximation method to attack inequivalence problems for CFGs

arXiv.org e-Print Archive

CiteSeerX

Crossref

Descriptional Complexity of Three-Nonterminal Scattered Context Grammars: An Improvement

Author: A. Meduna
A. Meduna
A. Meduna
A. Meduna
A. Salomaa
Alexander Meduna
Bianca Truthe
D. Kol'av r
Giovanni Pighizzini
Gy. Vaszil
Jürgen Dassow
L. Rychnovsk'y
S. Greibach
T. Masopust
T. Masopust
Tomáš Masopust
V. Geffert
Publication venue: 'Open Publishing Association'
Publication date: 01/07/2009
Field of study

Recently, it has been shown that every recursively enumerable language can be generated by a scattered context grammar with no more than three nonterminals. However, in that construction, the maximal number of nonterminals simultaneously rewritten during a derivation step depends on many factors, such as the cardinality of the alphabet of the generated language and the structure of the generated language itself. This paper improves the result by showing that the maximal number of nonterminals simultaneously rewritten during any derivation step can be limited by a small constant regardless of other factors

arXiv.org e-Print Archive

Crossref

Directory of Open Access Journals

Polynomial Time Algorithms for Multi-Type Branching Processes and Stochastic Context-Free Grammars

Author: Etessami Kousha
Stewart Alistair
Yannakakis Mihalis
Publication venue
Publication date: 01/01/2012
Field of study

We show that one can approximate the least fixed point solution for a multivariate system of monotone probabilistic polynomial equations in time polynomial in both the encoding size of the system of equations and in log(1/\epsilon), where \epsilon > 0 is the desired additive error bound of the solution. (The model of computation is the standard Turing machine model.) We use this result to resolve several open problems regarding the computational complexity of computing key quantities associated with some classic and heavily studied stochastic processes, including multi-type branching processes and stochastic context-free grammars

arXiv.org e-Print Archive

Crossref

Edinburgh Research Explorer